183 commits (ec60b322abfa0174da8125f46dccaf3da2c939bf)

Author SHA1 Message Commit date
  Kevin Hu 93f5df716f
Fix: order chunks from docx by positions. (#7979) 5 months ago
  Yongteng Lei bd4678bca6
Fix: Unnecessary truncation in markdown parser (#7972) 5 months ago
  Yongteng Lei 46963ab1ca
Fix: add advanced delimiter detection for naive merge (#7941) 5 months ago
  Yongteng Lei 0c562f0a9f
Refa: change citation mark as [ID:n] (#7923) 5 months ago
  Sol 0d7cfce6e1
Update rag/nlp/query.py (#7816) 5 months ago
  Stephen Hu db4371c745
Fix: Improve First Chunk Size (#7806) 5 months ago
  Emmanuel Ferdman d4a123d6dd
Fix: resolve regex library warnings (#7782) 5 months ago
  Kevin Hu 321a280031
Feat: add image preview to retrieval test. (#7610) 5 months ago
  Stephen Hu 573d46a4ef
FIX:ZeroDivisionError when using large page_size in client.retrieve() (#7595) 5 months ago
  Kevin Hu a14865e6bb
Fix: empty query issue. (#7551) 5 months ago
  Kevin Hu c7310f7fb2
Refa: similarity calculations. (#7381) 6 months ago
  Stephen Hu 1662c7eda3
Feat: Markdown add image (#7124) 6 months ago
  Yongteng Lei 67dee2d74e
Fix: fix retrieval tesing wrong pagination (#7174) 6 months ago
  alulala d9266ed65a
Fix: incorrect total chunks count in retrieval function after similarity filtering (#6741) (#6932) 6 months ago
  kaiyuan Zhang ead5f7aba9
Fix infinite recursion in RagTokenizer when processing repetitive characters (#6109) 7 months ago
  Kevin Hu 0758c04941
Refa: token similarity calculations. (#6614) 7 months ago
  Kevin Hu cc8029a732
Fix: uploading in chat box issue. (#6547) 7 months ago
  Kevin Hu ee5aa51d43
Fix: point in tag issue. (#6436) 7 months ago
  Kevin Hu a087d13ccb
Feat: text file support position retaining. (#6231) 7 months ago
  Kevin Hu 6e8d0e3177
Fix: rank feat issue. (#6225) 7 months ago
  Kevin Hu 1333d3c02a
Fix: float transfer exception. (#6197) 7 months ago
  Kevin Hu fabc5e9259
Refa: fix re-rank scope. (#6152) 7 months ago
  Kevin Hu e5a8b23684
Fix: empty tag field issue. (#6103) 7 months ago
  Kevin Hu 485bc7d7d6
Fix: limit the depth of DFS (#6101) 7 months ago
  Kevin Hu e05cdc2f9c
Fix: encode detect error. (#6006) 7 months ago
  Kevin Hu 15736c57c3
Fix: empty query issue. (#5830) 7 months ago
  Kevin Hu c190086707
Fix: bad case for tokenizer. (#5543) 8 months ago
  Kevin Hu 4f40f685d9
Code refactor (#5371) 8 months ago
  Kevin Hu 53b9e7b52f
Add tavily as web searh tool. (#5349) 8 months ago
  Kevin Hu daddfc9e1b
Remove dup gb2312, solve currupt error. (#5326) 8 months ago
  Kevin Hu 3444cb15e3
Refine search query. (#5235) 8 months ago
  Kevin Hu cdb3e6434a
Fix empty question issue. (#5225) 8 months ago
  Kevin Hu 7b3d700d5f
Apply agentic searching. (#5196) 8 months ago
  Kevin Hu e6c024f8bf
Fix too many clause while searching. (#5119) 8 months ago
  ubbg 29a59ed7e2
Fix: Use self.dataStore.indexExist in all_tags method of Dealer (#5108) 8 months ago
  Kevin Hu 9ff825f39d
Ignore exceptions when no index ahead. (#5047) 8 months ago
  Mathias Panzenböck 9bcccadebd
Remove use of eval() from search.py (#4887) 8 months ago
  Kevin Hu f374dd38b6
Fix divided by zero issue. (#4784) 8 months ago
  Kevin Hu 448fa1c4d4
Robust for abnormal response from LLMs. (#4747) 8 months ago
  Kevin Hu 6f2c3a3c3c
Fix too long query exception. (#4729) 8 months ago
  Kevin Hu 4011c8f68c
Fix potential error. (#4650) 9 months ago
  Kevin Hu 86892959a0
Rebuild graph when it's out of time. (#4607) 9 months ago
  Kevin Hu dd0ebbea35
Light GraphRAG (#4585) 9 months ago
  Kevin Hu c5da3cdd97
Tagging (#4426) 9 months ago
  Kevin Hu d9a4e4cc3b
Fix page size error. (#4401) 9 months ago
  Kevin Hu f948c0d9f1
Clean query. (#4259) 10 months ago
  Kevin Hu 7e063283ba
Removing invisible chars before tokenization. (#4233) 10 months ago
  Bo Liu 321e9f3719
fix: stop rerank by model when search result is empty (#4203) 10 months ago
  Kevin Hu c373dba0bc
Fix raptor bug. (#4192) 10 months ago
  Kevin Hu 31d67c850e
Fetch chunk by batches. (#4177) 10 months ago