132 commits (v0.17.0)

Author SHA1 Message Date
  yihong 37aacb3960
Refa: drop useless fasttext (#5470) 8 months ago
  Yongteng Lei 83d0949498
Fix: fix special delimiter parsing issue (#5448) 8 months ago
  Zhichang Yu db42d0e0ae
Optimize ocr (#5297) 8 months ago
  Zhichang Yu 0151d42156
Reuse loaded modules if possible (#5231) 8 months ago
  Zhichang Yu c326f14fed
Optimized Recognizer.sort_X_firstly and Recognizer.sort_Y_firstly (#5182) 8 months ago
  Kevin Hu b08bb56f6c
Display thinking for deepseek r1 (#4904) 8 months ago
  Mathias Panzenböck 6b389e01b5
Remove use of eval() from operators.py (#4888) 8 months ago
  SkyfireWXY 8fcca1b958
fix: big xls file error (#4859) 8 months ago
  Zhichang Yu 3411d0a2ce
Added cuda_is_available (#4725) 8 months ago
  Zhichang Yu e1526846da
Fixed GPU detection on CPU only environment (#4711) 9 months ago
  Kevin Hu 6f30397bb5
Infinity adapt to graphrag. (#4663) 9 months ago
  Kevin Hu 1bff6b7333
Fix t_ocr.py for PNG image. (#4625) 9 months ago
  Zhichang Yu 4230402fbb
deepdoc use GPU if possible (#4618) 9 months ago
  Mathias Panzenböck 1a367664f1
Remove usage of eval() from postprocess.py (#4571) 9 months ago
  Jin Hai 3894de895b
Update comments (#4569) 9 months ago
  Mathias Panzenböck 75e1981e13
Remove use of eval() from recognizer.py (#4480) 9 months ago
  Mathias Panzenböck 4f9f9405b8
Remove use of eval() from ocr.py (#4481) 9 months ago
  Kevin Hu c852a6dfbf
Accelerate titles' embeddings. (#4492) 9 months ago
  Kevin Hu e478586a8e
Refactor. (#4487) 9 months ago
  Zhi-Qiang You b7ce4e7e62
fix:t_recognizer TypeError: 'super' object is not callable (#4404) 9 months ago
  Kevin Hu 2e40c2a6f6
Fix t_recognizer issue. (#4387) 9 months ago
  Kevin Hu 983ec0666c
Fix param error. (#4355) 10 months ago
  Kevin Hu 59a78408be
Fix t_recognizer.py after model updating. (#4330) 10 months ago
  Kevin Hu 76cd23eecf
Catch the exception while parsing pptx. (#4202) 10 months ago
  Kevin Hu 2cbe064080
Add Llama3.3 (#4174) 10 months ago
  ly0303521 101b8ff813
fix chunk method "Table" losing content when the Excel file has multi… (#4123) 10 months ago
  Kevin Hu ce1e855328
Upgrades Document Layout Analysis model. (#4054) 10 months ago
  Jin Hai 275b5d14f2
Fix json file parse (#4004) 10 months ago
  Zhichang Yu 9a6d976252
Add back beartype (#3967) 10 months ago
  Zhichang Yu 1254ecf445
Added static check at PR CI (#3921) 10 months ago
  Zhichang Yu 0d68a6cd1b
Fix errors detected by Ruff (#3918) 10 months ago
  Jin Hai 821fdf02b4
Fix parsing JSON file error (#3829) 11 months ago
  Yuhao Tsui 7b6a5ffaff
Fix: page_chars attribute does not exist in some formats of PDF (#3796) 11 months ago
  Kevin Hu 7058ac0041
Fix out of boundary. (#3786) 11 months ago
  Zhichang Yu bc701d7b4c
Edit chunk shall update instead of insert it (#3709) 11 months ago
  Zhichang Yu 2249d5d413
Always open text file for write with UTF-8 (#3688) 11 months ago
  Zhichang Yu cad341e794
Added kb_id filter to knn. Fix #3458 (#3513) 11 months ago
  Zhichang Yu 4413683898
Introduced beartype (#3460) 11 months ago
  Jin Hai 1e90a1bf36
Move settings initialization after module init phase (#3438) 11 months ago
  Zhichang Yu 30f6421760
Use consistent log file names, introduced initLogger (#3403) 11 months ago
  Kevin Hu 4caf932808
fix bug about fetching knowledge graph (#3394) 11 months ago
  Zhichang Yu a2a5631da4
Rework logging (#3358) 11 months ago
  kuschzzp 9c6cc20356
Fix:#3230 When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful (#3249) 11 months ago
  Kevin Hu 2d1fbefdb5
search between multiple indiices for team function (#3079) 1 year ago
  Kevin Hu bfc07fe4f9
bigger resolution for OCR (#2919) 1 year ago
  chongchuanbing 66172cef3e
fix: torch dependency start error (#2777) 1 year ago
  Ikko Eltociear Ashimine c552a02e7f
chore: update operators.py (#2724) 1 year ago
  Kevin Hu daa65199e8
trival (#2650) 1 year ago
  Kevin Hu fc867cb959
rename get_txt to get_text (#2649) 1 year ago
  yqkcn aea553c3a8
Add get_txt function (#2639) 1 year ago