184 Révisions (v0.20.2)

Auteur SHA1 Message Date
  Yongteng Lei 787e0c6786
Refa: OpenAI whisper-1 (#9552) il y a 2 mois
  Yongteng Lei eef43fa25c
Fix: unexpected truncated Excel files (#9500) il y a 2 mois
  Jay Xu 6d1078b538
fix 'KeyError: "There is no item named 'word/NULL' in the archive"' (#9455) il y a 2 mois
  HaiyangP 79399f7f25
Support the case of one cell split by multiple columns. (#9225) il y a 2 mois
  Jay Xu 7f08ba47d7
Fix "no `tc` element at grid_offset" (#9375) il y a 2 mois
  yzz 550e65bb22
Fix: PlainParser using fix in presentation (#9239) il y a 2 mois
  Jay Xu cae11201ef
fix "out of memory" if slide.get_thumbnail() to a huge image (#9211) il y a 2 mois
  Kevin Hu d9fe279dde
Feat: Redesign and refactor agent module (#9113) il y a 3 mois
  Yongteng Lei 39ef2ffba9
Feat: parsing supports jsonl or ldjson format (#9087) il y a 3 mois
  Stephen Hu 92cfbcb382
Fix: when parse markdown support extract image at local (#8906) il y a 3 mois
  Yongteng Lei e9b14142a5
Fix: fixed invalid save() arguments for slide thumbnails (#8851) il y a 3 mois
  Yongteng Lei 51a8604dcb
Fix: fixed context loss caused by separating markdown tables from original text (#8844) il y a 3 mois
  Stephen Hu ce140f1393
Fix:Better Support Table Value Type (#8822) il y a 3 mois
  Stephen Hu 2b7adbd2d1
Fix: Improve Memory Usage For Presentation (#8792) il y a 3 mois
  wenxuan.zhang f586dd0a96
Fix: docx parse error. (#8600) il y a 4 mois
  Tuan Le 6b1221d2f6
Fix parser_config access for layout_recognize in presentation.py (#8492) il y a 4 mois
  liuzhenghua 5256980ffb
Fix: Solve the OOM issue when passing large PDF files while using QA chunking method. (#8464) il y a 4 mois
  HaiyangP d6a941ebf5
Fix the bug of long type value overflow (#8313) il y a 4 mois
  Jin Hai 4a2ff633e0
Fix typo in code (#8327) il y a 4 mois
  HaiyangP baf32ee461
Display only the duplicate column names and corresponding original source. (#8138) il y a 4 mois
  Kevin Hu 24625e0695
Fix: presentation of PDF using vlm. (#8133) il y a 4 mois
  Yongteng Lei bd4678bca6
Fix: Unnecessary truncation in markdown parser (#7972) il y a 5 mois
  Kevin Hu bfe97d896d
Fix: docx get image exception. (#7636) il y a 5 mois
  Kevin Hu 321a280031
Feat: add image preview to retrieval test. (#7610) il y a 5 mois
  alkscr baa108f5cc
Fix: markdown table conversion error (#7570) il y a 5 mois
  WhiteBear 5352bdf4da
Error storing tag in Redis (#7541) il y a 5 mois
  Stephen Hu 1a5608d0f8
Fix: Add title_tks for Pictures (#7365) il y a 6 mois
  Stephen Hu 1662c7eda3
Feat: Markdown add image (#7124) il y a 6 mois
  QuintinTao 1b4016317e
fix bug chunking:expected string or bytes-like object (#7116) il y a 6 mois
  Kevin Hu ed5f81b02e
Fix: abnormal cell mergeing. (#6991) il y a 6 mois
  dylan 5aae73c230
Make error messages during PPT processing clearer. (#6980) il y a 6 mois
  Kevin Hu 14a3efd756
Fix: docx image exceptions. (#6839) il y a 6 mois
  Kevin Hu ee5aa51d43
Fix: point in tag issue. (#6436) il y a 7 mois
  fansir 0e0ebaac5f
Feat: Adds hierarchical title path tracking for tables in DOCX documents to improve context association (#6374) il y a 7 mois
  Kevin Hu 95497b4aab
Fix: adapt to old configurations. (#6321) il y a 7 mois
  Yongteng Lei 9611185eb4
Feat: add VLM-boosted DocX parser (#6307) il y a 7 mois
  Yongteng Lei e4380843c4
Feat: add fallback for PDF figure parser (#6305) il y a 7 mois
  Yongteng Lei 1d6760dd84
Feat: add VLM-boosted PDF parser (#6278) il y a 7 mois
  Yongteng Lei 5cf610af40
Feat: add vision LLM PDF parser (#6173) il y a 7 mois
  Kevin Hu 1333d3c02a
Fix: float transfer exception. (#6197) il y a 7 mois
  Kevin Hu 3a99c2b5f4
Refa: PARALLEL_DEVICES is a static parameter. (#6168) il y a 7 mois
  Kevin Hu bfa8d342b3
Fix: retrieval debug mode issue. (#6150) il y a 7 mois
  Debug Doctor 3e19044dee
Feat: add OCR's muti-gpus and parallel processing support (#5972) il y a 7 mois
  Yongteng Lei 4ff609b6a8
Fix: optimize OCR garbage identification to reduce unnecessary filtering (#6027) il y a 7 mois
  Yongteng Lei 7cd37c37cd
Feat: add CSV file parsing support (#5989) il y a 7 mois
  hy89 b0c21b00d9
Refactor: Optimize error handling and support parsing of XLS(EXCEL97—2003) files. (#5633) il y a 8 mois
  Kevin Hu b418ce5643
Fix table parser issue. (#5482) il y a 8 mois
  Kevin Hu 4f40f685d9
Code refactor (#5371) il y a 8 mois
  Kevin Hu c28bc41a96
Fix docx table issue. (#5117) il y a 8 mois
  Kevin Hu c24137bd11
Fix too long integer for `Table`. (#4651) il y a 9 mois