187 commits (2d89863fddbb360934c6687ecbbdf620f2a5dbbe)

Author SHA1 Message Date
  pingguoCooler cf0011be67
Feat: Upgrade html parser (#9675) 2 months ago
  Yongteng Lei 382458ace7
Feat: advanced markdown parsing (#9607) 2 months ago
  Kevin Hu 312f1a0477
Fix: enlarge raptor timeout limits. (#9600) 2 months ago
  Yongteng Lei 787e0c6786
Refa: OpenAI whisper-1 (#9552) 2 months ago
  Yongteng Lei eef43fa25c
Fix: unexpected truncated Excel files (#9500) 2 months ago
  Jay Xu 6d1078b538
fix 'KeyError: "There is no item named 'word/NULL' in the archive"' (#9455) 2 months ago
  HaiyangP 79399f7f25
Support the case of one cell split by multiple columns. (#9225) 2 months ago
  Jay Xu 7f08ba47d7
Fix "no `tc` element at grid_offset" (#9375) 2 months ago
  yzz 550e65bb22
Fix: PlainParser using fix in presentation (#9239) 2 months ago
  Jay Xu cae11201ef
fix "out of memory" if slide.get_thumbnail() to a huge image (#9211) 3 months ago
  Kevin Hu d9fe279dde
Feat: Redesign and refactor agent module (#9113) 3 months ago
  Yongteng Lei 39ef2ffba9
Feat: parsing supports jsonl or ldjson format (#9087) 3 months ago
  Stephen Hu 92cfbcb382
Fix: when parse markdown support extract image at local (#8906) 3 months ago
  Yongteng Lei e9b14142a5
Fix: fixed invalid save() arguments for slide thumbnails (#8851) 3 months ago
  Yongteng Lei 51a8604dcb
Fix: fixed context loss caused by separating markdown tables from original text (#8844) 3 months ago
  Stephen Hu ce140f1393
Fix:Better Support Table Value Type (#8822) 3 months ago
  Stephen Hu 2b7adbd2d1
Fix: Improve Memory Usage For Presentation (#8792) 3 months ago
  wenxuan.zhang f586dd0a96
Fix: docx parse error. (#8600) 4 months ago
  Tuan Le 6b1221d2f6
Fix parser_config access for layout_recognize in presentation.py (#8492) 4 months ago
  liuzhenghua 5256980ffb
Fix: Solve the OOM issue when passing large PDF files while using QA chunking method. (#8464) 4 months ago
  HaiyangP d6a941ebf5
Fix the bug of long type value overflow (#8313) 4 months ago
  Jin Hai 4a2ff633e0
Fix typo in code (#8327) 4 months ago
  HaiyangP baf32ee461
Display only the duplicate column names and corresponding original source. (#8138) 4 months ago
  Kevin Hu 24625e0695
Fix: presentation of PDF using vlm. (#8133) 4 months ago
  Yongteng Lei bd4678bca6
Fix: Unnecessary truncation in markdown parser (#7972) 5 months ago
  Kevin Hu bfe97d896d
Fix: docx get image exception. (#7636) 5 months ago
  Kevin Hu 321a280031
Feat: add image preview to retrieval test. (#7610) 5 months ago
  alkscr baa108f5cc
Fix: markdown table conversion error (#7570) 5 months ago
  WhiteBear 5352bdf4da
Error storing tag in Redis (#7541) 5 months ago
  Stephen Hu 1a5608d0f8
Fix: Add title_tks for Pictures (#7365) 6 months ago
  Stephen Hu 1662c7eda3
Feat: Markdown add image (#7124) 6 months ago
  QuintinTao 1b4016317e
fix bug chunking:expected string or bytes-like object (#7116) 6 months ago
  Kevin Hu ed5f81b02e
Fix: abnormal cell mergeing. (#6991) 6 months ago
  dylan 5aae73c230
Make error messages during PPT processing clearer. (#6980) 6 months ago
  Kevin Hu 14a3efd756
Fix: docx image exceptions. (#6839) 6 months ago
  Kevin Hu ee5aa51d43
Fix: point in tag issue. (#6436) 7 months ago
  fansir 0e0ebaac5f
Feat: Adds hierarchical title path tracking for tables in DOCX documents to improve context association (#6374) 7 months ago
  Kevin Hu 95497b4aab
Fix: adapt to old configurations. (#6321) 7 months ago
  Yongteng Lei 9611185eb4
Feat: add VLM-boosted DocX parser (#6307) 7 months ago
  Yongteng Lei e4380843c4
Feat: add fallback for PDF figure parser (#6305) 7 months ago
  Yongteng Lei 1d6760dd84
Feat: add VLM-boosted PDF parser (#6278) 7 months ago
  Yongteng Lei 5cf610af40
Feat: add vision LLM PDF parser (#6173) 7 months ago
  Kevin Hu 1333d3c02a
Fix: float transfer exception. (#6197) 7 months ago
  Kevin Hu 3a99c2b5f4
Refa: PARALLEL_DEVICES is a static parameter. (#6168) 7 months ago
  Kevin Hu bfa8d342b3
Fix: retrieval debug mode issue. (#6150) 7 months ago
  Debug Doctor 3e19044dee
Feat: add OCR's muti-gpus and parallel processing support (#5972) 7 months ago
  Yongteng Lei 4ff609b6a8
Fix: optimize OCR garbage identification to reduce unnecessary filtering (#6027) 7 months ago
  Yongteng Lei 7cd37c37cd
Feat: add CSV file parsing support (#5989) 7 months ago
  hy89 b0c21b00d9
Refactor: Optimize error handling and support parsing of XLS(EXCEL97-2003) files. (#5633) 8 months ago
  Kevin Hu b418ce5643
Fix table parser issue. (#5482) 8 months ago