23 Commits (eef43fa25cbe380983129d169d9fa27cb9410a13)

Author SHA1 Message Date
  Yongteng Lei eef43fa25c
Fix: unexpected truncated Excel files (#9500) 2 months ago
  Jay Xu 79e2edc835
Fix "File contains no valid workbook part" (#9360) 2 months ago
  Jay Xu 569ab011c4
Add fallback to use 'calamine' parse engine in excel_parser.py (#9374) 2 months ago
  Jin Hai 03daf4618c
Refactor parser code (#9042) 3 months ago
  donblack01 0b48a2e0d1
Fix: When Excel is a formula, the parsed result is a formula, but cannot be correctly parsed as a value type (#6613) 7 months ago
  Yongteng Lei 7cd37c37cd
Feat: add CSV file parsing support (#5989) 7 months ago
  hy89 b0c21b00d9
Refactor: Optimize error handling and support parsing of XLS(EXCEL97-2003) files. (#5633) 8 months ago
  SkyfireWXY 8fcca1b958
fix: big xls file error (#4859) 8 months ago
  Jin Hai 3894de895b
Update comments (#4569) 9 months ago
  ly0303521 101b8ff813
fix chunk method "Table" losing content when the Excel file has multi… (#4123) 10 months ago
  Zhichang Yu 0d68a6cd1b
Fix errors detected by Ruff (#3918) 11 months ago
  Jin Hai cdea1d0a85
Update readme and add license (#1018) 1 year ago
  KevinHuSh a12fcf9156
fix minio health bug (#850) 1 year ago
  GYH c27c02ea67
Split Excel file into different chunks (#847) 1 year ago
  KevinHuSh 7013d7f620
refine text decode (#657) 1 year ago
  KevinHuSh 9d60a84958
refactor code (#583) 1 year ago
  KevinHuSh ed6081845a
Fit a lot of encodings for text file. (#458) 1 year ago
  KevinHuSh 36f2d7b797
To avoid assertion while no rows in excel (#197) 1 year ago
  KevinHuSh fd7fcb5baf
apply pep8 formalize (#155) 1 year ago
  KevinHuSh 6999598101
refine for English corpus (#135) 1 year ago
  KevinHuSh 675a9f8d9a
add dockerfile for cuda environment. Refine table search strategy (#123) 1 year ago
  KevinHuSh f1f09df901
add local llm implementation (#119) 1 year ago
  KevinHuSh cacd36c5e1
use onnx models, new deepdoc (#68) 1 year ago
  KevinHuSh 30791976d5
build python version rag-flow (#21) 1 year ago