You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
liuzhenghua ea5e8caa69
feat: Enable antialiasing for PDF image extraction to improve OCR accuracy (#7562)
5 months ago
..
resume Fix:when start with source code not in docker env report 'UnicodeDec… (#5802) 7 months ago
__init__.py Update comments (#4569) 9 months ago
docx_parser.py Update comments (#4569) 9 months ago
excel_parser.py Fix: When Excel is a formula, the parsed result is a formula, but cannot be correctly parsed as a value type (#6613) 7 months ago
figure_parser.py Fix: Sometimes VisionFigureParser.figures may is tuple (#7477) 5 months ago
html_parser.py Update comments (#4569) 9 months ago
json_parser.py Update comments (#4569) 9 months ago
markdown_parser.py Feat:Optimize the table extraction logic in the Markdown parser: (#5663) 7 months ago
pdf_parser.py feat: Enable antialiasing for PDF image extraction to improve OCR accuracy (#7562) 5 months ago
ppt_parser.py Refa: Optimize pptx shape extraction to reduce content loss (#6703) 6 months ago
txt_parser.py Fix: delimiter issue. (#5720) 8 months ago
utils.py Update comments (#4569) 9 months ago