liuzhenghua ea5e8caa69
feat: Enable antialiasing for PDF image extraction to improve OCR accuracy (#7562)
5 months ago
..
resume Fix:when start with source code not in docker env report 'UnicodeDec… (#5802) 7 months ago
__init__.py Update comments (#4569) 9 months ago
docx_parser.py Update comments (#4569) 9 months ago
excel_parser.py Fix: When Excel is a formula, the parsed result is a formula, but cannot be correctly parsed as a value type (#6613) 7 months ago
figure_parser.py Fix: Sometimes VisionFigureParser.figures may is tuple (#7477) 6 months ago
html_parser.py Update comments (#4569) 9 months ago
json_parser.py Update comments (#4569) 9 months ago
markdown_parser.py Feat:Optimize the table extraction logic in the Markdown parser: (#5663) 8 months ago
pdf_parser.py feat: Enable antialiasing for PDF image extraction to improve OCR accuracy (#7562) 5 months ago
ppt_parser.py Refa: Optimize pptx shape extraction to reduce content loss (#6703) 6 months ago
txt_parser.py Fix: delimiter issue. (#5720) 8 months ago
utils.py Update comments (#4569) 9 months ago