You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Bowen Liang 39c14ec7c1
improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965)
1 year ago
..
blod improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 1 year ago
entity fix: ExtractSetting optional value missing None as default val (#5238) 1 year ago
firecrawl Feat/firecrawl data source (#5232) 1 year ago
unstructured Add UNSTRUCTURED_API_KEY env support (#4369) 1 year ago
csv_extractor.py dep: bump pandas from 1.x to 2.x (#4820) 1 year ago
excel_extractor.py improve: unify Excel files parsing in either xls or xlsx file format by Pandas (#4965) 1 year ago
extract_processor.py Feat/firecrawl data source (#5232) 1 year ago
extractor_base.py Feat/dify rag (#2528) 1 year ago
helpers.py Feat/dify rag (#2528) 1 year ago
html_extractor.py Fix some RAG bugs (#2570) 1 year ago
markdown_extractor.py Feat/dify rag (#2528) 1 year ago
notion_extractor.py Feat/firecrawl data source (#5232) 1 year ago
pdf_extractor.py Feat/dify rag (#2528) 1 year ago
text_extractor.py Feat/dify rag (#2528) 1 year ago
word_extractor.py deal the external image when extract docx image (#5024) 1 year ago