Fix rate limit errors during document parsing (#6413)
### What problem does this PR solve?
When using an online large-model API to extract knowledge graphs for the
knowledge base, frequent rate limit errors were triggered, causing
document parsing to fail. This commit fixes the issue by adding
exponential backoff with jitter to the API calls, which reduces the
frequency of rate limit errors.
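A minimal sketch of the retry pattern, assuming a generic callable and a provider that signals rate limiting via its exception message; the helper name and exception handling are illustrative, not the code added by this PR:
```python
import random
import time


def call_with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=30.0):
    """Retry fn with exponential backoff plus jitter when it hits a rate limit.

    Illustrative helper only; the PR applies this pattern inside the
    knowledge-graph extraction calls.
    """
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception as exc:  # in practice, catch the provider's RateLimitError
            if attempt == max_retries - 1 or "rate limit" not in str(exc).lower():
                raise
            # exponential backoff: base, 2*base, 4*base, ... capped at max_delay
            delay = min(base_delay * (2 ** attempt), max_delay)
            # full jitter spreads retries so concurrent workers do not sync up
            time.sleep(random.uniform(0, delay))
```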
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
Feat: Adds hierarchical title path tracking for tables in DOCX documents to improve context association (#6374)
### What problem does this PR solve?
Adds hierarchical title path tracking for tables in DOCX documents to
improve context association. Previously, extracted tables lacked
positional context within the document structure.
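A minimal sketch of the idea using python-docx: walk the document body in order, keep a stack of the headings currently in effect, and tag each table with that title path. The function name and heading-style parsing are assumptions, not the PR's actual implementation:
```python
from docx import Document
from docx.table import Table
from docx.text.paragraph import Paragraph


def tables_with_title_path(path):
    """Yield (title_path, table) pairs, where title_path lists the headings
    in effect at the point where the table appears in the document body."""
    doc = Document(path)
    heading_stack = []  # entries are (level, heading_text)
    for child in doc.element.body.iterchildren():
        if child.tag.endswith('}p'):
            para = Paragraph(child, doc)
            style = para.style.name or ""
            if style.startswith("Heading") and para.text.strip():
                parts = style.split()
                level = int(parts[-1]) if parts[-1].isdigit() else 1
                # drop headings at the same or a deeper level, then push this one
                while heading_stack and heading_stack[-1][0] >= level:
                    heading_stack.pop()
                heading_stack.append((level, para.text.strip()))
        elif child.tag.endswith('}tbl'):
            yield [text for _, text in heading_stack], Table(child, doc)
```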
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Fix: Ollama embeddings interface returning "500 Internal Server Error" (#6350)
### What problem does this PR solve?
Fix the error where the Ollama embeddings interface returns a “500
Internal Server Error” when using models such as xiaobu-embedding-v2 for
embedding.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Call register_scripts when connecting to Redis.
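A hedged sketch of the pattern, assuming a redis-py based wrapper; the class, script, and method names are illustrative, the point being that script registration happens as part of every (re)connect:
```python
import redis


class RedisDB:
    """Illustrative wrapper: re-register Lua scripts whenever we (re)connect,
    so script-based calls do not fail after a Redis restart or failover."""

    def __init__(self, host="localhost", port=6379):
        self.host, self.port = host, port
        self.conn = None
        self.delete_if_equal = None
        self.connect()

    def register_scripts(self):
        # register_script returns a callable bound to this connection
        lua = ("if redis.call('GET', KEYS[1]) == ARGV[1] then "
               "return redis.call('DEL', KEYS[1]) else return 0 end")
        self.delete_if_equal = self.conn.register_script(lua)

    def connect(self):
        self.conn = redis.StrictRedis(host=self.host, port=self.port,
                                      decode_responses=True)
        # the fix: scripts must be registered as part of (re)connecting
        self.register_scripts()
```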
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Add fallback for PDF figure parser
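A hedged sketch of the fallback idea; all names here are hypothetical and only illustrate failing over to a plain image chunk when the figure parser is unavailable or raises:
```python
def parse_figure(figure_image, vision_parser=None):
    """Try the vision-based figure parser first; if it is unavailable or
    raises, fall back so PDF parsing still succeeds.
    Hypothetical names, not the PR's actual API."""
    if vision_parser is not None:
        try:
            return vision_parser(figure_image)
        except Exception:
            pass  # fall through to the plain fallback
    # fallback: keep the figure as an image chunk with no extracted text
    return {"type": "figure", "image": figure_image, "text": ""}
```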
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Optimize settings configuration initialization to resolve a MinIO
initialization error caused by using a specific storage backend.
Reproduction scenario:
- Aliyun OSS is used as the backend storage, with the STORAGE_IMPL
  environment variable set to OSS.
- The service_conf.yaml.template configuration file contains the
  OSS-related configuration, while the other storage configurations are
  commented out.
- When the service starts, it still attempts to initialize MinIO
  storage. Since there is no MinIO configuration in
  service_conf.yaml.template, this results in an error due to the
  missing configuration.
Optimization measures:
- Automatically determine the required initialization configuration
  based on the STORAGE_IMPL environment variable.
- Do not initialize configurations for unused resources (see the sketch
  below).
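A minimal sketch of the selection logic, assuming hypothetical factory classes and import paths; it only illustrates picking one backend from STORAGE_IMPL instead of constructing every storage client at startup:
```python
import os


def init_storage():
    """Initialize only the storage backend selected by STORAGE_IMPL.

    Factory names are illustrative; the point is that MinIO is no longer
    constructed when OSS (or another backend) is configured."""
    impl = os.environ.get("STORAGE_IMPL", "MINIO").upper()
    if impl == "OSS":
        from rag.utils.oss_conn import RAGFlowOSS  # hypothetical import path
        return RAGFlowOSS()
    if impl == "MINIO":
        from rag.utils.minio_conn import RAGFlowMinio  # hypothetical import path
        return RAGFlowMinio()
    raise ValueError(f"Unsupported STORAGE_IMPL: {impl}")
```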
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Add VLM-boosted PDF parser if VLM is set.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Fix: Add a basic example when the content_tagging example is empty (#6276)
### What problem does this PR solve?
When using an LLM for auto-tagging, if there are no examples, the tag
format generated by the LLM may be wrong, which causes Elasticsearch
insert errors. Adding a basic example avoids this problem.
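A hedged sketch of the fallback, with an illustrative built-in example and prompt formatting (not the PR's actual prompt):
```python
import json

# A minimal built-in example used only when the knowledge base has no
# tagged chunks yet; the exact wording is illustrative.
DEFAULT_TAG_EXAMPLE = [{
    "content": "Quarterly revenue grew 12% driven by cloud subscriptions.",
    "tags": {"finance": 8, "cloud": 5},
}]


def build_tag_examples(examples):
    """Return few-shot examples for the content-tagging prompt, falling back
    to a basic example so the LLM always sees the expected JSON format."""
    if not examples:
        examples = DEFAULT_TAG_EXAMPLE
    return "\n".join(
        f"Content: {ex['content']}\nTags: {json.dumps(ex['tags'])}"
        for ex in examples
    )
```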
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Add vision LLM PDF parser
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
Respect kb_id in Elasticsearch insert, update, and delete operations. (#6105)
### What problem does this PR solve?
Respect kb_id in Elasticsearch insert, update, and delete operations. Close #6066
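A hedged sketch using the 8.x elasticsearch Python client, with illustrative index and field names, showing how a delete can be scoped by both doc_id and kb_id so it never touches another knowledge base's data:
```python
from elasticsearch import Elasticsearch

# connection details are illustrative
es = Elasticsearch("http://localhost:9200")


def delete_chunks(index, doc_id, kb_id):
    """Delete the chunks of one document, scoped by kb_id as well as doc_id."""
    es.delete_by_query(
        index=index,
        query={"bool": {"must": [
            {"term": {"doc_id": doc_id}},
            {"term": {"kb_id": kb_id}},  # the fix: always filter by kb_id
        ]}},
    )
```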
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
…ions
### What problem does this PR solve?
This PR fixes an issue where the application was repeatedly reading the
llm_factories.json file from disk in multiple places, which could lead
to "Too many open files" errors under high load conditions. The fix
centralizes the file reading operation in the settings.py module and
stores the data in a global variable that can be accessed by other
modules.
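A minimal sketch of the caching approach, assuming a module-level variable and a hypothetical file path; the real change lives in settings.py:
```python
import json
import os

# Loaded once per process instead of re-opening the file on every request,
# which avoids "Too many open files" under load. The path is illustrative.
_FACTORIES_PATH = os.path.join(os.path.dirname(__file__), "llm_factories.json")
_FACTORY_LLM_INFOS = None


def get_llm_factories():
    """Return the parsed llm_factories.json, reading the file at most once."""
    global _FACTORY_LLM_INFOS
    if _FACTORY_LLM_INFOS is None:
        with open(_FACTORIES_PATH, encoding="utf-8") as f:
            _FACTORY_LLM_INFOS = json.load(f)
    return _FACTORY_LLM_INFOS
```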
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [x] Performance Improvement
- [ ] Other (please describe):
Fix: optimize OCR garbage identification to reduce unnecessary filtering (#6027)
### What problem does this PR solve?
Optimize OCR garbage identification to reduce unnecessary filtering.
#5713
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Add CSV file parsing support #4552, #5849, #5870
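A hedged sketch of row-wise CSV chunking using the standard csv module; the chunk format is illustrative, not the parser's exact output:
```python
import csv


def parse_csv(path, delimiter=","):
    """Turn each CSV row into a 'header: value' line, one chunk per row.
    Mirrors spreadsheet-style handling; the exact chunking is illustrative."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f, delimiter=delimiter)
        header = next(reader, [])
        for row in reader:
            yield "; ".join(f"{h}: {v}" for h, v in zip(header, row))
```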
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Fix: signal.SIGUSR1 and signal.SIGUSR2 can't be used on Windows, so don't bind them in the Windows environment (#5941)
### What problem does this PR solve?
Fix: signal.SIGUSR1 and signal.SIGUSR2 can't be used on Windows, so don't
bind signal.SIGUSR1 and signal.SIGUSR2 in the Windows environment.
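A minimal sketch of the guard, with a placeholder handler; the real handlers are whatever the server binds on POSIX systems:
```python
import signal
import sys


def dump_info(signum, frame):
    # placeholder handler; illustrative only
    print(f"received signal {signum}")


# SIGUSR1 and SIGUSR2 are POSIX-only; binding them on Windows raises an error,
# so the handlers are only registered off Windows.
if sys.platform != "win32":
    signal.signal(signal.SIGUSR1, dump_info)
    signal.signal(signal.SIGUSR2, dump_info)
```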
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
Co-authored-by: tangyu <1@1.com>
Fix: CoHereRerank not respecting base_url when provided (#5784)
### What problem does this PR solve?
The vLLM provider with a reranking model does not work: since vLLM uses
the [CoHereRerank
provider](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/__init__.py#L250)
under the hood with a `base_url`, if this URL [is not passed to the Cohere
client](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/rerank_model.py#L379-L382),
every request ends up on the Cohere SaaS (sending your private API key
in the process) instead of your vLLM instance.
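A hedged sketch of the fix, assuming a cohere SDK version whose `Client` accepts `base_url`; the class and attribute names loosely mirror the linked code but are not copied from it:
```python
import cohere


class CoHereRerank:
    """Sketch: honor base_url when constructing the Cohere client, so a
    vLLM-served reranker is actually called instead of Cohere's SaaS."""

    def __init__(self, key, model_name, base_url=None):
        # base_url=None keeps the default Cohere endpoint; a vLLM URL
        # routes rerank requests to the local instance instead
        self.client = cohere.Client(api_key=key, base_url=base_url)
        self.model_name = model_name
```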
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):