ragflow

Commit Graph

Author	SHA1	Message	Date
Stephen Hu	ca320a8c30	Refactor: for total_token_count method use if to check first. (#9707) ### What problem does this PR solve? for total_token_count method use if to check first, to improve the performance when we need to handle exception cases ### Type of change - [x] Refactoring	2 months ago
Yongteng Lei	b6c1ca828e	Refa: replace Chat Ollama implementation with LiteLLM (#9693) ### What problem does this PR solve? replace Chat Ollama implementation with LiteLLM. ### Type of change - [x] Refactoring	2 months ago
Yongteng Lei	3947da10ae	Fix: unexpected LLM parameters (#9661) ### What problem does this PR solve? Remove unexpected LLM parameters. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Yongteng Lei	787e0c6786	Refa: OpenAI whisper-1 (#9552) ### What problem does this PR solve? Refactor OpenAI to enable audio parsing. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2 months ago
Stephen Hu	a0d630365c	Refactor:Improve VoyageRerank not texts handling (#9539) ### What problem does this PR solve? Improve VoyageRerank not texts handling ### Type of change - [x] Refactoring	2 months ago
Yongteng Lei	fe32952825	Fix: Gemini parameters error (#9520) ### What problem does this PR solve? Fix Gemini parameters error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2 months ago
Stephen Hu	fb77f9917b	Refactor: Use Input Length In DefaultRerank (#9516) ### What problem does this PR solve? 1. Use input length to prepare res 2. Adjust torch_empty_cache code location ### Type of change - [x] Refactoring - [x] Performance Improvement	2 months ago
RuyXu	762aa4b8c4	fix: preserve correct MIME & unify data URL handling for vision inputs (relates #9248) (#9474) fix: preserve correct MIME & unify data URL handling for vision inputs (relates #9248) - Updated image2base64() to return a full data URL (data:image/<fmt>;base64,...) with accurate MIME - Removed hardcoded image/jpeg in Base._image_prompt(); pass through data URLs and default raw base64 to image/png - Set AnthropicCV._image_prompt() raw base64 media_type default to image/png - Ensures MIME type matches actual image content, fixing “cannot process base64 image” errors on vLLM/OpenAI-compatible backends ### What problem does this PR solve? This PR fixes a compatibility issue where base64-encoded images sent to vision models (e.g., vLLM/OpenAI-compatible backends) were rejected due to mismatched MIME type or incorrect decoding. Previously, the backend: - Always converted raw base64 into data:image/jpeg;base64,... even if the actual content was PNG. - In some cases, base64 decoding was attempted on the full data URL string instead of the pure base64 part. This caused errors like: ``` cannot process base64 image failed to decode base64 string: illegal base64 data at input byte 0 ``` by strict validators such as vLLM. With this fix, the MIME type in the request now matches the actual image content, and data URLs are correctly handled or passed through, ensuring vision models can decode and process images reliably. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Stephen Hu	f2806a8332	Update cv_model.py (#9472) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9452 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Stephen Hu	da5cef0686	Refactor:Improve the float compare for LocalAIRerank (#9428) ### What problem does this PR solve? Improve the float compare for LocalAIRerank ### Type of change - [x] Refactoring	2 months ago
Yongteng Lei	a0c2da1219	Fix: Patch LiteLLM (#9416) ### What problem does this PR solve? Patch LiteLLM refactor. #9408 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Yongteng Lei	83771e500c	Refa: migrate chat models to LiteLLM (#9394) ### What problem does this PR solve? All models pass the mock response tests, which means that if a model can return the correct response, everything should work as expected. However, not all models have been fully tested in a real environment, the real API_KEY. I suggest actively monitoring the refactored models over the coming period to ensure they work correctly and fixing them step by step, or waiting to merge until most have been tested in practical environment. ### Type of change - [x] Refactoring	2 months ago
Stephen Hu	7713e14d6a	Update chat_model.py (#9318) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9317 base on https://discuss.ai.google.dev/t/valueerror-invalid-operation-the-response-text-quick-accessor-requires-the-response-to-contain-a-valid-part-but-none-were-returned/42866 should can be handled by retry ### Type of change - [x] Refactoring	2 months ago
Kevin Hu	a2e1f5618d	Fix: bytes style image issue. (#9304) ### What problem does this PR solve? #9302 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
so95	35539092d0	Add kwargs to model base class constructors (#9252) Updated constructors for base and derived classes in chat, embedding, rerank, sequence2txt, and tts models to accept kwargs. This change improves extensibility and allows passing additional parameters without breaking existing interfaces. - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: IT: Sop.Son <sop.son@feavn.local> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2 months ago
Kevin Hu	2124329e95	Fix: local variable issue. (#9255) ### What problem does this PR solve? #9227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Stephen Hu	0a303d9ae1	Refactor:Improve the chat stream logic for NvidiaCV (#9242) ### What problem does this PR solve? Improve the chat stream logic for NvidiaCV ### Type of change - [x] Refactoring	2 months ago
Stephen Hu	1deb0a2d42	Fix:local variable 'response' referenced before assignment (#9230) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2 months ago
Yongteng Lei	30ccc4a66c	Fix: correct single base64 image handling in image prompt (#9220) ### What problem does this PR solve? Correct single base64 image handling in image prompt. ![img_v3_02or_ec4757c2-a9d4-4774-9a76-f7c6be633ebg](https://github.com/user-attachments/assets/872a86bf-e2a8-48d1-9b71-2a0c7a35ba9e) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Stephen Hu	e9cbf4611d	Fix:Error when parsing files using Gemini: ERROR: GENERIC_ERROR - Unknown field for GenerationConfig: max_tokens (#9195) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/9177 The reason should be due to the gemin internal use a different parameter name ` max_output_tokens (int): Optional. The maximum number of tokens to include in a response candidate. Note: The default value varies by model, see the ``Model.output_token_limit`` attribute of the ``Model`` returned from the ``getModel`` function. This field is a member of `oneof`_ ``_max_output_tokens``. ` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2 months ago
Stephen Hu	5ccdb95008	Refactor:Introduce Image Close For GeminiCV (#9147) ### What problem does this PR solve? Introduce Image Close For GeminiCV ### Type of change - [x] Refactoring - [x] Performance Improvement	3 months ago
JI4JUN	aeaeb169e4	Feat/support 302ai provider (#8742) ### What problem does this PR solve? Support 302.AI provider. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	3 months ago
Stephen Hu	20b4d88098	Refactor: Improve the try catch logic for XinferenceEmbed (#9128) ### What problem does this PR solve? Improve the try catch logic for XinferenceEmbed ### Type of change - [x] Refactoring	3 months ago
Kevin Hu	d9fe279dde	Feat: Redesign and refactor agent module (#9113) ### What problem does this PR solve? #9082 #6365 <u> WARNING: it's not compatible with the older version of `Agent` module, which means that `Agent` from older versions can not work anymore.</u> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	3 months ago
謝富祥	021e8b57ae	Fix: fix error 429 api rate limit when building knowledge graph for all chat model and Mistral embedding model (#9106) ### What problem does this PR solve? fix error 429 api rate limit when building knowledge graph for all chat model and Mistral embedding model. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
Stephen Hu	ba563f8095	Update embedding_model.py (#9083) ### What problem does this PR solve? Reduce the logic scope for DefaultEmbedding ### Type of change - [x] Refactoring	3 months ago
Stephen Hu	86b4da0844	Refactor: Remove Useless split for BedrockEmbed (#9067) ### What problem does this PR solve? Remove Useless split for BedrockEmbed ### Type of change - [x] Refactoring	3 months ago
Stephen Hu	53b0b0e583	get keep alive from env (#9039) ### What problem does this PR solve? get keepalive from env ### Type of change - [x] Refactoring	3 months ago
Viktor Dmitriyev	b47dcc9108	Fix issue with `keep_alive=-1` for ollama chat model by allowing a user to set an additional configuration option (#9017) ### What problem does this PR solve? fix issue with `keep_alive=-1` for ollama chat model by allowing a user to set an additional configuration option. It is no-breaking change because it still uses a previous default value such as: `keep_alive=-1` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [X] Performance Improvement - [X] Other (please describe): - Additional configuration option has been added to control behavior of RAGFlow while working with ollama LLM	3 months ago
Yongteng Lei	a2f73af1a4	Fix: typo Bearer token (#8998) ### What problem does this PR solve? Typo Bearer token. #8960 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
Yongteng Lei	7ebc1f0943	Feat: add model provider DeepInfra (#9003) ### What problem does this PR solve? Add model provider DeepInfra. This model list comes from our community. NOTE: most endpoints haven't been tested, but they should work as OpenAI does. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	3 months ago
Stephen Hu	ec21d9a98f	Refactor:remove use less convert for FastEmbed (#8984) ### What problem does this PR solve? remove use less convert for FastEmbed ### Type of change - [x] Refactoring	3 months ago
Stephen Hu	95b9208b13	Fix:Improve float operation when rerank (#8963) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8915 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
Stephen Hu	46caf6ae72	Refactor improve codes for ranker (#8936) ### What problem does this PR solve? Use the normalize method directly ### Type of change - [x] Refactoring	3 months ago
Stephen Hu	38b34116dd	Refa: Remove useless conver and fix a bug for DefaultRerank (#8887) ### What problem does this PR solve? 1. bug when re-try, we need to reset i. 2. remove useless convert ### Type of change - [x] Refactoring	3 months ago
Liu An	9e45fcfdb3	Fix: fix typo in OpenAI error logging message (#8865) ### What problem does this PR solve? Correct the logging message from "OpenAI cat_with_tools" to "OpenAI chat_with_tools" in the `_exceptions` method of the `Base` class to accurately reflect the method name and improve error traceability. ### Type of change - [x] Typo	3 months ago
Stephen Hu	5fa6f2f151	Update embedding_model.py (#8836) ### What problem does this PR solve? Remove useless covert for bge encode_queries ### Type of change - [x] Performance Improvement	3 months ago
Stephen Hu	5383e254c4	Perf:Remove Useless Convert When BGE Embedding (#8816) ### What problem does this PR solve? FlagModel internal support returns as numpy ### Type of change - [x] Performance Improvement	3 months ago
Stephen Hu	07208e519b	Fix: Wrong_Input_type_for_Gemin (#8783) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8763#issuecomment-3055317110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
Yongteng Lei	1895667573	Feat: add xAI provider (#8781) ### What problem does this PR solve? Add xAI provider (experimental feature, requires user feedback). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	3 months ago
Kevin Hu	8281ceb406	Refa: refine retry gap. (#8773) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	3 months ago
Stephen Hu	8d027813f5	Refactor: Improve How To Handle QWenEmbed (#8765) ### What problem does this PR solve? Based on https://github.com/infiniflow/ragflow/issues/8740 1. A better handle for 'NoneType' object is not subscriptable 2. Add some logs to get the internal message ### Type of change - [x] Refactoring	3 months ago
Stephen Hu	19419281c3	Fix: Change Ollama Embedding Keep Alive (#8734) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8733 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
Stephen Hu	e60ec0a31b	Fix:disallowed special token while embedding (#8692) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	3 months ago
6607changchun	9580e99650	fix: retry embedding with Qwen family models when limits temporarily reached. (#8690) fix: retry embedding with Qwen family models when limits temporarily reached. APIs of Qwen family models are limited by calling rates. When reached, the "output" attribute of the "resp" will be None, and in turn cause TypeError when trying to retrieve "embeddings". Since these limits are almost temporary, I have added a simple retry mechanism to avoid it. Besides, if retry_max reached, the error can be early raised, instead of hidden behind "TypeError". ### What problem does this PR solve? Sometimes Qwen blocks calling due to rate limits, but it will cause the whole parsing procedure stops when creating knowledge base. In this situation, resp["output"] will be None, and resp["output"]["embeddings"] will cause TypeError. Since the limits are temporary, I apply a simple retry mechanism to solve it. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	3 months ago
Yongteng Lei	f8a6987f1e	Refa: automatic LLMs registration (#8651) ### What problem does this PR solve? Support automatic LLMs registration. ### Type of change - [x] Refactoring	4 months ago
Kevin Hu	fffb7c0bba	Fix: anthropic llm issue. (#8633) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	4 months ago
He Wang	898da23caa	make dirs with 'exist_ok=True' (#8629) ### What problem does this PR solve? The following error occurred during local testing, which should be fixed by configuring 'exist_ok=True'. ```log set_progress(`7461edc253`), progress: -1, progress_msg: 21:41:41 Page(1~100000001): [ERROR][Errno 17] File exists: '/ragflow/tmp' ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	4 months ago
Tuan Le	d343cb4deb	Add Google Cloud Vision API Integration (Image2Text) (#8608) ### What problem does this PR solve? This PR introduces Google Cloud Vision API integration to enhance image understanding capabilities in the application. It addresses the need for advanced image description and chat functionalities by implementing a new `GoogleCV` class to handle API interactions and updating relevant configurations. This enables users to leverage Google Cloud Vision for image-to-text tasks, improving the application's ability to process and interpret visual data. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	4 months ago
Tuan Le	1c77b4ed9b	fix: Correctly format message parts in GoogleChat (#8596) ### What problem does this PR solve? This PR addresses an incompatibility issue with the Google Chat API by correcting the message content format in the `GoogleChat` class. Previously, the content was directly assigned to the "parts" field, which did not align with the API's expected format. This change ensures that messages are properly formatted with a "text" key within a dictionary, as required by the API. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	4 months ago

1 2 3 4 5 ...

350 Commits (2d89863fddbb360934c6687ecbbdf620f2a5dbbe)