ragflow

Commit graph

Autor	SHA1	Nachricht	Datum
Liu An	abb6359547	Docs: Update version references to v0.20.3 in READMEs and docs (#9581) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.20.2 to v0.20.3 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update	vor 2 Monaten
Liu An	0aa3c4cdae	Docs: Update version references to v0.20.2 in READMEs and docs (#9559) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.20.1 to v0.20.2 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update	vor 2 Monaten
Liu An	c8bbf7452d	Env: Update dependencies for proxy support (#9519) ### What problem does this PR solve? - Update httpx dependency to include socks support in pyproject.toml - Update lockfile with new socksio dependency ### Type of change - [x] Update dependencies for proxy support	vor 2 Monaten
Yongteng Lei	99df0766fe	Feat: add SMTP support for user invitation emails (#9479) ### What problem does this PR solve? Add SMTP support for user invitation emails ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 2 Monaten
Kevin Hu	b6e34e3aa7	Fix: PyPDF's Manipulated FlateDecode streams can exhaust RAM (#9469) ### What problem does this PR solve? #3951 #8463 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 2 Monaten
Jay Xu	569ab011c4	Add fallback to use 'calamine' parse engine in excel_parser.py (#9374) ### What problem does this PR solve? add fallback to `calamine` engine when parse error raised using the default `openpyxl` / `xlrd` engine. e.g. the following error can be fixed: ``` Traceback (most recent call last): File "/ragflow/deepdoc/parser/excel_parser.py", line 53, in _load_excel_to_workbook df = pd.read_excel(file_like_object) File "/ragflow/.venv/lib/python3.10/site-packages/pandas/io/excel/_base.py", line 495, in read_excel io = ExcelFile( File "/ragflow/.venv/lib/python3.10/site-packages/pandas/io/excel/_base.py", line 1567, in __init__ self._reader = self._engines[engine]( File "/ragflow/.venv/lib/python3.10/site-packages/pandas/io/excel/_xlrd.py", line 46, in __init__ super().__init__( File "/ragflow/.venv/lib/python3.10/site-packages/pandas/io/excel/_base.py", line 573, in __init__ self.book = self.load_workbook(self.handles.handle, engine_kwargs) File "/ragflow/.venv/lib/python3.10/site-packages/pandas/io/excel/_xlrd.py", line 63, in load_workbook return open_workbook(file_contents=data, **engine_kwargs) File "/ragflow/.venv/lib/python3.10/site-packages/xlrd/__init__.py", line 172, in open_workbook bk = open_workbook_xls( File "/ragflow/.venv/lib/python3.10/site-packages/xlrd/book.py", line 68, in open_workbook_xls bk.biff2_8_load( File "/ragflow/.venv/lib/python3.10/site-packages/xlrd/book.py", line 641, in biff2_8_load cd.locate_named_stream(UNICODE_LITERAL(qname)) File "/ragflow/.venv/lib/python3.10/site-packages/xlrd/compdoc.py", line 398, in locate_named_stream result = self._locate_stream( File "/ragflow/.venv/lib/python3.10/site-packages/xlrd/compdoc.py", line 429, in _locate_stream raise CompDocError("%s corruption: seen[%d] == %d" % (qname, s, self.seen[s])) xlrd.compdoc.CompDocError: Workbook corruption: seen[2] == 4 ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 2 Monaten
Yongteng Lei	83771e500c	Refa: migrate chat models to LiteLLM (#9394) ### What problem does this PR solve? All models pass the mock response tests, which means that if a model can return the correct response, everything should work as expected. However, not all models have been fully tested in a real environment, the real API_KEY. I suggest actively monitoring the refactored models over the coming period to ensure they work correctly and fixing them step by step, or waiting to merge until most have been tested in practical environment. ### Type of change - [x] Refactoring	vor 2 Monaten
Liu An	b9eeb8e64f	Docs: Update version references to v0.20.1 in READMEs and docs (#9335) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.20.0 to v0.20.1 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update	vor 2 Monaten
Liu An	95534f5cf2	Docs: Update version references to v0.20.0 in READMEs and docs (#9164) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.19.1 to v0.20.0 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update	vor 3 Monaten
Kevin Hu	d9fe279dde	Feat: Redesign and refactor agent module (#9113) ### What problem does this PR solve? #9082 #6365 <u> WARNING: it's not compatible with the older version of `Agent` module, which means that `Agent` from older versions can not work anymore.</u> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 3 Monaten
Zhichang Yu	ad177951e9	Bump to infinity v0.6.0-dev4 (#9013) ### What problem does this PR solve? Bump to infinity v0.6.0-dev4. WARNNING: infinity v0.6.0-dev4 has very different meta data format with older versions. You have to destroy infinity data volume are restart infinity container if there's existing data. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 3 Monaten
Song Fuchang	fd7ac17605	Feat: Scratch MCP tool calling support. (#8263) ### What problem does this PR solve? This is a cherry-pick from #7781 as requested. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	vor 4 Monaten
Yongteng Lei	03656da4dd	Refa: upgrade MCP SDK to v1.9.4 (#8421) ### What problem does this PR solve? Upgrade MCP SDK to v1.9.4 (latest). ### Type of change - [x] Refactoring	vor 4 Monaten
Liu An	7e87eb2e23	Docs: Update version references to v0.19.1 in READMEs and docs (#8366) ### What problem does this PR solve? - Update Docker image version badges and references from v0.19.0 to v0.19.1 - Modify version mentions in all localized README files (id, ja, ko, pt_br, tzh, zh) - Update version in docker/README.md and related documentation files - Includes updates to Helm values and Python SDK dependencies ### Type of change - [x] Documentation Update	vor 4 Monaten
africa-worker	44287fb05f	Oss support opendal(including mysql) (#8204) ### What problem does this PR solve? #8074 Oss support opendal(including mysql) ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	vor 4 Monaten
Liu An	e702431fcb	Feat: sync test group to top pyproject.toml (#8015) ### What problem does this PR solve? sync test group from sdk/python/pyproject.toml to top pyproject.toml ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 5 Monaten
He Wang	aaefc3f44c	update xgboost and dep scripts for local build on MacOS (#7857) ### What problem does this PR solve? There are two main changes: 1. Update xgboost to 1.6.0 to build the project on MacOS with Apple chips, this change refers to the issue: https://github.com/infiniflow/ragflow/issues/5114. 2. When `use_china_mirrors` is set in `download_deps.py`, the names of chrome files downloaded by the script will be different from the file names used in Dockerfile, so I added the file name in `get_urls` function to solve this problem. I think it's better to add testing for Docker image `infiniflow/ragflow_deps` to the test workflow, but since the workflow is currently running on a self-hosted runner, I'm not sure how to modify it. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	vor 5 Monaten
liu an	590b9dabab	Docs: update for v0.19.0 (#7823) ### What problem does this PR solve? update for v0.19.0 ### Type of change - [x] Documentation Update	vor 5 Monaten
Song Fuchang	a1f06a4fdc	Feat: Support tool calling in Generate component (#7572) ### What problem does this PR solve? Hello, our use case requires LLM agent to invoke some tools, so I made a simple implementation here. This PR does two things: 1. A simple plugin mechanism based on `pluginlib`: This mechanism lives in the `plugin` directory. It will only load plugins from `plugin/embedded_plugins` for now. A sample plugin `bad_calculator.py` is placed in `plugin/embedded_plugins/llm_tools`, it accepts two numbers `a` and `b`, then give a wrong result `a + b + 100`. In the future, it can load plugins from external location with little code change. Plugins are divided into different types. The only plugin type supported in this PR is `llm_tools`, which must implement the `LLMToolPlugin` class in the `plugin/llm_tool_plugin.py`. More plugin types can be added in the future. 2. A tool selector in the `Generate` component: Added a tool selector to select one or more tools for LLM: ![image](https://github.com/user-attachments/assets/74a21fdf-9333-4175-991b-43df6524c5dc) And with the `bad_calculator` tool, it results this with the `qwen-max` model: ![image](https://github.com/user-attachments/assets/93aff9c4-8550-414a-90a2-1a15a5249d94) ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	vor 5 Monaten
pyyuhao	c8c3b756b0	Feat: Adds OpenSearch2.19.1 as the vector_database support (#7140) ### What problem does this PR solve? This PR adds the support for latest OpenSearch2.19.1 as the store engine & search engine option for RAGFlow. ### Main Benefit 1. OpenSearch2.19.1 is licensed under the [Apache v2.0 License] which is much better than Elasticsearch 2. For search, OpenSearch2.19.1 supports full-text search、vector_search、hybrid_search those are similar with Elasticsearch on schema 3. For store, OpenSearch2.19.1 stores text、vector those are quite simliar with Elasticsearch on schema ### Changes - Support opensearch_python_connetor. I make a lot of adaptions since the schema and api/method between ES and Opensearch differs in many ways(especially the knn_search has a significant gap) : rag/utils/opensearch_coon.py - Support static config adaptions by changing: conf/service_conf.yaml、api/settings.py、rag/settings.py - Supprt some store&search schema changes between OpenSearch and ES: conf/os_mapping.json - Support OpenSearch python sdk : pyproject.toml - Support docker config for OpenSearch2.19.1 : docker/.env、docker/docker-compose-base.yml、docker/service_conf.yaml.template ### How to use - I didn't change the priority that ES as the default doc/search engine. Only if in docker/.env , we set DOC_ENGINE=${DOC_ENGINE:-opensearch}, it will work. ### Others Our team tested a lot of docs in our environment by using OpenSearch as the vector database ,it works very well. All the conifg for OpenSearch is necessary. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Yongteng Lei <yongtengrey@outlook.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com> Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	vor 6 Monaten
liu an	03672df691	Docs: update for v0.18.0 (#7223) ### What problem does this PR solve? update for v0.18.0 ### Type of change - [x] Documentation Update	vor 6 Monaten
Yongteng Lei	68b9dae6c0	Feat: mcp server (#7084) ### What problem does this PR solve? Add MCP support with a client example. Issue link: #4344 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 6 Monaten
Zhichang Yu	6bf26e2a81	Optimize graphrag again (#6513) ### What problem does this PR solve? Removed set_entity and set_relation to avoid accessing doc engine during graph computation. Introduced GraphChange to avoid writing unchanged chunks. ### Type of change - [x] Performance Improvement	vor 7 Monaten
Yongteng Lei	85eb367ede	Feat: add basic Langfuse support for LLM module (#6443) ### What problem does this PR solve? #6155 Add basic Langfuse support for LLM module. A trace example: <img width="755" alt="image" src="https://github.com/user-attachments/assets/25c1f852-5116-486c-a47f-6097187142ca" /> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 7 Monaten
Kevin Hu	4df4bf68a2	DOCS: for release. (#6023) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	vor 7 Monaten
Kevin Hu	d44739283c	Docs: prepare docs for release v0.17.1 (#5900) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	vor 7 Monaten
Zhichang Yu	c813c1ff4c	Made task_executor async to speedup parsing (#5530) ### What problem does this PR solve? Made task_executor async to speedup parsing ### Type of change - [x] Performance Improvement	vor 8 Monaten
Kevin Hu	d6836444c9	DOC: for release. (#5472) ### What problem does this PR solve? ### Type of change - [x] Documentation Update --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	vor 8 Monaten
yihong	37aacb3960	Refa: drop useless fasttext (#5470) ### What problem does this PR solve? This patch drop useless fastext which is seems useless in the code base and its very kind of hard install should close #4498 ### Type of change - [x] Refactoring Signed-off-by: yihong0618 <zouzou0208@gmail.com>	vor 8 Monaten
Kevin Hu	4f40f685d9	Code refactor (#5371) ### What problem does this PR solve? #5173 ### Type of change - [x] Refactoring	vor 8 Monaten
Zhichang Yu	ffb4cda475	Run keyword_extraction, question_proposal, content_tagging in thread pool (#5376) ### What problem does this PR solve? Run keyword_extraction, question_proposal, content_tagging in threads ### Type of change - [x] Performance Improvement	vor 8 Monaten
Kevin Hu	53b9e7b52f	Add tavily as web searh tool. (#5349) ### What problem does this PR solve? #5198 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 8 Monaten
Zhichang Yu	eb72d598b1	Replaced pypi.tuna.tsinghua.edu.cn with mirrors.aliyun.com/pypi (#5309) ### What problem does this PR solve? Replaced pypi.tuna.tsinghua.edu.cn with mirrors.aliyun.com/pypi. I notice aliyun.com sometimes is much faster than tsinghua.edu. ### Type of change - [x] Refactoring	vor 8 Monaten
Kevin Hu	fe9e9a644f	Preparation for release. (#4739) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	vor 8 Monaten
Zhichang Yu	3411d0a2ce	Added cuda_is_available (#4725) ### What problem does this PR solve? Added cuda_is_available ### Type of change - [x] Refactoring	vor 8 Monaten
Zhichang Yu	e1526846da	Fixed GPU detection on CPU only environment (#4711) ### What problem does this PR solve? Fixed GPU detection on CPU only environment. Close #4692 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 8 Monaten
Zhichang Yu	c4b1c4e6f4	Fix onnxruntime-gpu marks (#4643) ### What problem does this PR solve? Fix onnxruntime-gpu marks ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 9 Monaten
Zhichang Yu	3c2c8942d5	Removed onnxruntime (#4632) ### What problem does this PR solve? Removed onnxruntime. It conflicts with the onnxruntime-gpu. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 9 Monaten
Zhichang Yu	8b49734241	Added onnxruntime-gpu (#4631) ### What problem does this PR solve? Added onnxruntime-gpu ### Type of change - [x] Refactoring	vor 9 Monaten
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	vor 9 Monaten
Zhichang Yu	2962284c79	Bump akshare (#4536) ### What problem does this PR solve? Bump akshare. Close #4525 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 9 Monaten
Zhichang Yu	57b4e0c464	Bump infinity to v0.6.0-dev2 (#4497) ### What problem does this PR solve? Bump infinity to v0.6.0-dev2. Close #4477 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	vor 9 Monaten
Zhichang Yu	d3c07794b5	Replace poetry with uv (#4471) ### What problem does this PR solve? Replace poetry with uv ### Type of change - [x] Refactoring	vor 9 Monaten

43 Commits (d55f44601a007532a62a909ccdb6d1520ce31861)