znn
ed263aed9f
fix text splitter (#22596)
3 月之前
-LAN-
460a825ef1
refactor: decouple Node and NodeData (#22581)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
3 月之前
helojo
e7d80bf7bf
Fix: the pict type picture was not processed in the docx (#19305)
Co-authored-by: zqgame <zqgame@zqgame.local>
3 月之前
yihong
d2933c2bfe
fix: drop dead code phase2 unused class (#22042)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
3 月之前
wanttobeamaster
bf7b2c339b
tablestore vector support more method (#22225)
Co-authored-by: xiaozhiqing.xzq <xiaozhiqing.xzq@alibaba-inc.com>
3 月之前
Jacky Wu
3e96c0c468
fix: close session before doing long latency operation (#22306)
3 月之前
luckylhb90
a371390d6c
optimize: batch embedding and qdrant write_consistency_factor parameter (#21776)
Co-authored-by: hobo.l <hobo.l@binance.com>
3 月之前
wlleiiwang
89b52471fb
Optimize the memory usage of Tencent Vector Database (#22079)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
3 月之前
baonudesifeizhai
1c7404099d
fix: prevent timeout in file encoding detection for large files (#21453)
Co-authored-by: crazywoola <427733928@qq.com>
4 月之前
efrey kong
826bf25abf
Fix: prevent SQL errors when metadata filter Constant value is None or blank (#21803)
4 月之前
Dongyu Li
00f0b569cc
Feat/kb index (#20868)
Co-authored-by: twwu <twwu@dify.ai>
4 月之前
Jin
3e7f8bad56
fix: markdown_extractor lost chunks if it starts without a header(#21308) (#21309)
4 月之前
LiuBo
17fe62cf91
feat: add support for Matrixone database (#20714)
4 月之前
NeatGuyCoding
9835730278
Translation fix (#21194)
4 月之前
NeatGuyCoding
2eae7503e1
Minor Improvements for File Validation and Configuration Handling #21179 (#21171)
Co-authored-by: tech <cto@sb>
4 月之前
Ademílson Tonato
9e73e8b9e8
feat: add search endpoint for Firecrawl Integration (#20521)
Co-authored-by: crazywoola <427733928@qq.com>
4 月之前
Rain Wang
47e0f92c0f
Fixes #20748 KnowledgeRetrievalNode return all external documents when reranker disabled even top-k configed (#20762)
4 月之前
kazuya-awano
45c89bd6de
feat: add pagenation to notion extractor (#20919)
4 月之前
kurokobo
4689e8953e
fix: shorten connection timeout to pypi.org for deprecation check for weaviate client (#21131)
4 月之前
Bowen Liang
366ddb05ae
test: run vdb test of oceanbase with docker compose in CI tests (#20945)
4 月之前
Bowen Liang
0f3d4d0b6e
chore: bump mypy to 1.16 (#20608)
4 月之前
QuantumGhost
c439e82038
refactor(api): Decouple `ParameterExtractorNode` from `LLMNode` (#20843)
- Extract methods used by `ParameterExtractorNode` from `LLMNode` into a separate file.
- Convert `ParameterExtractorNode` into a subclass of `BaseNode`.
- Refactor code referencing the extracted methods to ensure functionality and clarity.
- Fixes the issue that `ParameterExtractorNode` returns error when executed.
- Fix relevant test cases.
Closes #20840 .
4 月之前
yihong
65c7c01d90
fix: clean up two unreachable code (#20773)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
4 月之前
jefferyvvv
37c3283450
fix: opensearch vector search falls back to keyword search (#20723)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
5 月之前
jefferyvvv
4271602cfc
fix: opensearch metadata filtering returns empty (#20701)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
Co-authored-by: crazywoola <427733928@qq.com>
5 月之前
jefferyvvv
138ad6e8b3
fix: opensearch fulltext search with metadata filtering dsl error (#20702)
Co-authored-by: wenjun.gu <wenjun.gu@envision-energy.com>
5 月之前
kenwoodjw
01d500db14
fix: autocorrect everything in web (#20605)
Signed-off-by: kenwoodjw <blackxin55+@gmail.com>
5 月之前
zhaobingshuang
3f7aa38d77
fix: #20560 When elasticsearch is used as the vector database, the Retrieval Test fails to filter the data after setting the Score Threshold, and the score of the recalled results is empty (#20561)
5 月之前
Cheney Zhang
b4b59148dc
check zilliz cloud of full-text search (#20519)
5 月之前
Dongyu Li
1ea4459d9f
update knowledge base api (#20426)
5 月之前
-LAN-
a6ea15e63c
Refactor/message cycle manage and knowledge retrieval (#20460)
Signed-off-by: -LAN- <laipz8200@outlook.com>
5 月之前
yihong
5a991295e0
fix: drop some type fixme (#20344)
5 月之前
-LAN-
482e50aae9
Refactor/remove db from cycle manager (#20455)
Signed-off-by: -LAN- <laipz8200@outlook.com>
5 月之前
rouxiaomin
4c4887c5fc
feat(qdrant):add replication_factor when create collection in qdrant (#20133)
Co-authored-by: 刘敏 <min.liu@tongdun.net>
5 月之前
He Huang
6f48af2610
Refactor OpenSearch config to separate use_ssl and verify_certs flags (#20075)
Co-authored-by: he.huang <he.huang1@outlook.com>
Co-authored-by: crazywoola <427733928@qq.com>
5 月之前
wlleiiwang
7d230acf40
tencent vectordb compatible with version 1.1.3 and below (#20056)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
5 月之前
-LAN-
3196dc2d61
refactor: Use typed SQLAlchemy base model and fix type errors (#19980)
Signed-off-by: -LAN- <laipz8200@outlook.com>
5 月之前
Darlewo
8cb3b4aef2
fix: multiple retrieve reranking_enabled switch (#19958)
5 月之前
Amir Mohsen Asaran
c9ee60e197
Feat(WaterCrawl error handling): add custom exceptions and error handling (#19948)
5 月之前
-LAN-
4977bb21ec
feat(workflow): domain model for workflow node execution (#19430)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
5 月之前
k-kayashima
b292990075
Fix: Ensure unique index names for pgvector knowledge tables (#19672)
Co-authored-by: crazywoola <427733928@qq.com>
5 月之前
非法操作
085bd1aa93
chore: model.query change to db.session.query (#19551)
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
5 月之前
非法操作
14cd71ed0a
chore: all model.query replace to db.session.query (#19521)
5 月之前
非法操作
b00f94df64
fix: replace all dataset.Model.query to db.session.query(Model) (#19509)
5 月之前
湛露先生
1119790b02
clean rag word_extractor. (#19397)
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
5 月之前
Will
bfa652f2d0
fix: metadata filtering condition variable unassigned; fix External K… (#19208)
6 月之前
Hao Cheng
9bcf837f17
fix: use only supported operators in metadata filter system prompts (#19195)
6 月之前
Will
a212a63e6a
fix: time type metadata filtering error (#19192)
6 月之前
Bowen Liang
12c96b93d9
immediately return initialed tiktokenizer instance and remove dead code in usage of tiktokenizer (#17957)
6 月之前
QuantumGhost
bd1bbfee4b
Enhance Code Consistency Across Repository with `.editorconfig` (#19023)
6 月之前