226 Commits (38e155d8196acecac7e204ce36704920798344e1)

Author SHA1 Message Date
  Bowen Liang 924b4fe742
test: run vdb tests on TiDB Vector with docker in CI tests (#11645) 10 months ago
  yihong 22258fb0bf
fix: filter bug for keyword cause code can not reach (#11666) 10 months ago
  yihong 36cb25b341
fix: support mdx files close #11557 (#11565) 11 months ago
  Jiang 0d04cdc323
Lindorm vdb (#11574) 11 months ago
  Jyong 9b7adcd4d9
update tidb batch get endpoint to basic mode (#11426) 11 months ago
  Jyong d7c1f43b49
fix tidb full-text-search vector missed (#11337) 11 months ago
  Jyong c58d2fce89
roll back rerank topn setting (#11297) 11 months ago
  yihong e686f12317
fix: better handle error (#11265) 11 months ago
  -LAN- 9601102885
fix(word_extractor): Fix type error and remove stream in ssrf_proxy (#11241) 11 months ago
  Cling_o3 f9c2aa7689
feat: add retireval_top_n to config in env (#11132) 11 months ago
  kazuya-awano 2d6865d421
Ensure consistent float type for cached embedding return values (#10185) 11 months ago
  yihong d7160ee563
fix: typo in upstashVector if id is always true, also fix some type hint (#11183) 11 months ago
  -LAN- 9789905a1f
chore(*): Removes debugging print statements (#11145) 11 months ago
  Bowen Liang 6c8e208ef3
chore: bump minimum supported Python version to 3.11 (#10386) 11 months ago
  yihong ed55de888a
fix: rules should not be None for in (#10977) 11 months ago
  AkisAya cb0c55daa7
fix weight rerank of knowledge retrieval (#10931) 11 months ago
  yihong 58a9d9eb9a
fix: better WeightRerankRunner run logic use O(1) and delete unused code (#10849) 11 months ago
  Zane 14f3d44c37
refactor: improve handling of leading punctuation removal (#10761) 11 months ago
  8bitpd 873e9720e9
feat: AnalyticDB vector store supports invocation via SQL. (#10802) 11 months ago
  Bowen Liang 51db59622c
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 11 months ago
  Jyong 0b2d51d859
add the index field for elasticsearch (#10592) 1 year ago
  -LAN- a1543b7da0
fix(extractor): temporary file (#10543) 1 year ago
  Leo.Wang c9f785e00f
Feat/tools/gitlab (#10407) 1 year ago
  Bowen Liang 574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415) 1 year ago
  Jyong 1024fc623e
fix the ssrf of docx file extractor external images (#10237) 1 year ago
  Jiang 0c9e79cd67
Add Lindorm as a VDB choice (#10202) 1 year ago
  Shili Cao b61baa87ec
fix: avoid unexpected error when create knowledge base with baidu vector database and wenxin embedding model (#10130) 1 year ago
  Jyong dad041c49f
fix issue: query is none when doing retrieval (#10129) 1 year ago
  omr 11ca1bec0b
fix: optimize unique document filtering with set (#10082) 1 year ago
  zhuhao 7433095240
chore: use dify_config.TIDB_SPEND_LIMIT instead of constant value (#10038) 1 year ago
  Jyong 9ebd453b87
add rerank check when doing multi-retrieval (#9998) 1 year ago
  powerfool 878d13ef42
Added OceanBase as an option for the vector store in Dify (#10010) 1 year ago
  Jyong 5580bcf870
add tidb spend limit config (#9999) 1 year ago
  roadgoat19 c8ef9223e5
feat: couchbase integration (#6165) 1 year ago
  Jyong f47177ecb4
add top_k for es full text search (#9963) 1 year ago
  virgosoy 17cacf258e
fix: wrong element object (#9868) 1 year ago
  Jyong 18106a4fc6
add tidb on qdrant type (#9831) 1 year ago
  Zixuan Cheng 88dec6ef2b
Added description for .ppt, specify the reason for unstructured.io (#9452) 1 year ago
  Jyong 5f11fe521d
remove unstructured pdf extract (#9794) 1 year ago
  Jyong 3e9d271b52
nltk security issue and upgrade unstructured (#9558) 1 year ago
  ice yao ceb2c4f3ef
chore: reuse existing test functions with upstash vdb (#9679) 1 year ago
  Zven 8e7a752b2a
feat: add upstash as a new vector database provider (#9644) 1 year ago
  -LAN- 5f12c17355
fix(core): use CreatedByRole enum for role consistency (#9607) 1 year ago
  Bowen Liang 4d9160ca9f
refactor: use dify_config to replace legacy usage of flask app's config (#9089) 1 year ago
  -LAN- e61752bd3a
feat/enhance the multi-modal support (#8818) 1 year ago
  ice yao 2155bba5b0
fix: update mismatch vector type (#9462) 1 year ago
  zhuhao b90ad587c2
refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
  zhuhao 86594851cb
refactor: update the default values of top-k parameter in vdb to be consistent (#9367) 1 year ago
  Jyong 50635e9c15
Fix/economical knowledge retrieval (#9396) 1 year ago
  zhuhao cd7ab6231f
refactor: Add an enumeration type and use the factory pattern to obtain the corresponding class (#9356) 1 year ago