767 Commits (2a11b2c3315c182d35b0679af06ce602d9d34bc0)

Author SHA1 Message Date
  Stephen Hu 19419281c3
Fix: Change Ollama Embedding Keep Alive (#8734) 3 months ago
  Stephen Hu 00c954755e
Fix:use the same logic to handle pos in tokenize_chunks_with_images (#8732) 3 months ago
  Stephen Hu 8af0d04ad0
Refactor:Improve the logic in search.py (#8716) 3 months ago
  Stephen Hu e60ec0a31b
Fix:disallowed special token while embedding (#8692) 3 months ago
  6607changchun 9580e99650
fix: retry embedding with Qwen family models when limits temporarily reached. (#8690) 3 months ago
  Kevin Hu 1e6bda735a
Fix: add ES re-connect once request timeout. (#8678) 3 months ago
  kwrobel.eth 8a3b5d1d76
Fix a small typo in count of used fragments (#8673) 4 months ago
  Yongteng Lei a306a6f158
Refa: refactor prompts into markdown-style structure using Jinja2 (#8667) 4 months ago
  Stephen Hu d5f6335f99
Fix: The data set created by API call failed to parse after uploading the file. (#8657) 4 months ago
  Yongteng Lei f8a6987f1e
Refa: automatic LLMs registration (#8651) 4 months ago
  Yongteng Lei 62b63acbb5
Refa: more robust mcp tool call (#8631) 4 months ago
  Kevin Hu fffb7c0bba
Fix: anthropic llm issue. (#8633) 4 months ago
  He Wang 898da23caa
make dirs with 'exist_ok=True' (#8629) 4 months ago
  He Wang 695bfe34a2
fix opendal config 'oss_table' and 'max_allowed_packet' (#8611) 4 months ago
  Tuan Le d343cb4deb
Add Google Cloud Vision API Integration (Image2Text) (#8608) 4 months ago
  wenxuan.zhang f586dd0a96
Fix: docx parse error. (#8600) 4 months ago
  Tuan Le 1c77b4ed9b
fix: Correctly format message parts in GoogleChat (#8596) 4 months ago
  Kevin Hu e3edcc3064
Trivals. (#8597) 4 months ago
  symvation 32f8b3ad77
Fix: the output log is incorrect (#8577) 4 months ago
  Yongteng Lei 8801de2772
Refa: change mcp_client module to rag/utils/conn (#8578) 4 months ago
  Kevin Hu d46c24045f
Feat: add GiteeAI as a llm provider. (#8572) 4 months ago
  Kevin Hu aafeffa292
Feat: add gitee as LLM provider. (#8545) 4 months ago
  Kevin Hu e441c17c2c
Refa: limit embedding concurrency and fix `chat_with_tool` (#8543) 4 months ago
  Kevin Hu a10f05f4d7
Fix: chat with tools bug. (#8528) 4 months ago
  Tuan Le 303c6dd1a8
Fix memory leaks in PIL image and BytesIO handling during chunk processing (#8522) 4 months ago
  Stephen Hu be712714af
Refactor:improve the logic to check cancel (#8524) 4 months ago
  Kevin Hu 6d256ff0f5
Perf: ignore concate between rows. (#8507) 4 months ago
  Tuan Le 6b1221d2f6
Fix parser_config access for layout_recognize in presentation.py (#8492) 4 months ago
  Rainman 340354b79c
fix the error 'Unknown field for GenerationConfig: max_tokens' when u… (#8473) 4 months ago
  Yongteng Lei b705ff08fe
Refa: improve GraphRAG similarity sensitivity to numeric differences (#8479) 4 months ago
  liuzhenghua 5256980ffb
Fix: Solve the OOM issue when passing large PDF files while using QA chunking method. (#8464) 4 months ago
  Stephen Hu 8d9d2cc0a9
Fix: some cases Task return but not set progress (#8469) 4 months ago
  HaiyangP d6a941ebf5
Fix the bug of long type value overflow (#8313) 4 months ago
  WuWeiFlow bc1b837616
FIX:Saving an RGBA image directly as JPEG will cause an error. If the… (#8399) 4 months ago
  Rainman 49d67cbcb7
fix a bug when using huggingface embedding api (#8432) 4 months ago
  Song Fuchang fd7ac17605
Feat: Scratch MCP tool calling support. (#8263) 4 months ago
  Liu An 244d8a47b9
Fix: AzureChat model code (#8426) 4 months ago
  kira-offgrid f0e0783618
Fix: Database Query Vulnerable to Injection Attacks in rag/utils/opendal_conn.py (#8408) 4 months ago
  Kevin Hu d4e6e2bd21
Fix: doc_aggs issue. (#8418) 4 months ago
  Kevin Hu 83e23f1e8a
Fix: rank feature score should be greater than 0. (#8416) 4 months ago
  Stephen Hu 794a4102c2
Fix: Document parse via API will alot problen (#8407) 4 months ago
  Stephen Hu ef5e7d8c44
Fix:embedding_model class SILICONFLOWEmbed(Base)Function reusing json (#8378) 4 months ago
  changqingla 4784aa5b0b
fix: List Chunks API fails to return the correct document status. (#8347) 4 months ago
  Kevin Hu 8f3fe63d73
Fix: duplicated task (#8358) 4 months ago
  Stephen Hu 35034fed73
Fix: Raptor: [Bug]: **ERROR**: Unknown field for GenerationConfig: max_tokens (#8331) 4 months ago
  Jin Hai 4a2ff633e0
Fix typo in code (#8327) 4 months ago
  cutiechi 8f9bcb1c74
Feat: make document parsing and embedding batch sizes configurable via environment variables (#8266) 4 months ago
  Kevin Hu b1117a8717
Fix: base url issue. (#8281) 4 months ago
  cutiechi dabbc852c8
Fix: opendal storage health attribute not found & remove duplicate operator scheme initialization (#8265) 4 months ago
  Yongteng Lei 8f9e7a6f6f
Refa: revert to original task message collection logic (#8251) 4 months ago