浏览代码

set DLA active for KG (#3386)

### What problem does this PR solve?

### Type of change


- [x] Refactoring
tags/v0.14.0
Kevin Hu 11 个月前
父节点
当前提交
83c6b1f308
没有帐户链接到提交者的电子邮件
共有 2 个文件被更改,包括 3 次插入3 次删除
  1. 1
    1
      api/apps/document_app.py
  2. 2
    2
      rag/app/knowledge_graph.py

+ 1
- 1
api/apps/document_app.py 查看文件

options.add_argument('--disable-dev-shm-usage') options.add_argument('--disable-dev-shm-usage')
driver = Chrome(options=options) driver = Chrome(options=options)
driver.get(url) driver.get(url)
sections = RAGFlowHtmlParser()(driver.page_source)
sections = RAGFlowHtmlParser()("", binary=driver.page_source)
return get_json_result(data="\n".join(sections)) return get_json_result(data="\n".join(sections))


if 'file' not in request.files: if 'file' not in request.files:

+ 2
- 2
rag/app/knowledge_graph.py 查看文件

lang="Chinese", callback=None, **kwargs): lang="Chinese", callback=None, **kwargs):
parser_config = kwargs.get( parser_config = kwargs.get(
"parser_config", { "parser_config", {
"chunk_token_num": 512, "delimiter": "\n!?。;!?", "layout_recognize": False})
"chunk_token_num": 512, "delimiter": "\n!?。;!?", "layout_recognize": True})
eng = lang.lower() == "english" eng = lang.lower() == "english"


parser_config["layout_recognize"] = False
parser_config["layout_recognize"] = True
sections = naive.chunk(filename, binary, from_page=from_page, to_page=to_page, section_only=True, sections = naive.chunk(filename, binary, from_page=from_page, to_page=to_page, section_only=True,
parser_config=parser_config, callback=callback) parser_config=parser_config, callback=callback)
chunks = build_knowledge_graph_chunks(tenant_id, sections, callback, chunks = build_knowledge_graph_chunks(tenant_id, sections, callback,

正在加载...
取消
保存