accelerate_doc_indexing.mdx 1.0KB

---
sidebar_position: 9
slug: /accelerate_doc_indexing
---

# Accelerate indexing

import APITable from '@site/src/components/APITable';

A checklist to speed up document parsing and indexing.

---

Some settings can add significant time to document parsing. If parsing is often slow, work through this checklist:

- Use a GPU to reduce embedding time.
- On the configuration page of your knowledge base, switch off **Use RAPTOR to enhance retrieval**.
- Skip knowledge graph extraction (GraphRAG) unless you need it, as it is time-consuming.
- Disable **Auto-keyword** and **Auto-question** on the configuration page of your knowledge base, as both depend on the LLM.
- **v0.17.1:** If your document is a plain-text PDF and does not require GPU-intensive processes like OCR (Optical Character Recognition), TSR (Table Structure Recognition), or DLA (Document Layout Analysis), you can choose **Naive** over **DeepDoc** or other time-consuming large model options in the **Document parser** dropdown. This will substantially reduce document parsing time.
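
Before relying on the GPU item in the checklist above, it can help to confirm that the host actually exposes a GPU. The following is a minimal sketch, not part of this project: it assumes an NVIDIA GPU whose driver installs the `nvidia-smi` tool on the `PATH`, and the helper name `nvidia_gpu_names` is hypothetical.

```python
import shutil
import subprocess


def nvidia_gpu_names() -> list[str]:
    """Return the GPU names reported by nvidia-smi, or [] if none are found."""
    # Assumption: nvidia-smi ships with the NVIDIA driver. If it is missing,
    # treat the host as having no usable NVIDIA GPU.
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            timeout=10,
            check=True,
        )
    except (subprocess.SubprocessError, OSError):
        # The tool exists but failed (for example, no driver is loaded).
        return []
    # One GPU name per non-empty output line.
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


if __name__ == "__main__":
    gpus = nvidia_gpu_names()
    if gpus:
        print("GPU(s) detected:", ", ".join(gpus))
    else:
        print("No NVIDIA GPU detected; embedding would likely run on CPU.")
```

If the script reports no GPU, the remaining checklist items (disabling RAPTOR, GraphRAG, Auto-keyword, and Auto-question, or choosing a lighter document parser) are likely the more effective levers. When the service runs in a container, run the check inside that container, since a GPU visible on the host is not necessarily exposed to it.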