Você não pode selecionar mais de 25 tópicos Os tópicos devem começar com uma letra ou um número, podem incluir traços ('-') e podem ter até 35 caracteres.

xinference.md 2.0KB

12345678910111213141516171819202122232425262728293031323334353637383940414243
  1. # Xinference
  2. <div align="center" style="margin-top:20px;margin-bottom:20px;">
  3. <img src="https://github.com/infiniflow/ragflow/assets/12318111/2c5e86a7-807b-4d29-bd2b-f73fb1018866" width="130"/>
  4. </div>
  5. Xorbits Inference([Xinference](https://github.com/xorbitsai/inference)) empowers you to unleash the full potential of cutting-edge AI models.
  6. ## Install
  7. - [pip install "xinference[all]"](https://inference.readthedocs.io/en/latest/getting_started/installation.html)
  8. - [Docker](https://inference.readthedocs.io/en/latest/getting_started/using_docker_image.html)
  9. To start a local instance of Xinference, run the following command:
  10. ```bash
  11. $ xinference-local --host 0.0.0.0 --port 9997
  12. ```
  13. ## Launch Xinference
  14. Decide which LLM you want to deploy ([here's a list for supported LLM](https://inference.readthedocs.io/en/latest/models/builtin/)), say, **mistral**.
  15. Execute the following command to launch the model, remember to replace ${quantization} with your chosen quantization method from the options listed above:
  16. ```bash
  17. $ xinference launch -u mistral --model-name mistral-v0.1 --size-in-billions 7 --model-format pytorch --quantization ${quantization}
  18. ```
  19. ## Use Xinference in RAGFlow
  20. - Go to 'Settings > Model Providers > Models to be added > Xinference'.
  21. <div align="center" style="margin-top:20px;margin-bottom:20px;">
  22. <img src="https://github.com/infiniflow/ragflow/assets/12318111/bcbf4d7a-ade6-44c7-ad5f-0a92c8a73789" width="1300"/>
  23. </div>
  24. > Base URL: Enter the base URL where the Xinference service is accessible, like, `http://<your-xinference-endpoint-domain>:9997/v1`.
  25. - Use Xinference Models.
  26. <div align="center" style="margin-top:20px;margin-bottom:20px;">
  27. <img src="https://github.com/infiniflow/ragflow/assets/12318111/b01fcb6f-47c9-4777-82e0-f1e947ed615a" width="530"/>
  28. </div>
  29. <div align="center" style="margin-top:20px;margin-bottom:20px;">
  30. <img src="https://github.com/infiniflow/ragflow/assets/12318111/1763dcd1-044f-438d-badd-9729f5b3a144" width="530"/>
  31. </div>