
fix the ssrf of docx file extractor external images (#10237)

tags/0.11.0
Jyong, 1 year ago
parent
commit
1024fc623e
1 file changed, 2 additions and 1 deletion

api/core/rag/extractor/word_extractor.py

 from docx import Document as DocxDocument

 from configs import dify_config
+from core.helper import ssrf_proxy
 from core.rag.extractor.extractor_base import BaseExtractor
 from core.rag.models.document import Document
 from extensions.ext_database import db

 image_count += 1
 if rel.is_external:
     url = rel.reltype
-    response = requests.get(url, stream=True)
+    response = ssrf_proxy.get(url, stream=True)
     if response.status_code == 200:
         image_ext = mimetypes.guess_extension(response.headers["Content-Type"])
         file_uuid = str(uuid.uuid4())
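
The change above routes the external-image download through `ssrf_proxy.get` instead of calling `requests.get` directly. The internals of `ssrf_proxy` are not shown in this diff, so the following is only a minimal sketch of one common SSRF guard, not the project's implementation: it rejects URLs whose host resolves to a loopback, private, or link-local address before any fetch happens. The names `is_public_http_url` and `resolve_host` are hypothetical and exist only for illustration.

```python
import ipaddress
import socket
from urllib.parse import urlparse


def resolve_host(host: str) -> list[str]:
    """Return every IP address the host name resolves to (performs a DNS lookup)."""
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def is_public_http_url(url: str, resolver=resolve_host) -> bool:
    """Hypothetical helper: True only for http(s) URLs whose host resolves
    exclusively to globally routable addresses."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        return False
    try:
        addresses = resolver(parsed.hostname)
        # is_global is False for loopback, private, link-local and reserved ranges
        return bool(addresses) and all(
            ipaddress.ip_address(addr).is_global for addr in addresses
        )
    except (OSError, ValueError):
        # Unresolvable hosts and unparsable addresses are rejected
        return False
```

A caller would check `is_public_http_url(url)` before downloading an image referenced by a document relationship. A check like this is still exposed to DNS rebinding between the check and the fetch, which is one reason to send such requests through a dedicated egress proxy instead.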
