pypdf2 to pypdf (#1684)

### What problem does this PR solve?

Switches the PDF reader import from PyPDF2 to pypdf to address a known issue:
pypdf and PyPDF2 can enter an infinite loop when a comment isn't followed by
a character (#59).

### Type of change

- [x] Refactoring
tags/v0.9.0
Kevin Hu 1 year ago
parent
commit
100b3165d8
4 changed files with 4 additions and 1 deletion
  1. deepdoc/parser/pdf_parser.py (+1, -1)
  2. requirements.txt (+1, -0)
  3. requirements_arm.txt (+1, -0)
  4. requirements_dev.txt (+1, -0)

deepdoc/parser/pdf_parser.py (+1, -1)

@@ -23,7 +23,7 @@ import logging
from PIL import Image, ImageDraw
import numpy as np
from timeit import default_timer as timer
-from PyPDF2 import PdfReader as pdf2_read
+from pypdf import PdfReader as pdf2_read

from api.utils.file_utils import get_project_base_directory
from deepdoc.vision import OCR, Recognizer, LayoutRecognizer, TableStructureRecognizer

requirements.txt (+1, -0)

@@ -79,3 +79,4 @@ word2number==1.1
xgboost==2.1.0
xpinyin==0.7.6
zhipuai==2.0.1
+pypdf==4.3.0

requirements_arm.txt (+1, -0)

@@ -153,3 +153,4 @@ groq==0.9.0
wikipedia==1.4.0
Bio==1.7.1
arxiv==2.1.3
+pypdf==4.3.0

requirements_dev.txt (+1, -0)

@@ -138,3 +138,4 @@ groq==0.9.0
wikipedia==1.4.0
Bio==1.7.1
arxiv==2.1.3
+pypdf==4.3.0
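
Since all three requirements files gain the same exact pin, a reviewer may want to confirm that a given environment actually picked it up. The following is a minimal sketch, assuming exact `==` pins like the ones added above; `parse_pin` and `installed_matches` are hypothetical helper names for illustration and are not part of this repository.

```python
from importlib import metadata
from typing import Optional, Tuple


def parse_pin(line: str) -> Optional[Tuple[str, str]]:
    """Split an exact pin such as 'pypdf==4.3.0' into (name, version).

    Returns None for blank lines, comments, and anything that is not an
    exact '==' pin.
    """
    line = line.split("#", 1)[0].strip()  # drop trailing comments
    if "==" not in line:
        return None
    name, version = line.split("==", 1)
    return name.strip(), version.strip()


def installed_matches(line: str) -> Optional[bool]:
    """Compare a pinned requirement line against the installed package.

    Returns True or False when the package is installed, and None when the
    line is not an exact pin or the package is not installed at all.
    """
    pin = parse_pin(line)
    if pin is None:
        return None
    name, version = pin
    try:
        return metadata.version(name) == version
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # The pin added by this commit; prints True, False, or None
    # depending on the local environment.
    print(installed_matches("pypdf==4.3.0"))
```

The check only uses the standard library (`importlib.metadata`), so it can run before pypdf itself is imported. It does not handle version ranges, extras, or environment markers.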
