Bläddra i källkod

Catch the exception while parsing pptx. (#4202)

### What problem does this PR solve?
#4189

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
tags/v0.15.1
Kevin Hu 10 månader sedan
förälder
incheckning
76cd23eecf
Inget konto är kopplat till bidragsgivarens mejladress
1 ändrade filer med 7 tillägg och 4 borttagningar
  1. 7
    4
      deepdoc/parser/ppt_parser.py

+ 7
- 4
deepdoc/parser/ppt_parser.py Visa fil

@@ -10,7 +10,7 @@
# See the License for the specific language governing permissions and
# limitations under the License.
#
import logging
from io import BytesIO
from pptx import Presentation

@@ -53,9 +53,12 @@ class RAGFlowPptParser(object):
texts = []
for shape in sorted(
slide.shapes, key=lambda x: ((x.top if x.top is not None else 0) // 10, x.left)):
txt = self.__extract(shape)
if txt:
texts.append(txt)
try:
txt = self.__extract(shape)
if txt:
texts.append(txt)
except Exception as e:
logging.exception(e)
txts.append("\n".join(texts))

return txts

Laddar…
Avbryt
Spara