浏览代码

Catch the exception while parsing pptx. (#4202)

### What problem does this PR solve?
#4189

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
tags/v0.15.1
Kevin Hu 10 个月前
父节点
当前提交
76cd23eecf
没有帐户链接到提交者的电子邮件
共有 1 个文件被更改,包括 7 次插入4 次删除
  1. 7
    4
      deepdoc/parser/ppt_parser.py

+ 7
- 4
deepdoc/parser/ppt_parser.py 查看文件

# See the License for the specific language governing permissions and # See the License for the specific language governing permissions and
# limitations under the License. # limitations under the License.
# #
import logging
from io import BytesIO from io import BytesIO
from pptx import Presentation from pptx import Presentation


texts = [] texts = []
for shape in sorted( for shape in sorted(
slide.shapes, key=lambda x: ((x.top if x.top is not None else 0) // 10, x.left)): slide.shapes, key=lambda x: ((x.top if x.top is not None else 0) // 10, x.left)):
txt = self.__extract(shape)
if txt:
texts.append(txt)
try:
txt = self.__extract(shape)
if txt:
texts.append(txt)
except Exception as e:
logging.exception(e)
txts.append("\n".join(texts)) txts.append("\n".join(texts))


return txts return txts

正在加载...
取消
保存