KevinHuSh
843720f958
fix bug in pdf parser (#986)
### What problem does this PR solve?
#963
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
0171082cc5
fix create dialog bug (#982)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
Zhedong Cen
8dd45459be
Add support for HTML file (#973)
### What problem does this PR solve?
Add support for HTML file
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
7eee193956
fix #917 #915 (#946)
### What problem does this PR solve?
#917
#915
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
46454362d7
fix raptor bugs (#928)
### What problem does this PR solve?
#922
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
dashi6174
fe18627ebc
Fix some syntax errors: `re` not imported (#904)
`re` was not imported
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
xinzhuang
3bbdf3b770
fix bug in computing the 'not concatenating' feature (#896)
### What problem does this PR solve?
When the PDF parser calls the `_naive_vertical_merge` method, one of the
"not concatenating" features is meant to be computed from the difference
between the `layoutno` of `b` and that of `b_`, but the code actually
compares `b` with `b`. I think this is a bug, so this PR fixes it.
Please check again.
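
A minimal sketch of the comparison described above, assuming each text box is a dict with an optional `layoutno` key. The function names and sample boxes are hypothetical and only illustrate the bug; they are not the parser's actual code.

```python
def layout_differs_buggy(b, b_):
    # Hypothetical reproduction of the bug: b is compared with itself,
    # so this feature is always False and never blocks a merge.
    return b.get("layoutno", 0) != b.get("layoutno", 0)


def layout_differs_fixed(b, b_):
    # Intended behaviour: compare the current box b with the next box b_.
    return b.get("layoutno", 0) != b_.get("layoutno", 0)


# Two sample boxes that belong to different layout regions (made-up data).
b = {"text": "first block", "layoutno": 1}
b_ = {"text": "second block", "layoutno": 2}

print(layout_differs_buggy(b, b_))
print(layout_differs_fixed(b, b_))
```

With the self-comparison, boxes from different layout regions are never flagged as "do not concatenate" by this feature; comparing `b` with `b_` restores the signal.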
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
a12fcf9156
fix minio health bug (#850)
### What problem does this PR solve?
#643
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
GYH
c27c02ea67
Split Excel file into different chunks (#847)
### What problem does this PR solve?
Split an Excel file into different chunks
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
99be226c7c
fix coordinate error (#686)
### What problem does this PR solve?
#683
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
7013d7f620
refine text decode (#657)
### What problem does this PR solve?
#651
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
cab274f560
remove PyMuPDF (#618)
### What problem does this PR solve?
#613
### Type of change
- [x] Other (please describe):
1 year ago
KevinHuSh
8c07992b6c
refine code (#595)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
d589b0f568
fix exception in pdf parser (#584)
### What problem does this PR solve?
#451
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
9d60a84958
refactor code (#583)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
66f8d35632
Refactor (#537)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
Dong Liu
2bc9a7cc18
Add Chinese readme for DeepDoc (#515)
### What problem does this PR solve?
Add Chinese explanation for deepdoc
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [x] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
1 year ago
KevinHuSh
0dfc8ddc0f
enlarge docker memory usage (#501)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
ed6081845a
Fit a lot of encodings for text file. (#458)
### What problem does this PR solve?
#384
### Type of change
- [x] Performance Improvement
1 year ago
KevinHuSh
962c66714e
fix divide by zero bug (#447)
### What problem does this PR solve?
#445
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
加帆
39f1feaccb
Bug fix pdf parse index out of range (#440)
### What problem does this PR solve?
Fix a bug that occurs when parsing some PDF files (#436)
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
0499a3f621
rm page number exception for pdf parser (#424)
### What problem does this PR solve?
#423
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
453c29170f
make sure the models will not be loaded twice (#422)
### What problem does this PR solve?
#381
### Type of change
- [x] Refactoring
1 year ago
jie yang
a7be5d4e8b
build ragflow image from scratch (#376)
### What problem does this PR solve?
issue: #205
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
36f2d7b797
Avoid assertion when there are no rows in Excel (#197)
### What problem does this PR solve?
Issue link: [#196](https://github.com/infiniflow/ragflow/issues/196)
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Breaking Change (fix or feature that could cause existing
functionality not to work as expected)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Test cases
- [ ] Python SDK impacted, Need to update PyPI
- [ ] Other (please describe):
1 year ago
KevinHuSh
38e5737067
add base url for OpenAI (#166)
1 year ago
KevinHuSh
a5384446e3
let's load model from local (#163)
1 year ago
KevinHuSh
fd7fcb5baf
apply PEP 8 formatting (#155)
1 year ago
KevinHuSh
979b3a5b4b
support snapshot download from local (#153)
* support snapshot download from local
* let snapshot download from local
1 year ago
KevinHuSh
da21320b88
fix plainPdf bugs (#152)
1 year ago
KevinHuSh
71fe314955
refine page ranges (#147)
1 year ago
KevinHuSh
f6aee7f230
add use layout or not option (#145)
* add use layout or not option
* trivial
1 year ago
KevinHuSh
6c6b144de2
refine manual parser (#140)
1 year ago
KevinHuSh
6999598101
refine for English corpus (#135)
1 year ago
KevinHuSh
9a843667b3
fix github account login issue (#132)
1 year ago
KevinHuSh
9da671b951
refine manual parser (#131)
1 year ago
KevinHuSh
675a9f8d9a
add dockerfile for cuda environment. Refine table search strategy (#123)
1 year ago
KevinHuSh
0feb085c88
refine table parser (#120)
1 year ago
KevinHuSh
f1f09df901
add local llm implementation (#119)
1 year ago
KevinHuSh
bcb58b7e71
layout refine (#115)
1 year ago
KevinHuSh
8f86ab9f7f
refine pdf parser, add time zone to userinfo (#112)
1 year ago
KevinHuSh
2d7c9080f4
deal with stop reason being length problem (#109)
1 year ago
KevinHuSh
b89ac3c4be
change task execution logic (#103)
1 year ago
KevinHuSh
602038ac49
fix task canceling bug (#98)
1 year ago
KevinHuSh
8a57f2afd5
change callback strategy, add timezone to docker (#96)
1 year ago
KevinHuSh
7bfaf0df29
fix position extraction bug (#93)
* fix position extraction bug
* remove delimiter for naive parser
1 year ago
KevinHuSh
685b4d8a95
fix table desc bugs, add positions to chunks (#91)
1 year ago
KevinHuSh
8a726fb04b
solve task execution issues (#90)
1 year ago
KevinHuSh
3d4315c42a
resolve the issue of naive parser (#87)
1 year ago
KevinHuSh
0429107e80
fix user login issue (#85)
1 year ago