Zhichang Yu
8d73cf6f02
Added time to progress message (#4185)
### What problem does this PR solve?
Added time to progress message
### Type of change
- [x] Refactoring
10 months ago
Jin Hai
8f070c3d56
Fix 'SCORE' not found bug (#4178)
### What problem does this PR solve?
As title
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Signed-off-by: jinhai <haijin.chn@gmail.com>
10 months ago
Zhichang Yu
1fdad50dac
Fix raptor (#4089)
### What problem does this PR solve?
Fix raptor
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
10 months ago
Kevin Hu
fddac1345d
Fix raptor reusable issue. (#4063)
### What problem does this PR solve?
#4045
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
10 months ago
Kevin Hu
ce1e855328
Upgrades Document Layout Analysis model. (#4054)
### What problem does this PR solve?
#4052
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
10 months ago
Kevin Hu
cb6e9ce164
Cache the result from llm for graphrag and raptor (#4051)
### What problem does this PR solve?
#4045
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
10 months ago
Kevin Hu
7fb67c4f67
Fix chunk number error after re-parsing. (#4043)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
10 months ago
Zhichang Yu
c8b1a564aa
Replaced md5 with xxhash64 for chunk id (#4009)
### What problem does this PR solve?
Replaced md5 with xxhash64 for chunk id
### Type of change
- [x] Refactoring
10 months ago
Zhichang Yu
301f95837c
Try to reuse existing chunks (#3983)
### What problem does this PR solve?
Try to reuse existing chunks. Close #3793
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
10 months ago
Zhichang Yu
0d68a6cd1b
Fix errors detected by Ruff (#3918)
### What problem does this PR solve?
Fix errors detected by Ruff
### Type of change
- [x] Refactoring
10 months ago
Kevin Hu
74b28ef1b0
Add pagerank to KB. (#3809)
### What problem does this PR solve?
#3794
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
11 months ago
Zhichang Yu
4ed5ca2666
handle_task catch all exception (#3441)
### What problem does this PR solve?
handle_task catch all exception
Report heartbeats
### Type of change
- [x] Refactoring
11 months ago
yqkcn
57237634f1
Refactoring large integers to improve readability (#2636)
### What problem does this PR solve?
Refactoring large integers
### Type of change
- [x] Refactoring
1 year ago
Fachuan Bai
8dd3adc443
Storage: Support the s3, azure blob as the object storage of ragflow. (#2278)
### What problem does this PR solve?
issue: https://github.com/infiniflow/ragflow/issues/2277
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
1 year ago
Kevin Hu
fc1ac3a962
fix delete message error (#2153)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
Kevin Hu
212bb8e601
add retry count to task (#2152)
### What problem does this PR solve?
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
Jin Hai
6b3a40be5c
Format file format from Windows/dos to Unix (#1949)
### What problem does this PR solve?
The related source files were in Windows/DOS format; they have been converted
to Unix format.
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
1 year ago
Kevin Hu
152072f900
Add graphrag (#1793)
### What problem does this PR solve?
#1594
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
2023fdc13e
fix file preview in file management (#1151)
### What problem does this PR solve?
fix file preview in file management
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
6f99bbbb08
add raptor (#899)
### What problem does this PR solve?
#882
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
d8c080ee52
fix bugs in searching file using keywords (#780)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
7ddb2f19be
make sure to raise exception if redis is not there (#674)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
8d6d7f6887
fix task loss issue (#665)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
a5aed2412f
fix bugs (#662)
### What problem does this PR solve?
Fix import error for task_service.py
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
Fakai Zhao
de839fc3f0
optimize srv broker and executor logic (#630)
### What problem does this PR solve?
Optimize the task broker and executor to reduce memory usage and deployment
complexity.
### Type of change
- [x] Performance Improvement
- [x] Refactoring
### Change Log
- Enhance Redis utils for the message queue (use streams)
- Modify the task broker logic to go through the message queue (1. get parse
events from the message queue; 2. run them asynchronously with ThreadPoolExecutor)
- Rename a table column of document and task (process_duation ->
process_duration; probably just a spelling mistake)
- Reformat some code style (only where noticed)
- Add requirement_dev.txt for developer
- Add redis container on docker compose
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
1 year ago
KevinHuSh
8c07992b6c
refine code (#595)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
944776f207
fix bug about fetching file from minio (#574)
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
1 year ago
KevinHuSh
66f8d35632
Refactor (#537)
### What problem does this PR solve?
### Type of change
- [x] Refactoring
1 year ago
KevinHuSh
ed6081845a
Fit a lot of encodings for text file. (#458)
### What problem does this PR solve?
#384
### Type of change
- [x] Performance Improvement
1 year ago
KevinHuSh
890561703b
Add bce-embedding and fastembed (#383)
### What problem does this PR solve?
Issue link:#326
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
1 year ago
KevinHuSh
0feb085c88
refine table parser (#120)
1 year ago
KevinHuSh
602038ac49
fix task canceling bug (#98)
1 year ago
KevinHuSh
8a726fb04b
solve task execution issues (#90)
1 year ago
KevinHuSh
7fd1eca582
init README of deepdoc, add picture processor. (#71)
* init README of deepdoc, add picture processor.
* add resume parsing
1 year ago
KevinHuSh
407b2523b6
remove unused code, separate layout detection out as a new API. Add new RAG method 'table' (#55)
1 year ago
KevinHuSh
e6acaf6738
Add Q&A and Book, fix task running bugs (#50)
1 year ago
KevinHuSh
6224edcd1b
Add task module, and pipeline the task and every parser (#49)
1 year ago