KevinHuSh
							
						 
						
							
								843720f958
								
									
										
											 
										
									
								
							 
						 
						
							
									fix bug in pdf parser (#986) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#963  
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								7eee193956
								
									
										
											 
										
									
								
							 
						 
						
							
									fix #917 #915 (#946) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#917  
#915 
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   xinzhuang
							
						 
						
							
								3bbdf3b770
								
									
										
											 
										
									
								
							 
						 
						
							
									fixbug for computing 'not concating feature' (#896) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
When pdfparser call `_naive_vertical_merge` method,there is a "not
concating feature " value by computing difference between `b` and `b_`'s
layoutno ,but actually is `b` and `b`. I think it's a bug, so fix it.
Please check again.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								99be226c7c
								
									
										
											 
										
									
								
							 
						 
						
							
									fix coordinate error (#686) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#683  
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								cab274f560
								
									
										
											 
										
									
								
							 
						 
						
							
									remove PyMuPDF (#618) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#613  
### Type of change
- [x] Other (please describe): 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								8c07992b6c
								
									
										
											 
										
									
								
							 
						 
						
							
									refine code (#595) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
### Type of change
- [x] Refactoring 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								d589b0f568
								
									
										
											 
										
									
								
							 
						 
						
							
									fix exception in pdf parser (#584) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#451  
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								9d60a84958
								
									
										
											 
										
									
								
							 
						 
						
							
									refactor code (#583) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
### Type of change
- [x] Refactoring 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								66f8d35632
								
									
										
											 
										
									
								
							 
						 
						
							
									Refactor (#537) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
### Type of change
- [x] Refactoring 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								0dfc8ddc0f
								
									
										
											 
										
									
								
							 
						 
						
							
									enlarge docker memory usage (#501) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
### Type of change
- [x] Refactoring 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								962c66714e
								
									
										
											 
										
									
								
							 
						 
						
							
									fix divide by zero bug (#447) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#445  
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   加帆
							
						 
						
							
								39f1feaccb
								
									
										
											 
										
									
								
							 
						 
						
							
									Bug fix pdf parse index out of range (#440) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
fix a bug comes when parse some pdf file #436  
### Type of change
- [☑️ ] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								0499a3f621
								
									
										
											 
										
									
								
							 
						 
						
							
									rm page number exception for pdf parser (#424) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#423  
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue) 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								453c29170f
								
									
										
											 
										
									
								
							 
						 
						
							
									make sure the models will not be load twice (#422) 
							 
							
							 
							
							
							
							
### What problem does this PR solve?
#381  
### Type of change
- [x] Refactoring 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								a5384446e3
								
									
										
											 
										
									
								
							 
						 
						
							
									let's load model from local (#163) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								fd7fcb5baf
								
									
										
											 
										
									
								
							 
						 
						
							
									apply pep8 formalize (#155) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								979b3a5b4b
								
									
										
											 
										
									
								
							 
						 
						
							
									support snapshot download from local (#153) 
							 
							
							 
							
							
							
							
* support snapshot download from local
* let snapshot download from local 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								da21320b88
								
									
										
											 
										
									
								
							 
						 
						
							
									fix plainPdf bugs (#152) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								71fe314955
								
									
										
											 
										
									
								
							 
						 
						
							
									refine page ranges (#147) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								f6aee7f230
								
									
										
											 
										
									
								
							 
						 
						
							
									add use layout or not option (#145) 
							 
							
							 
							
							
							
							
* add use layout or not option
* trival 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								6c6b144de2
								
									
										
											 
										
									
								
							 
						 
						
							
									refine manual parser (#140) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								6999598101
								
									
										
											 
										
									
								
							 
						 
						
							
									refine for English corpus (#135) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								9a843667b3
								
									
										
											 
										
									
								
							 
						 
						
							
									fix github account login issue (#132) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								9da671b951
								
									
										
											 
										
									
								
							 
						 
						
							
									refine manul parser (#131) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								675a9f8d9a
								
									
										
											 
										
									
								
							 
						 
						
							
									add dockerfile for cuda envirement. Refine table search strategy, (#123) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								8f86ab9f7f
								
									
										
											 
										
									
								
							 
						 
						
							
									refine pdf parser, add time zone to userinfo (#112) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								602038ac49
								
									
										
											 
										
									
								
							 
						 
						
							
									fix task cancling bug (#98) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								8a57f2afd5
								
									
										
											 
										
									
								
							 
						 
						
							
									change callback strategy, add timezone to docker (#96) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								7bfaf0df29
								
									
										
											 
										
									
								
							 
						 
						
							
									fix position extraction bug (#93) 
							 
							
							 
							
							
							
							
* fix position extraction bug
* remove delimiter for naive parser 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								685b4d8a95
								
									
										
											 
										
									
								
							 
						 
						
							
									fix table desc bugs, add positions to chunks (#91) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								8a726fb04b
								
									
										
											 
										
									
								
							 
						 
						
							
									solve task execution issues (#90) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								3d4315c42a
								
									
										
											 
										
									
								
							 
						 
						
							
									resolve the issue of naive parser (#87) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								0429107e80
								
									
										
											 
										
									
								
							 
						 
						
							
									fix user login issue (#85) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								4568a4b2cb
								
									
										
											 
										
									
								
							 
						 
						
							
									refine admin initialization (#75) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								d32322c081
								
									
										
											 
										
									
								
							 
						 
						
							
									rename vision, add layour and tsr recognizer (#70) 
							 
							
							 
							
							
							
							
* rename vision, add layour and tsr recognizer
* trivial fixing 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								cacd36c5e1
								
									
										
											 
										
									
								
							 
						 
						
							
									use onnx models, new deepdoc (#68) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								a8294f2168
								
							 
						 
						
							
									Refine resume parts and fix bugs in retrival using sql (#66) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								407b2523b6
								
							 
						 
						
							
									remove unused codes, seperate layout detection out as a new api. Add new rag methed 'table' (#55) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								51482f3e2a
								
							 
						 
						
							
									Some document API refined. (#53) 
							 
							
							 
							
							
							
							
Add naive chunking method to RAG 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								e6acaf6738
								
							 
						 
						
							
									Add Q&A and Book, fix task running bugs (#50) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								6224edcd1b
								
							 
						 
						
							
									Add task moduel, and pipline the task and every parser (#49) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								96a1a44cb6
								
							 
						 
						
							
									add paper & manual parser (#46) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								072f9dd5bc
								
							 
						 
						
							
									Add app to rag module: presentaion & laws (#43) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								484e5abc1f
								
							 
						 
						
							
									llm configuation refine and trievalTest API refine (#40) 
							 
							
							
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								30791976d5
								
							 
						 
						
							
									build python version rag-flow (#21) 
							 
							
							 
							
							
							
							
* clean rust version project
* clean rust version project
* build python version rag-flow 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								d0db329fef
								
							 
						 
						
							
									add llm API (#19) 
							 
							
							 
							
							
							
							
* add llm API
* refine llm API 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								3245107dc7
								
							 
						 
						
							
									use minio to store uploaded files; build dialog server; (#16) 
							 
							
							 
							
							
							
							
* format code
* use minio to store uploaded files; build dialog server; 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								738c322508
								
							 
						 
						
							
									add docker compose (#8) 
							 
							
							 
							
							
							
							
* add docker compose
* add docker compose 
							
						 
						1 year ago  
					 
				
					
						
							
								   KevinHuSh
							
						 
						
							
								f4456af464
								
							 
						 
						
							
									init python part (#7) 
							 
							
							
							
						 
						1 year ago