| .. |
|
dict2md
|
03469909bb
Feat/support footnote in figure (#532)
|
1 jaar geleden |
|
filter
|
df14c61f6f
update: Enhance the capability to detect garbled document issues
|
1 jaar geleden |
|
integrations
|
b72d4ebd94
Feat/support rag (#510)
|
1 jaar geleden |
|
layout
|
03469909bb
Feat/support footnote in figure (#532)
|
1 jaar geleden |
|
libs
|
03469909bb
Feat/support footnote in figure (#532)
|
1 jaar geleden |
|
model
|
4b372f3f7e
feat(ocr): pass language parameter for custom model init
|
1 jaar geleden |
|
para
|
a7e615ffba
fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 (#542)
|
1 jaar geleden |
|
pipe
|
4b372f3f7e
feat(ocr): pass language parameter for custom model init
|
1 jaar geleden |
|
post_proc
|
1b9d65b3d3
1、Trace类的key增加前置下划线
|
1 jaar geleden |
|
pre_proc
|
03469909bb
Feat/support footnote in figure (#532)
|
1 jaar geleden |
|
resources
|
d714ac8b76
Release: Release 0.7.1 verison, update dev (#527)
|
1 jaar geleden |
|
rw
|
40e0827e60
Feat/impl cli (#264)
|
1 jaar geleden |
|
spark
|
c9af3457f5
delete useless files
|
1 jaar geleden |
|
tools
|
4b372f3f7e
feat(ocr): pass language parameter for custom model init
|
1 jaar geleden |
|
__init__.py
|
d5dbed7325
目录重构
|
1 jaar geleden |
|
pdf_parse_by_ocr.py
|
959b8d82d8
renamed pipeline file name
|
1 jaar geleden |
|
pdf_parse_by_txt.py
|
959b8d82d8
renamed pipeline file name
|
1 jaar geleden |
|
pdf_parse_union_core.py
|
068fab7f81
fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518)
|
1 jaar geleden |
|
user_api.py
|
0f91fcf61f
feat(cli&analyze&pipeline): add start_page and end_page args for pagination (#507)
|
1 jaar geleden |