| .. |
|
dict2md
|
98313d4a25
Merge branch 'dev' into content-list-not-drop
|
1 жил өмнө |
|
filter
|
df14c61f6f
update: Enhance the capability to detect garbled document issues
|
1 жил өмнө |
|
integrations
|
b72d4ebd94
Feat/support rag (#510)
|
1 жил өмнө |
|
layout
|
03469909bb
Feat/support footnote in figure (#532)
|
1 жил өмнө |
|
libs
|
43a57d5627
feat(draw_bbox): add option to toggle bounding box drawing
|
1 жил өмнө |
|
model
|
f2a3a49541
fix(pdf_extract_kit):change unimernet base -> small
|
1 жил өмнө |
|
para
|
58a003177c
fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 (#574)
|
1 жил өмнө |
|
pipe
|
23b621e05a
feat(UNIPipe): change default drop_mode to NONE_WITH_REASON
|
1 жил өмнө |
|
post_proc
|
1b9d65b3d3
1、Trace类的key增加前置下划线
|
1 жил өмнө |
|
pre_proc
|
34f8965007
refactor(draw_bbox): add line sorting visualization
|
1 жил өмнө |
|
resources
|
f2a3a49541
fix(pdf_extract_kit):change unimernet base -> small
|
1 жил өмнө |
|
rw
|
40e0827e60
Feat/impl cli (#264)
|
1 жил өмнө |
|
spark
|
c9af3457f5
delete useless files
|
1 жил өмнө |
|
tools
|
43a57d5627
feat(draw_bbox): add option to toggle bounding box drawing
|
1 жил өмнө |
|
v3
|
3cbcf2ded0
feat(draw_bbox): add layout sorting visualization
|
1 жил өмнө |
|
__init__.py
|
d5dbed7325
目录重构
|
1 жил өмнө |
|
pdf_parse_by_ocr.py
|
1efebe421c
refactor(pdf_parse_union): integrate LayoutLMv3 for block orderingReplace the heuristic-based block ordering algorithm with LayoutLMv3 model predictions toimprove the accuracy of block ordering on PDF pages. Additionally, refactor the span
|
1 жил өмнө |
|
pdf_parse_by_txt.py
|
1efebe421c
refactor(pdf_parse_union): integrate LayoutLMv3 for block orderingReplace the heuristic-based block ordering algorithm with LayoutLMv3 model predictions toimprove the accuracy of block ordering on PDF pages. Additionally, refactor the span
|
1 жил өмнө |
|
pdf_parse_union_core.py
|
068fab7f81
fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518)
|
1 жил өмнө |
|
pdf_parse_union_core_v2.py
|
34f8965007
refactor(draw_bbox): add line sorting visualization
|
1 жил өмнө |
|
user_api.py
|
6062862c96
feat(pipeline): pass language parameter for parsing and markdown conversion
|
1 жил өмнө |