myhloli 43a57d5627 feat(draw_bbox): add option to toggle bounding box drawing 1 year ago
dict2md 98313d4a25 Merge branch 'dev' into content-list-not-drop 1 year ago
filter df14c61f6f update: Enhance the capability to detect garbled document issues 1 year ago
integrations b72d4ebd94 Feat/support rag (#510) 1 year ago
layout 03469909bb Feat/support footnote in figure (#532) 1 year ago
libs 43a57d5627 feat(draw_bbox): add option to toggle bounding box drawing 1 year ago
model f2a3a49541 fix(pdf_extract_kit):change unimernet base -> small 1 year ago
para 58a003177c fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 (#574) 1 year ago
pipe 23b621e05a feat(UNIPipe): change default drop_mode to NONE_WITH_REASON 1 year ago
post_proc 1b9d65b3d3 1. Add a leading underscore to the keys of the Trace class 1 year ago
pre_proc 34f8965007 refactor(draw_bbox): add line sorting visualization 1 year ago
resources f2a3a49541 fix(pdf_extract_kit):change unimernet base -> small 1 year ago
rw 40e0827e60 Feat/impl cli (#264) 1 year ago
spark c9af3457f5 delete useless files 1 year ago
tools 43a57d5627 feat(draw_bbox): add option to toggle bounding box drawing 1 year ago
v3 3cbcf2ded0 feat(draw_bbox): add layout sorting visualization 1 year ago
__init__.py d5dbed7325 Directory restructuring 1 year ago
pdf_parse_by_ocr.py 1efebe421c refactor(pdf_parse_union): integrate LayoutLMv3 for block ordering. Replace the heuristic-based block ordering algorithm with LayoutLMv3 model predictions to improve the accuracy of block ordering on PDF pages. Additionally, refactor the span 1 year ago
pdf_parse_by_txt.py 1efebe421c refactor(pdf_parse_union): integrate LayoutLMv3 for block ordering. Replace the heuristic-based block ordering algorithm with LayoutLMv3 model predictions to improve the accuracy of block ordering on PDF pages. Additionally, refactor the span 1 year ago
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) 1 year ago
pdf_parse_union_core_v2.py 34f8965007 refactor(draw_bbox): add line sorting visualization 1 year ago
user_api.py 6062862c96 feat(pipeline): pass language parameter for parsing and markdown conversion 1 year ago