myhloli 4f340c4429 refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 vor 1 Jahr
..
dict2md 03469909bb Feat/support footnote in figure (#532) vor 1 Jahr
filter df14c61f6f update: Enhance the capability to detect garbled document issues vor 1 Jahr
integrations b72d4ebd94 Feat/support rag (#510) vor 1 Jahr
layout 03469909bb Feat/support footnote in figure (#532) vor 1 Jahr
libs 03469909bb Feat/support footnote in figure (#532) vor 1 Jahr
model 4f340c4429 refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 vor 1 Jahr
para 58a003177c fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 (#574) vor 1 Jahr
pipe 0f91fcf61f feat(cli&analyze&pipeline): add start_page and end_page args for pagination (#507) vor 1 Jahr
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 vor 1 Jahr
pre_proc 03469909bb Feat/support footnote in figure (#532) vor 1 Jahr
resources 4f340c4429 refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 vor 1 Jahr
rw 40e0827e60 Feat/impl cli (#264) vor 1 Jahr
spark c9af3457f5 delete useless files vor 1 Jahr
tools b72d4ebd94 Feat/support rag (#510) vor 1 Jahr
__init__.py d5dbed7325 目录重构 vor 1 Jahr
pdf_parse_by_ocr.py 959b8d82d8 renamed pipeline file name vor 1 Jahr
pdf_parse_by_txt.py 959b8d82d8 renamed pipeline file name vor 1 Jahr
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) vor 1 Jahr
user_api.py 0f91fcf61f feat(cli&analyze&pipeline): add start_page and end_page args for pagination (#507) vor 1 Jahr