myhloli faf8c286fb fix(magic_pdf): handle missing image_path in spans 1 year ago
..
config 02b7999299 add init to magic_pdf.config 1 year ago
data 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 1 year ago
dict2md faf8c286fb fix(magic_pdf): handle missing image_path in spans 1 year ago
filter df14c61f6f update: Enhance the capability to detect garbled document issues 1 year ago
integrations b72d4ebd94 Feat/support rag (#510) 1 year ago
layout 03469909bb Feat/support footnote in figure (#532) 1 year ago
libs 7d2dfc8091 Merge branch 'dev' into dev-table-model-update 1 year ago
model 377b09cf8c refactor(table): disable StructEqTable support and add TableMaster support 1 year ago
para 8cc76c4921 refactor(para): improve paragraph splitting algorithm 1 year ago
pipe 1279f2cd0f feat(model): add support for DocLayout-YOLO model 1 year ago
post_proc 1b9d65b3d3 1. Add a leading underscore to the keys of the Trace class 1 year ago
pre_proc 1807126e7f refactor(ocr): adjust OCR processing parameters 1 year ago
resources 1279f2cd0f feat(model): add support for DocLayout-YOLO model 1 year ago
rw 40e0827e60 Feat/impl cli (#264) 1 year ago
spark c9af3457f5 delete useless files 1 year ago
tools acab8de50f docs: update model download instructions and simplify demo scripts 1 year ago
utils 9cda7051c6 add init to magic_pdf.utils 1 year ago
__init__.py d5dbed7325 Directory restructuring 1 year ago
pdf_parse_by_ocr.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 1 year ago
pdf_parse_by_txt.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 1 year ago
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) 1 year ago
pdf_parse_union_core_v2.py 4cf7e9a224 refactor(pdf_parse): adjust block splitting logic for wide blocks 1 year ago
user_api.py 1279f2cd0f feat(model): add support for DocLayout-YOLO model 1 year ago