myhloli b2e37a2d1b feat(ocr): improve text detection and OCR accuracy пре 1 година
..
config 02b7999299 add init to magic_pdf.config пре 1 година
data 338c681455 feat: add more unittest пре 1 година
dict2md a07007e5e1 fix(ocr_mkcontent): improve hyphen handling at line ends пре 1 година
filter df14c61f6f update: Enhance the capability to detect garbled document issues пре 1 година
integrations b72d4ebd94 Feat/support rag (#510) пре 1 година
layout 03469909bb Feat/support footnote in figure (#532) пре 1 година
libs e78edb193e refactor(table): update default table model to Rapid Table пре 1 година
model b2e37a2d1b feat(ocr): improve text detection and OCR accuracy пре 1 година
para 69805f4ba9 refactor(para): adjust right margin threshold based on block width пре 1 година
pipe 1279f2cd0f feat(model): add support for DocLayout-YOLO model пре 1 година
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 пре 1 година
pre_proc e4810cee17 fix(remove_overlaps_min_spans): optimize overlap detection in OCR span list modification пре 1 година
resources 240fe99e3c feat(table): integrate RapidTable model for table recognition пре 1 година
rw 40e0827e60 Feat/impl cli (#264) пре 1 година
spark c9af3457f5 delete useless files пре 1 година
tools 20ed0cd5fb fix(tools): handle empty language string in common.py пре 1 година
utils 9cda7051c6 add init to magic_pdf.utils пре 1 година
__init__.py d5dbed7325 目录重构 пре 1 година
pdf_parse_by_ocr.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 пре 1 година
pdf_parse_by_txt.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 пре 1 година
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) пре 1 година
pdf_parse_union_core_v2.py 08f46125a0 refactor(model): rename and restructure model modules пре 1 година
user_api.py 1279f2cd0f feat(model): add support for DocLayout-YOLO model пре 1 година