Xiaomeng Zhao 3fd024da8b Merge pull request #883 from opendatalab/release-0.9.2 hai 1 ano
..
config 02b7999299 add init to magic_pdf.config hai 1 ano
data 338c681455 feat: add more unittest hai 1 ano
dict2md bd75596219 fix(merge_text): add ligature replacement functionality hai 1 ano
filter df14c61f6f update: Enhance the capability to detect garbled document issues hai 1 ano
integrations b72d4ebd94 Feat/support rag (#510) hai 1 ano
layout 03469909bb Feat/support footnote in figure (#532) hai 1 ano
libs 699b589b23 Update version.py with new version hai 1 ano
model 4b0f11769d refactor(model): remove unused code and simplify OCR model initialization hai 1 ano
para cf0d76c094 feat(para_split_v3): improve list identification with block aspect ratio hai 1 ano
pipe 1279f2cd0f feat(model): add support for DocLayout-YOLO model hai 1 ano
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 hai 1 ano
pre_proc 1807126e7f refactor(ocr): adjust OCR processing parameters hai 1 ano
resources 1279f2cd0f feat(model): add support for DocLayout-YOLO model hai 1 ano
rw 40e0827e60 Feat/impl cli (#264) hai 1 ano
spark c9af3457f5 delete useless files hai 1 ano
tools acab8de50f docs: update model download instructions and simplify demo scripts hai 1 ano
utils 9cda7051c6 add init to magic_pdf.utils hai 1 ano
__init__.py d5dbed7325 目录重构 hai 1 ano
pdf_parse_by_ocr.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 hai 1 ano
pdf_parse_by_txt.py 283b597a6e feat: add [figure | table] match [caption | footnote] match algorithm v2 hai 1 ano
pdf_parse_union_core.py 068fab7f81 fix(end_page_id):Fix the issue where end_page_id is corrected to len-1 when its input is 0. (#518) hai 1 ano
pdf_parse_union_core_v2.py 149132d608 feat(pdf_parse): improve span filtering and add new block types hai 1 ano
user_api.py 1279f2cd0f feat(model): add support for DocLayout-YOLO model hai 1 ano