xu rui 11994506e0 feat: add zh_cn docs hai 1 ano
..
config b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
data 11994506e0 feat: add zh_cn docs hai 1 ano
dict2md b80befe9cf refactor(mkcontent): optimize paragraph text merging and language detection hai 1 ano
filter a3a720ea87 refactor: isolate inference and pipeline hai 1 ano
integrations b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
libs f6bd47de6a docs: add dataset method description hai 1 ano
model d44e7a28f4 refactor: add docs hai 1 ano
para 41545a13c6 refactor(para): adjust line height multiplier for block splitting hai 1 ano
pipe f6bd47de6a docs: add dataset method description hai 1 ano
pre_proc 7f8dc353b0 fix(pre_proc): prevent errors when imageWriter is None hai 1 ano
resources 240fe99e3c feat(table): integrate RapidTable model for table recognition hai 1 ano
rw 2db3c26374 refactor(libs): remove unused imports and functions hai 1 ano
spark b492c19c4c refactor: move some constants or enums defs to config folder hai 1 ano
tools 4a82d6a07a feat: add function definitions hai 1 ano
utils 9cda7051c6 add init to magic_pdf.utils hai 1 ano
__init__.py d5dbed7325 目录重构 hai 1 ano
pdf_parse_by_ocr.py a3a720ea87 refactor: isolate inference and pipeline hai 1 ano
pdf_parse_by_txt.py a3a720ea87 refactor: isolate inference and pipeline hai 1 ano
pdf_parse_union_core_v2.py f6bd47de6a docs: add dataset method description hai 1 ano
user_api.py a3a720ea87 refactor: isolate inference and pipeline hai 1 ano