liukaiwen 4c096443c7 add table recognition and conversion to LaTeX 1 year ago
cli 30ac6f227c fix(magic-pdf): add default values and improve warning logs for config options. Ensure that 'temp-output-dir', 'models-dir', and 'device-mode' have sensible default 1 year ago
dict2md d04f3f22f5 # feat(model inference): add table recognition and conversion to LaTeX 1 year ago
filter df14c61f6f update: Enhance the capability to detect garbled document issues 1 year ago
layout d5dbed7325 Restructure directories 1 year ago
libs c98e7b9804 Merge branch 'opendatalab:master' into master 1 year ago
model 4c096443c7 add table recognition and conversion to LaTeX 1 year ago
para 7dcf63e69c fix: close some log output if not in debug mode 1 year ago
pipe f8f6ba6fd3 update: Add md make mode config in do_parse. You can control whether the produced md is for NLP or MM by changing the value of f_make_md_mode 1 year ago
post_proc 1b9d65b3d3 1. Add a leading underscore to the keys of the Trace class 1 year ago
pre_proc e831df807a fix(magic_pdf): use interline_equations instead of interline_equation_blocks 1 year ago
resources d04f3f22f5 # feat(model inference): add table recognition and conversion to LaTeX 1 year ago
rw 5db8911daa add errors="replace" in write mode MODE_TXT 1 year ago
spark c9af3457f5 delete useless files 1 year ago
train_utils efed5faa53 feat: modify foot note bbox tmp 1 year ago
__init__.py d5dbed7325 Restructure directories 1 year ago
pdf_parse_by_ocr.py 959b8d82d8 renamed pipeline file name 1 year ago
pdf_parse_by_txt.py 959b8d82d8 renamed pipeline file name 1 year ago
pdf_parse_for_train.py d438b97a0a Refactor image-cropping logic 1 year ago
pdf_parse_union_core.py e831df807a fix(magic_pdf): use interline_equations instead of interline_equation_blocks 1 year ago
user_api.py 959b8d82d8 renamed pipeline file name 1 year ago