myhloli ee81b3398e fix(magic_pdf): filter out formulas outside image bounds during cropped_img vor 1 Jahr
..
cli 63b3cfebfc docs(cli_help): update Chinese PDF path description vor 1 Jahr
dict2md ff13c8e115 fix(mkmarkdown): add 2 space after image and table URLs vor 1 Jahr
filter df14c61f6f update: Enhance the capability to detect garbled document issues vor 1 Jahr
layout d5dbed7325 目录重构 vor 1 Jahr
libs ab1ec00232 Merge pull request #172 from dt-yy/master vor 1 Jahr
model ee81b3398e fix(magic_pdf): filter out formulas outside image bounds during cropped_img vor 1 Jahr
para 7dcf63e69c fix:close some log output if not in debug mode vor 1 Jahr
pipe f8f6ba6fd3 update:Add md make mode config in do_parse.You can control whether the produced md is for NLP or MM by changing the value of f_make_md_mode vor 1 Jahr
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 vor 1 Jahr
pre_proc 5f992de4da fix(magic_pdf): prevent removal of low-confidence spans already dropped vor 1 Jahr
resources 81260a22d7 fix: remove personal info vor 1 Jahr
rw 5db8911daa add errors="replace" in write mode MODE_TXT vor 1 Jahr
spark c9af3457f5 delete useless files vor 1 Jahr
train_utils efed5faa53 feat: modify foot note bbox tmp vor 1 Jahr
__init__.py d5dbed7325 目录重构 vor 1 Jahr
pdf_parse_by_ocr.py 959b8d82d8 renamed pipeline file name vor 1 Jahr
pdf_parse_by_txt.py 959b8d82d8 renamed pipeline file name vor 1 Jahr
pdf_parse_for_train.py d438b97a0a 切图逻辑重构 vor 1 Jahr
pdf_parse_union_core.py e92de75844 add todo about interline_equation vor 1 Jahr
user_api.py 959b8d82d8 renamed pipeline file name vor 1 Jahr