myhloli 20499ec388 fix(pdf_extract_kit): specify utf-8 encoding when reading model configEnsure the model configuration file is read with utf-8 encoding to support il y a 1 an
..
cli 30ac6f227c fix(magic-pdf): add default values and improve warning logs for config optionsEnsure that 'temp-output-dir', 'models-dir', and 'device-mode' have sensible default il y a 1 an
dict2md ff13c8e115 fix(mkmarkdown): add 2 space after image and table URLs il y a 1 an
filter df14c61f6f update: Enhance the capability to detect garbled document issues il y a 1 an
layout d5dbed7325 目录重构 il y a 1 an
libs d244a1c1a7 fix(config_reader): add utf-8 encoding when reading config file il y a 1 an
model 20499ec388 fix(pdf_extract_kit): specify utf-8 encoding when reading model configEnsure the model configuration file is read with utf-8 encoding to support il y a 1 an
para 7dcf63e69c fix:close some log output if not in debug mode il y a 1 an
pipe f8f6ba6fd3 update:Add md make mode config in do_parse.You can control whether the produced md is for NLP or MM by changing the value of f_make_md_mode il y a 1 an
post_proc 1b9d65b3d3 1、Trace类的key增加前置下划线 il y a 1 an
pre_proc e831df807a fix(magic_pdf): use interline_equations instead of interline_equation_blocks il y a 1 an
resources 57380cbed5 feat(language): add FT LANG cache directory setup il y a 1 an
rw 5db8911daa add errors="replace" in write mode MODE_TXT il y a 1 an
spark c9af3457f5 delete useless files il y a 1 an
train_utils efed5faa53 feat: modify foot note bbox tmp il y a 1 an
__init__.py d5dbed7325 目录重构 il y a 1 an
pdf_parse_by_ocr.py 959b8d82d8 renamed pipeline file name il y a 1 an
pdf_parse_by_txt.py 959b8d82d8 renamed pipeline file name il y a 1 an
pdf_parse_for_train.py d438b97a0a 切图逻辑重构 il y a 1 an
pdf_parse_union_core.py e831df807a fix(magic_pdf): use interline_equations instead of interline_equation_blocks il y a 1 an
user_api.py 959b8d82d8 renamed pipeline file name il y a 1 an