myhloli 0a9a6d3e53 fix(magic_pdf): include List and Index block types in processing vor 1 Jahr
..
Constants.py d714ac8b76 Release: Release 0.7.1 verison, update dev (#527) vor 1 Jahr
MakeContentConfig.py 37fbe998ac feat(ocr_mkcontent): support drop reason in none_with_reason modeEnable the `NONE_WITH_REASON` drop mode in `para_to_standard_format_v2` by updating the vor 1 Jahr
ModelBlockTypeEnum.py d1a9d1db2f io modules vor 1 Jahr
__init__.py d5dbed7325 目录重构 vor 1 Jahr
boxbase.py 6cc8cbca52 fix: 1. resolve uncorrect pair relation of figure and footnote, 2. resolve uncorrect pair relation of table and caption #590 vor 1 Jahr
calc_span_stats.py d5dbed7325 目录重构 vor 1 Jahr
clean_memory.py 4c9bf8abd5 refactor(memory management): remove unused clean_memory function vor 1 Jahr
commons.py 1de37e4c65 add version_name to middle json vor 1 Jahr
config_reader.py ded2818ac2 feat(layoutreader): support local model directory and improve model loading vor 1 Jahr
convert_utils.py 709a65008a 中间态dict结构调整 vor 1 Jahr
coordinate_transform.py 7b0db8a4b3 将fix缩放倍率的bbox写入model_list vor 1 Jahr
detect_language_from_model.py e492b3dce8 语言检测逻辑移动到parse流程 vor 1 Jahr
draw_bbox.py 0a9a6d3e53 fix(magic_pdf): include List and Index block types in processing vor 1 Jahr
drop_reason.py 2f13b3a87c add new drop scene vor 1 Jahr
drop_tag.py 45ce99bf87 block type 字段名修复 vor 1 Jahr
hash_utils.py 00f16239c6 实现parse_ocr_pdf api,切图逻辑s3使用平铺地址,本地使用层级地址,删除预设s3_image_save_path vor 1 Jahr
json_compressor.py d5dbed7325 目录重构 vor 1 Jahr
language.py 57380cbed5 feat(language): add FT LANG cache directory setup vor 1 Jahr
local_math.py 12bec17eed refactor(magic_pdf): replace math module with local_math vor 1 Jahr
markdown_utils.py 59b0b0c3da make markdown时特殊符号转义 vor 1 Jahr
nlp_utils.py d5dbed7325 目录重构 vor 1 Jahr
ocr_content_type.py 1f1dd3538d feat(list&index block): detect and merge list and index blocks vor 1 Jahr
path_utils.py 6c656af65f update:cleanup requirements.txt vor 1 Jahr
pdf_check.py 8998380da5 update check invalid_chars algorithm to improve accuracy vor 1 Jahr
pdf_image_tools.py 435ab922c6 Merge branch 'master' into master vor 1 Jahr
safe_filename.py d5dbed7325 目录重构 vor 1 Jahr
textbase.py d5dbed7325 目录重构 vor 1 Jahr
version.py a4c72e2e33 fix: solve conflicts vor 1 Jahr
vis_utils.py d5dbed7325 目录重构 vor 1 Jahr