黄泳凯 cbc14bfe1d Fix:Error code:500,param:Request timed out, please try again later. преди 5 месеца
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method преди 5 месеца
block_pre_proc.py 236a6033f1 refactor: improve block processing logic and enhance span handling преди 5 месеца
block_sort.py 284cec041a refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files преди 5 месеца
boxbase.py 236a6033f1 refactor: improve block processing logic and enhance span handling преди 5 месеца
config_reader.py b34c8cb004 fix: handle None config cases in configuration retrieval functions преди 5 месеца
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline преди 5 месеца
draw_bbox.py 41ecaedc0c feat: disable logging for invalid overlay PDF generation in draw_bbox.py преди 5 месеца
enum_class.py 4eaa85fd31 refactor: update make mode constants to improve content list handling преди 5 месеца
format_utils.py 15dd9a0ff1 refactor: reorganize config_reader imports and enhance format utilities преди 5 месеца
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths преди 5 месеца
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration преди 5 месеца
llm_aided.py cbc14bfe1d Fix:Error code:500,param:Request timed out, please try again later. преди 5 месеца
model_utils.py 101b12a10a refactor: improve image handling by transitioning from NumPy arrays to PIL images in cropping functions преди 5 месеца
models_download_utils.py 9bfb3e9ec3 feat: enhance model downloading logic to support different repository modes преди 5 месеца
ocr_utils.py 59d8f105e5 feat: introduce OcrConfidence class and update confidence threshold checks in OCR processing преди 5 месеца
pdf_classify.py 84fa04e22d feat: enhance PDF image coverage analysis with improved parsing and coverage calculation преди 5 месеца
pdf_image_tools.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline преди 5 месеца
pdf_reader.py 8e55a52693 feat: add mineru-vlm backend. преди 5 месеца
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing преди 5 месеца
run_async.py 8e55a52693 feat: add mineru-vlm backend. преди 5 месеца
span_block_fix.py f211554137 refactor: improve text processing by adding ligature and unicode replacement functions преди 5 месеца
span_pre_proc.py a3ae57bf20 refactor: streamline text span extraction and remove unused functions преди 5 месеца