myhloli e17ca99350 refactor: rename vertical text threshold constant for clarity in span_block_fix.py 4 сар өмнө
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method 6 сар өмнө
block_pre_proc.py 7d8f68cbb9 refactor: improve overlap handling by removing smaller blocks in block_pre_proc.py and pipeline_magic_model.py 4 сар өмнө
block_sort.py 284cec041a refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files 5 сар өмнө
boxbase.py 236a6033f1 refactor: improve block processing logic and enhance span handling 5 сар өмнө
config_reader.py bd5252d946 fix: add conditional import for torch and torch_npu in config_reader.py 5 сар өмнө
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline 5 сар өмнө
draw_bbox.py 41ecaedc0c feat: disable logging for invalid overlay PDF generation in draw_bbox.py 5 сар өмнө
enum_class.py 58b8e8a912 fix: add new enum values and improve MIN_BATCH_INFERENCE_SIZE documentation in pipeline_analyze.py 5 сар өмнө
format_utils.py 0031981e60 Fix otsl to html conversion 5 сар өмнө
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths 6 сар өмнө
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration 6 сар өмнө
llm_aided.py 06db3d173b feat: enhance title block processing with average height calculation and padding for OCR 4 сар өмнө
model_utils.py d58b24b5dd fix: add conditional imports for torch and torch_npu in model_utils.py 5 сар өмнө
models_download_utils.py fa9aaaa7b7 fix: update model path handling in model.py and models_download_utils.py 5 сар өмнө
ocr_utils.py 18691cfde2 refactor: enhance span merging logic for vertical text blocks in span_block_fix.py and ocr_utils.py 4 сар өмнө
pdf_classify.py 84fa04e22d feat: enhance PDF image coverage analysis with improved parsing and coverage calculation 5 сар өмнө
pdf_image_tools.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline 5 сар өмнө
pdf_reader.py 4243b0eaed refactor: increase YOLO layout base batch size and improve progress tracking in predictions 5 сар өмнө
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing 5 сар өмнө
run_async.py 8e55a52693 feat: add mineru-vlm backend. 6 сар өмнө
span_block_fix.py e17ca99350 refactor: rename vertical text threshold constant for clarity in span_block_fix.py 4 сар өмнө
span_pre_proc.py a3ae57bf20 refactor: streamline text span extraction and remove unused functions 5 сар өмнө