myhloli 1906643c67 refactor: streamline bbox processing and enhance category tying logic in magic_model_utils.py hai 3 meses
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method hai 5 meses
block_pre_proc.py 7d8f68cbb9 refactor: improve overlap handling by removing smaller blocks in block_pre_proc.py and pipeline_magic_model.py hai 4 meses
block_sort.py 284cec041a refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files hai 5 meses
boxbase.py 236a6033f1 refactor: improve block processing logic and enhance span handling hai 5 meses
cli_parser.py e6f817fe6f refactor: extract command-line argument parsing to cli_parser.py and update usage in main functions hai 4 meses
config_reader.py bd5252d946 fix: add conditional import for torch and torch_npu in config_reader.py hai 5 meses
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline hai 5 meses
draw_bbox.py 41ecaedc0c feat: disable logging for invalid overlay PDF generation in draw_bbox.py hai 5 meses
enum_class.py d9b5d004d9 refactor: update content type references in pipeline and VLM processing scripts hai 4 meses
format_utils.py 0031981e60 Fix otsl to html conversion hai 5 meses
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths hai 5 meses
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration hai 5 meses
llm_aided.py 1719c71a73 Merge pull request #2634 from Ar-Hyk/master hai 4 meses
magic_model_utils.py 1906643c67 refactor: streamline bbox processing and enhance category tying logic in magic_model_utils.py hai 3 meses
model_utils.py fbc8d21d6a refactor: optimize overlap removal logic in remove_overlaps_min_blocks function hai 4 meses
models_download_utils.py fa9aaaa7b7 fix: update model path handling in model.py and models_download_utils.py hai 5 meses
ocr_utils.py 98d23e71ec refactor: rename overlap detection functions for consistency in ocr_utils.py and span_block_fix.py hai 4 meses
pdf_classify.py 84fa04e22d feat: enhance PDF image coverage analysis with improved parsing and coverage calculation hai 5 meses
pdf_image_tools.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline hai 5 meses
pdf_reader.py 4243b0eaed refactor: increase YOLO layout base batch size and improve progress tracking in predictions hai 4 meses
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing hai 5 meses
run_async.py 8e55a52693 feat: add mineru-vlm backend. hai 5 meses
span_block_fix.py 98d23e71ec refactor: rename overlap detection functions for consistency in ocr_utils.py and span_block_fix.py hai 4 meses
span_pre_proc.py 1ee1550460 refactor: Optimize fill_char_in_spans using a spatial grid hai 4 meses