| .. |
|
__init__.py
|
bd9279198c
refactor: rename init file and update app.py to enable parsing method
|
5 mesiacov pred |
|
block_pre_proc.py
|
7d8f68cbb9
refactor: improve overlap handling by removing smaller blocks in block_pre_proc.py and pipeline_magic_model.py
|
4 mesiacov pred |
|
block_sort.py
|
284cec041a
refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files
|
5 mesiacov pred |
|
boxbase.py
|
236a6033f1
refactor: improve block processing logic and enhance span handling
|
5 mesiacov pred |
|
cli_parser.py
|
e6f817fe6f
refactor: extract command-line argument parsing to cli_parser.py and update usage in main functions
|
4 mesiacov pred |
|
config_reader.py
|
bd5252d946
fix: add conditional import for torch and torch_npu in config_reader.py
|
5 mesiacov pred |
|
cut_image.py
|
38ace5dc61
refactor: streamline document analysis and enhance image handling in processing pipeline
|
5 mesiacov pred |
|
draw_bbox.py
|
825fc95a8a
fix: ensure new pages are created for overlay merging in draw_bbox.py
|
3 mesiacov pred |
|
enum_class.py
|
d9b5d004d9
refactor: update content type references in pipeline and VLM processing scripts
|
4 mesiacov pred |
|
format_utils.py
|
6ed75347a6
Update mineru/utils/format_utils.py
|
3 mesiacov pred |
|
hash_utils.py
|
cbba27b4f5
refactor: reorganize project structure and update import paths
|
5 mesiacov pred |
|
language.py
|
8f1f9abec5
refactor: enhance bounding box utilities and add configuration reader for S3 integration
|
5 mesiacov pred |
|
llm_aided.py
|
1719c71a73
Merge pull request #2634 from Ar-Hyk/master
|
4 mesiacov pred |
|
magic_model_utils.py
|
1906643c67
refactor: streamline bbox processing and enhance category tying logic in magic_model_utils.py
|
3 mesiacov pred |
|
model_utils.py
|
a50616b089
Update mineru/utils/model_utils.py
|
3 mesiacov pred |
|
models_download_utils.py
|
fa9aaaa7b7
fix: update model path handling in model.py and models_download_utils.py
|
5 mesiacov pred |
|
ocr_utils.py
|
98d23e71ec
refactor: rename overlap detection functions for consistency in ocr_utils.py and span_block_fix.py
|
4 mesiacov pred |
|
pdf_classify.py
|
84fa04e22d
feat: enhance PDF image coverage analysis with improved parsing and coverage calculation
|
5 mesiacov pred |
|
pdf_image_tools.py
|
38ace5dc61
refactor: streamline document analysis and enhance image handling in processing pipeline
|
5 mesiacov pred |
|
pdf_reader.py
|
4243b0eaed
refactor: increase YOLO layout base batch size and improve progress tracking in predictions
|
4 mesiacov pred |
|
pdf_text_tool.py
|
1ed61cb5d6
refactor: update OCR span extraction logic and improve PDF page processing
|
5 mesiacov pred |
|
run_async.py
|
8e55a52693
feat: add mineru-vlm backend.
|
5 mesiacov pred |
|
span_block_fix.py
|
98d23e71ec
refactor: rename overlap detection functions for consistency in ocr_utils.py and span_block_fix.py
|
4 mesiacov pred |
|
span_pre_proc.py
|
1ee1550460
refactor: Optimize fill_char_in_spans using a spatial grid
|
4 mesiacov pred |