|
__init__.py
|
bd9279198c
refactor: rename init file and update app.py to enable parsing method
|
5 months ago |
|
block_pre_proc.py
|
7d8f68cbb9
refactor: improve overlap handling by removing smaller blocks in block_pre_proc.py and pipeline_magic_model.py
|
4 months ago |
|
block_sort.py
|
284cec041a
refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files
|
5 months ago |
|
boxbase.py
|
236a6033f1
refactor: improve block processing logic and enhance span handling
|
5 months ago |
|
cli_parser.py
|
e6f817fe6f
refactor: extract command-line argument parsing to cli_parser.py and update usage in main functions
|
4 months ago |
|
config_reader.py
|
bd5252d946
fix: add conditional import for torch and torch_npu in config_reader.py
|
5 months ago |
|
cut_image.py
|
38ace5dc61
refactor: streamline document analysis and enhance image handling in processing pipeline
|
5 months ago |
|
draw_bbox.py
|
5b73b89ceb
fix: add handling for reference text blocks in draw_bbox.py
|
2 months ago |
|
enum_class.py
|
cdbe6ba9b6
Update mineru/utils/enum_class.py
|
2 months ago |
|
format_utils.py
|
6ed75347a6
Update mineru/utils/format_utils.py
|
3 months ago |
|
guess_suffix_or_lang.py
|
c9315b8e10
Refactor suffix guessing to handle PDF extensions for AI files
|
1 month ago |
|
hash_utils.py
|
cbba27b4f5
refactor: reorganize project structure and update import paths
|
5 months ago |
|
language.py
|
8f1f9abec5
refactor: enhance bounding box utilities and add configuration reader for S3 integration
|
5 months ago |
|
llm_aided.py
|
1719c71a73
Merge pull request #2634 from Ar-Hyk/master
|
4 months ago |
|
magic_model_utils.py
|
1906643c67
refactor: streamline bbox processing and enhance category tying logic in magic_model_utils.py
|
3 months ago |
|
model_utils.py
|
d0e68a3018
feat: implement RapidTable model for enhanced table structure prediction and batch processing
|
2 months ago |
|
models_download_utils.py
|
fa9aaaa7b7
fix: update model path handling in model.py and models_download_utils.py
|
5 months ago |
|
ocr_utils.py
|
af66bc02c2
Optimize OCR inference performance by 400%
|
1 month ago |
|
pdf_classify.py
|
e429c5a840
fix: refactor PDF processing logic to ensure proper resource management and improve error handling
|
3 months ago |
|
pdf_image_tools.py
|
2fcffcb0af
fix: refactor image handling to use numpy arrays instead of PIL images
|
2 months ago |
|
pdf_reader.py
|
26aa3d81e2
Update mineru/utils/pdf_reader.py
|
3 months ago |
|
pdf_text_tool.py
|
1ed61cb5d6
refactor: update OCR span extraction logic and improve PDF page processing
|
5 months ago |
|
run_async.py
|
8e55a52693
feat: add mineru-vlm backend.
|
5 months ago |
|
span_block_fix.py
|
beadb7a689
fix: adjust overlap area ratio for image and table spans in span_block_fix
|
2 months ago |
|
span_pre_proc.py
|
1ee1550460
refactor: Optimize fill_char_in_spans using a spatial grid
|
4 months ago |
|
table_merge.py
|
39be54023b
Refactor table merging logic to enhance colspan adjustments and improve caption handling
|
1 month ago