myhloli 14ca71eed0 docs: enhance quick usage documentation with configuration examples and improve mac environment check há 2 semanas atrás
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method há 5 meses atrás
block_pre_proc.py 7d8f68cbb9 refactor: improve overlap handling by removing smaller blocks in block_pre_proc.py and pipeline_magic_model.py há 4 meses atrás
block_sort.py 279e84bf58 fix: improve device compatibility check for bf16 support in model initialization há 2 semanas atrás
boxbase.py 236a6033f1 refactor: improve block processing logic and enhance span handling há 5 meses atrás
check_mac_env.py 14ca71eed0 docs: enhance quick usage documentation with configuration examples and improve mac environment check há 2 semanas atrás
cli_parser.py e6f817fe6f refactor: extract command-line argument parsing to cli_parser.py and update usage in main functions há 4 meses atrás
config_reader.py bd5252d946 fix: add conditional import for torch and torch_npu in config_reader.py há 5 meses atrás
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline há 5 meses atrás
draw_bbox.py 5b73b89ceb fix: add handling for reference text blocks in draw_bbox.py há 2 meses atrás
enum_class.py 2d249666ba feat: integrate PP-FormulaNet_plus-M architecture and update model initialization há 1 mês atrás
format_utils.py 6ed75347a6 Update mineru/utils/format_utils.py há 3 meses atrás
guess_suffix_or_lang.py c9315b8e10 Refactor suffix guessing to handle PDF extensions for AI files há 1 mês atrás
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths há 5 meses atrás
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration há 5 meses atrás
llm_aided.py ad29a6a02a Update mineru/utils/llm_aided.py há 2 semanas atrás
magic_model_utils.py 1906643c67 refactor: streamline bbox processing and enhance category tying logic in magic_model_utils.py há 3 meses atrás
model_utils.py d0e68a3018 feat: implement RapidTable model for enhanced table structure prediction and batch processing há 2 meses atrás
models_download_utils.py fa9aaaa7b7 fix: update model path handling in model.py and models_download_utils.py há 5 meses atrás
ocr_utils.py af66bc02c2 优化ocr推理性能400% há 1 mês atrás
pdf_classify.py e429c5a840 fix: refactor PDF processing logic to ensure proper resource management and improve error handling há 3 meses atrás
pdf_image_tools.py 2fcffcb0af fix: refactor image handling to use numpy arrays instead of PIL images há 2 meses atrás
pdf_reader.py 26aa3d81e2 Update mineru/utils/pdf_reader.py há 3 meses atrás
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing há 5 meses atrás
run_async.py 8e55a52693 feat: add mineru-vlm backend. há 5 meses atrás
span_block_fix.py beadb7a689 fix: adjust overlap area ratio for image and table spans in span_block_fix há 2 meses atrás
span_pre_proc.py 1ee1550460 refactor: Optimize fill_char_in_spans using a spatial grid há 4 meses atrás
table_merge.py 39be54023b Refactor table merging logic to enhance colspan adjustments and improve caption handling há 1 mês atrás