myhloli 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing vor 5 Monaten
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method vor 5 Monaten
block_pre_proc.py 0f21495a06 refactor: enhance block processing and sorting utilities for improved span management vor 5 Monaten
block_sort.py 0f21495a06 refactor: enhance block processing and sorting utilities for improved span management vor 5 Monaten
boxbase.py ae7b0a6eba refactor: implement block preprocessing utilities for improved bounding box management vor 5 Monaten
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline vor 5 Monaten
draw_bbox.py 0a899f1af8 feat: add batch processing for OCR detection and implement new client and common utilities vor 5 Monaten
enum_class.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration vor 5 Monaten
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths vor 5 Monaten
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration vor 5 Monaten
model_utils.py 101b12a10a refactor: improve image handling by transitioning from NumPy arrays to PIL images in cropping functions vor 5 Monaten
ocr_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths vor 5 Monaten
pdf_classify.py ea5cb65a1f refactor: enhance document parsing by supporting multiple PDF files and improving method organization vor 5 Monaten
pdf_image_tools.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline vor 5 Monaten
pdf_reader.py 8e55a52693 feat: add mineru-vlm backend. vor 5 Monaten
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing vor 5 Monaten
pipeline_magic_model.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration vor 5 Monaten
run_async.py 8e55a52693 feat: add mineru-vlm backend. vor 5 Monaten
span_block_fix.py f211554137 refactor: improve text processing by adding ligature and unicode replacement functions vor 5 Monaten
span_pre_proc.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing vor 5 Monaten
vlm_magic_model.py 0a899f1af8 feat: add batch processing for OCR detection and implement new client and common utilities vor 5 Monaten