myhloli a8747f1deb refactor: comment out warning logs for missing config values in config_reader.py hai 6 meses
..
__init__.py bd9279198c refactor: rename init file and update app.py to enable parsing method hai 6 meses
block_pre_proc.py 236a6033f1 refactor: improve block processing logic and enhance span handling hai 6 meses
block_sort.py 284cec041a refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files hai 6 meses
boxbase.py 236a6033f1 refactor: improve block processing logic and enhance span handling hai 6 meses
config_reader.py a8747f1deb refactor: comment out warning logs for missing config values in config_reader.py hai 6 meses
cut_image.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline hai 6 meses
draw_bbox.py 57f44dd8ac refactor: update PDF library import from PyPDF2 to pypdf hai 6 meses
enum_class.py c3531d72ae refactor: update model paths and enhance RapidTableModel initialization hai 6 meses
format_utils.py 15dd9a0ff1 refactor: reorganize config_reader imports and enhance format utilities hai 6 meses
hash_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths hai 6 meses
language.py 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration hai 6 meses
llm_aided.py c18934a339 refactor: improve dictionary formatting in output structure hai 6 meses
model_utils.py 101b12a10a refactor: improve image handling by transitioning from NumPy arrays to PIL images in cropping functions hai 6 meses
models_download_utils.py 284cec041a refactor: replace get_file_from_repos with auto_download_and_get_model_root_path in multiple files hai 6 meses
ocr_utils.py cbba27b4f5 refactor: reorganize project structure and update import paths hai 6 meses
pdf_classify.py ea5cb65a1f refactor: enhance document parsing by supporting multiple PDF files and improving method organization hai 6 meses
pdf_image_tools.py 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline hai 6 meses
pdf_reader.py 8e55a52693 feat: add mineru-vlm backend. hai 6 meses
pdf_text_tool.py 1ed61cb5d6 refactor: update OCR span extraction logic and improve PDF page processing hai 6 meses
run_async.py 8e55a52693 feat: add mineru-vlm backend. hai 6 meses
span_block_fix.py f211554137 refactor: improve text processing by adding ligature and unicode replacement functions hai 6 meses
span_pre_proc.py a3ae57bf20 refactor: streamline text span extraction and remove unused functions hai 6 meses