Commit History

Author SHA1 Message Date
  myhloli 1f2ea493cd refactor: change default value of enable_ocr_det_batch to False in BatchAnalyze constructor 5 months ago
  myhloli f211554137 refactor: improve text processing by adding ligature and unicode replacement functions 5 months ago
  myhloli 76e1a7c1b7 refactor: enhance markdown generation by introducing pipeline_union_make and improving latex delimiter handling 5 months ago
  myhloli 9ded9e6bad refactor: simplify UnimernetModel constructor by removing unused cfg_path parameter 5 months ago
  myhloli 51393aa814 refactor: update union_make import and adjust middle JSON structure for consistency 5 months ago
  myhloli 6b1df41947 refactor: optimize OCR batch processing and enhance image cropping logic 5 months ago
  myhloli 73f8503514 refactor: optimize OCR batch processing and enhance image cropping logic 5 months ago
  myhloli 101b12a10a refactor: improve image handling by transitioning from NumPy arrays to PIL images in cropping functions 5 months ago
  myhloli a9abb4e201 refactor: enhance OCR processing and paragraph splitting in document analysis pipeline 5 months ago
  myhloli 7a22bfeebe refactor: enhance image margin cropping and processing for improved handling of PIL and NumPy images 5 months ago
  myhloli bd2c3d120a refactor: update OCR handling and adjust root directory path for model loading 5 months ago
  myhloli 38ace5dc61 refactor: streamline document analysis and enhance image handling in processing pipeline 5 months ago
  myhloli 6833882585 refactor: enhance language support and improve document parsing for multiple files 5 months ago
  myhloli 0f21495a06 refactor: enhance block processing and sorting utilities for improved span management 5 months ago
  myhloli ae7b0a6eba refactor: implement block preprocessing utilities for improved bounding box management 5 months ago
  myhloli 8f1f9abec5 refactor: enhance bounding box utilities and add configuration reader for S3 integration 5 months ago
  myhloli 7285ea9285 refactor: improve document analysis by integrating image loading and enhancing data handling 5 months ago
  myhloli ea5cb65a1f refactor: enhance document parsing by supporting multiple PDF files and improving method organization 5 months ago
  myhloli 0a899f1af8 feat: add batch processing for OCR detection and implement new client and common utilities 5 months ago
  myhloli cbba27b4f5 refactor: reorganize project structure and update import paths 5 months ago
  Xiaomeng Zhao 3027c677c9 Merge pull request #11 from johnking0099/refactor-mineru2 5 months ago
  Jin Zhen Jiang 8e55a52693 feat: add mineru-vlm backend. 5 months ago
  myhloli 6f8a961087 feat: implement S3 data reader and writer with multi-bucket support 5 months ago
  myhloli bd9279198c refactor: rename init file and update app.py to enable parsing method 5 months ago
  Xiaomeng Zhao f50165084d Merge pull request #2519 from opendatalab/master 5 months ago
  myhloli 580193bae0 Update version.py with new version 5 months ago
  Xiaomeng Zhao a989444e2f Merge pull request #2514 from opendatalab/release-1.3.12 5 months ago
  Xiaomeng Zhao e3a4295527 Merge pull request #2513 from myhloli/dev 5 months ago
  myhloli 73f0530d16 feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements 5 months ago
  Xiaomeng Zhao e92b5b698e Merge pull request #2512 from myhloli/dev 5 months ago