myhloli
|
1f2ea493cd
refactor: change default value of enable_ocr_det_batch to False in BatchAnalyze constructor
|
5 months ago |
myhloli
|
f211554137
refactor: improve text processing by adding ligature and unicode replacement functions
|
5 months ago |
myhloli
|
76e1a7c1b7
refactor: enhance markdown generation by introducing pipeline_union_make and improving latex delimiter handling
|
5 months ago |
myhloli
|
9ded9e6bad
refactor: simplify UnimernetModel constructor by removing unused cfg_path parameter
|
5 months ago |
myhloli
|
51393aa814
refactor: update union_make import and adjust middle JSON structure for consistency
|
5 months ago |
myhloli
|
6b1df41947
refactor: optimize OCR batch processing and enhance image cropping logic
|
5 months ago |
myhloli
|
73f8503514
refactor: optimize OCR batch processing and enhance image cropping logic
|
5 months ago |
myhloli
|
101b12a10a
refactor: improve image handling by transitioning from NumPy arrays to PIL images in cropping functions
|
5 months ago |
myhloli
|
a9abb4e201
refactor: enhance OCR processing and paragraph splitting in document analysis pipeline
|
5 months ago |
myhloli
|
7a22bfeebe
refactor: enhance image margin cropping and processing for improved handling of PIL and NumPy images
|
5 months ago |
myhloli
|
bd2c3d120a
refactor: update OCR handling and adjust root directory path for model loading
|
5 months ago |
myhloli
|
38ace5dc61
refactor: streamline document analysis and enhance image handling in processing pipeline
|
5 months ago |
myhloli
|
6833882585
refactor: enhance language support and improve document parsing for multiple files
|
5 months ago |
myhloli
|
0f21495a06
refactor: enhance block processing and sorting utilities for improved span management
|
5 months ago |
myhloli
|
ae7b0a6eba
refactor: implement block preprocessing utilities for improved bounding box management
|
5 months ago |
myhloli
|
8f1f9abec5
refactor: enhance bounding box utilities and add configuration reader for S3 integration
|
5 months ago |
myhloli
|
7285ea9285
refactor: improve document analysis by integrating image loading and enhancing data handling
|
5 months ago |
myhloli
|
ea5cb65a1f
refactor: enhance document parsing by supporting multiple PDF files and improving method organization
|
5 months ago |
myhloli
|
0a899f1af8
feat: add batch processing for OCR detection and implement new client and common utilities
|
5 months ago |
myhloli
|
cbba27b4f5
refactor: reorganize project structure and update import paths
|
5 months ago |
Xiaomeng Zhao
|
3027c677c9
Merge pull request #11 from johnking0099/refactor-mineru2
|
5 months ago |
Jin Zhen Jiang
|
8e55a52693
feat: add mineru-vlm backend.
|
5 months ago |
myhloli
|
6f8a961087
feat: implement S3 data reader and writer with multi-bucket support
|
5 months ago |
myhloli
|
bd9279198c
refactor: rename init file and update app.py to enable parsing method
|
5 months ago |
Xiaomeng Zhao
|
f50165084d
Merge pull request #2519 from opendatalab/master
|
5 months ago |
myhloli
|
580193bae0
Update version.py with new version
|
5 months ago |
Xiaomeng Zhao
|
a989444e2f
Merge pull request #2514 from opendatalab/release-1.3.12
|
5 months ago |
Xiaomeng Zhao
|
e3a4295527
Merge pull request #2513 from myhloli/dev
|
5 months ago |
myhloli
|
73f0530d16
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
|
5 months ago |
Xiaomeng Zhao
|
e92b5b698e
Merge pull request #2512 from myhloli/dev
|
5 months ago |