非结构化文档识别统一平台

zhch158_admin 499120d8c4 refactor: Implement lazy loading for PDFUtils and extract_pdf_pages 1 周之前
.vscode d412d10b80 fix: Correct JSON formatting in launch configuration 1 周之前
ocr_tools e49a2a1868 feat: Add PaddleOCR-VL batch processing tool with main script and README 1 周之前
ocr_utils 499120d8c4 refactor: Implement lazy loading for PDFUtils and extract_pdf_pages 1 周之前
README.md 2ec2167e12 添加环境配置说明和代码获取步骤 2 周之前
pyrightconfig.json b4b5494b89 chore: Update pyright configuration to include additional path for DotsOCR 2 周之前

README.md

1. 环境配置

1.1 代码获取

git clone https://gitee.com/zhch158_admin/ocr_platform.git -c user.name=zhch158_admin -c user.email=zhch158@sina.com
cd MinerU
git config --local user.name "zhch158_admin"
git config --local user.email "zhch158@sina.com"