Unified Platform for Unstructured Document Recognition

zhch158_admin 5f5e88e396 feat: add table line generator module and its core functionality 1 week ago
.vscode 6b95fb0489 feat: add VSCode task configuration to start frontend and backend together 1 week ago
docs 674b43692f feat: add several documents and tool descriptions, including the OCR platform, the table recognition module, and its evaluation algorithm 1 week ago
legacy_table_line_generator 5f5e88e396 feat: add table line generator module and its core functionality 1 week ago
ocr_comparator 244ee9de2c feat: Add report generation and similarity calculation modules 1 week ago
ocr_tools d8ecf2d8c6 Add new language dictionaries and model configurations for OCR 1 week ago
ocr_utils a7520b9498 feat: add several utility modules, including device detection, image processing, HTML/Markdown processing, and number parsing 1 week ago
ocr_validator 444025c466 feat: Add OCR validation display module with cross-validation results and table handling 1 week ago
table_line_generator 3f977e0137 feat: replace old template with updated version for 康强_北京农村商业银行, including new line data and relative coordinates 1 week ago
README.md 2ec2167e12 Add environment setup notes and code retrieval steps 2 weeks ago
pyrightconfig.json 02b7f63b3d chore: Update pyright configuration to include additional path for PaddleX 1 week ago

README.md

1. Environment Setup

1.1 Getting the Code

git clone https://gitee.com/zhch158_admin/ocr_platform.git -c user.name=zhch158_admin -c user.email=zhch158@sina.com
cd ocr_platform
git config --local user.name "zhch158_admin"
git config --local user.email "zhch158@sina.com"
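
After running the commands above, it can be useful to confirm that the identity was written to the repository-local configuration and not only to the global one. The sketch below assumes a POSIX shell and an installed `git`. `check_git_identity` is a hypothetical helper introduced here for illustration and is not part of the project.

```shell
# check_git_identity: hypothetical helper, not part of the project.
# Prints the repository-local Git identity as "name <email>".
# Returns non-zero if either value is missing from the local config.
check_git_identity() {
    repo_dir="${1:-.}"   # repository to inspect; defaults to the current directory
    name=$(git -C "$repo_dir" config --local --get user.name) || return 1
    email=$(git -C "$repo_dir" config --local --get user.email) || return 1
    printf '%s <%s>\n' "$name" "$email"
}
```

Run `check_git_identity .` from inside the cloned repository. A non-zero exit status means that `user.name` or `user.email` is not set at the local level.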