Commit History

Author SHA1 Message Date
  zhch158_admin ef447c6c7b fix: 从识别结果中获取倾斜角度,优化表格处理的准确性 14 hours ago
  zhch158_admin 64652051e4 fix: 更新示例输入输出路径,修正注释以提高代码可读性 1 day ago
  zhch158_admin ca720abd31 fix: 增强UNet预处理的缩放因子验证,优化预测结果的尺寸一致性检查,记录详细的调试信息以确保坐标转换的准确性 1 day ago
  zhch158_admin 1fbcf06f4a fix: 增强文本填充器的OCR检测能力,支持跨单元格检测和输出调试图像,优化重叠检测逻辑 1 day ago
  zhch158_admin bb0acb2afc fix: 增强网格结构恢复中的坐标转换精度,添加调试信息以验证缩放比例和单元格覆盖情况 1 day ago
  zhch158_admin 3cf3aa5085 fix: 调整表格处理中的padding策略,优化边缘保护与噪声控制的平衡 1 day ago
  zhch158_admin 2f5c74136e fix: 优化 crop_region 方法中的代码格式,提升可读性 1 day ago
  zhch158_admin 0102386803 fix: Update OCR confidence threshold in bank_statement_wired_unet.yaml to improve cell recognition accuracy 1 day ago
  zhch158_admin 652b321bd6 feat: Update batch processing in main_v2.py to include output directory parameter for document processing, enhancing flexibility in file management. 2 days ago
  zhch158_admin 1bb438fba3 fix: Improve coordinate transformation accuracy in WiredTableVisualizer to reduce cumulative errors and enhance debugging with detailed logging of cell coordinates during visualization. 2 days ago
  zhch158_admin c628acd7b7 feat: Enhance text filling strategy in TextFiller class by introducing overlap ratio calculation for improved OCR box matching, optimizing performance with binary search, and refining text extraction logic. 2 days ago
  zhch158_admin 60aa86e4bf feat: Enhance GridRecovery class by adding optional parameters for original image dimensions and improving scaling logic for cell extraction, along with detailed debug logging for better traceability. 2 days ago
  zhch158_admin 26b500f344 feat: Add documentation for grid recovery scenarios in UNet, detailing causes of empty rows/columns and how `compress_grid` addresses them. 2 days ago
  zhch158_admin 6b063ced58 fix: Enhance UNet preprocessing and resizing logic in MinerUWiredTableRecognizer to ensure consistent prediction dimensions and improve debugging information 2 days ago
  zhch158_admin 3a5b2ab300 chore: Add .gitignore and a script to verify GridRecovery module import and cell computation with mocked dependencies. 3 days ago
  zhch158_admin 76f8e864a8 feat: Add .gitignore, implement grid recovery syntax verification, and enhance HuggingFace model loading with local cache prioritization. 3 days ago
  zhch158_admin e355727495 feat: Add wired table processing modules, `wired_table` adapter, and enhance HuggingFace model caching in `docling_layout_adapter`. 3 days ago
  zhch158_admin a4ad1d803a feat: Implement wired table processing with grid recovery and skew detection, and improve HuggingFace model caching. 3 days ago
  zhch158_admin 4f32495604 feat: Introduce new wired table processing module with enhanced skew detection, grid recovery, and output capabilities, and update pipeline to utilize it. 3 days ago
  zhch158_admin 3b3c3c9c5a feat: Introduce wired table parsing adapter with grid recovery, OCR formatting, and enhanced region cropping. 3 days ago
  zhch158_admin ce29ee3458 feat: Implement `mineru_wired_table_v2` adapter with enhanced table OCR preprocessing, grid recovery, and visualization utilities. 3 days ago
  zhch158_admin 6477e9183b feat: Add wired table adapter components, update Mineru wired table adapter, and improve HuggingFace model caching logic. 3 days ago
  zhch158_admin f7da730070 fix: 增强错误日志记录,添加详细的堆栈跟踪信息以便于调试 5 days ago
  zhch158_admin 4b399d085e feat: 添加倾斜检测与矫正功能,集成BBoxExtractor以优化OCR框处理 5 days ago
  zhch158_admin 05d07bb9ef feat: 添加BBoxExtractor以计算OCR文本的倾斜角度并记录信息 5 days ago
  zhch158_admin d7e5f2f689 refactor: 移除多边形到边界框的转换逻辑,简化IOU计算过程 5 days ago
  zhch158_admin bd17ca00f4 feat: 更新示例输入输出路径,添加新的测试图像以增强文档解析功能的测试覆盖率 5 days ago
  zhch158_admin 5235aff1b9 feat: 更新MinerU适配器,添加ocr_platform根目录到Python路径并优化坐标处理逻辑 5 days ago
  zhch158_admin fe223cd19d feat: 优化OCR文本框识别逻辑,优先使用多边形数据并增强错误日志信息 5 days ago
  zhch158_admin 60f761a6b5 feat: 重构单元格计算与网格恢复逻辑,增强对复杂表格的处理能力 6 days ago