zhch158_admin
|
3263321e84
feat: 添加统一的PDF图像加载函数,支持多种渲染引擎
|
6 timmar sedan |
zhch158_admin
|
4e6c855b17
feat: 添加PDF渲染引擎对比分析工具,支持分析图像属性和差异
|
6 timmar sedan |
zhch158_admin
|
95e2272ed9
fix: 添加PDF文档处理时保存页面图像的功能
|
6 timmar sedan |
zhch158_admin
|
e21b57e051
fix: 更新输出配置,启用保存布局和OCR图像
|
6 timmar sedan |
zhch158_admin
|
f75f1bb639
Fix typo in documentation regarding U-Net line detection and comparison with existing methods.
|
10 timmar sedan |
zhch158_admin
|
af1c467c48
fix: 优化UNet推理调试信息记录,增强尺寸一致性验证,返回检测到的倾斜角度
|
12 timmar sedan |
zhch158_admin
|
6be6c8c3bb
fix: 移除UNet预处理的宽高缩放因子参数,优化坐标转换的调试信息记录
|
12 timmar sedan |
zhch158_admin
|
ef447c6c7b
fix: 从识别结果中获取倾斜角度,优化表格处理的准确性
|
12 timmar sedan |
zhch158_admin
|
64652051e4
fix: 更新示例输入输出路径,修正注释以提高代码可读性
|
1 dag sedan |
zhch158_admin
|
ca720abd31
fix: 增强UNet预处理的缩放因子验证,优化预测结果的尺寸一致性检查,记录详细的调试信息以确保坐标转换的准确性
|
1 dag sedan |
zhch158_admin
|
1fbcf06f4a
fix: 增强文本填充器的OCR检测能力,支持跨单元格检测和输出调试图像,优化重叠检测逻辑
|
1 dag sedan |
zhch158_admin
|
bb0acb2afc
fix: 增强网格结构恢复中的坐标转换精度,添加调试信息以验证缩放比例和单元格覆盖情况
|
1 dag sedan |
zhch158_admin
|
3cf3aa5085
fix: 调整表格处理中的padding策略,优化边缘保护与噪声控制的平衡
|
1 dag sedan |
zhch158_admin
|
2f5c74136e
fix: 优化 crop_region 方法中的代码格式,提升可读性
|
1 dag sedan |
zhch158_admin
|
0102386803
fix: Update OCR confidence threshold in bank_statement_wired_unet.yaml to improve cell recognition accuracy
|
1 dag sedan |
zhch158_admin
|
652b321bd6
feat: Update batch processing in main_v2.py to include output directory parameter for document processing, enhancing flexibility in file management.
|
2 dagar sedan |
zhch158_admin
|
1bb438fba3
fix: Improve coordinate transformation accuracy in WiredTableVisualizer to reduce cumulative errors and enhance debugging with detailed logging of cell coordinates during visualization.
|
2 dagar sedan |
zhch158_admin
|
c628acd7b7
feat: Enhance text filling strategy in TextFiller class by introducing overlap ratio calculation for improved OCR box matching, optimizing performance with binary search, and refining text extraction logic.
|
2 dagar sedan |
zhch158_admin
|
60aa86e4bf
feat: Enhance GridRecovery class by adding optional parameters for original image dimensions and improving scaling logic for cell extraction, along with detailed debug logging for better traceability.
|
2 dagar sedan |
zhch158_admin
|
26b500f344
feat: Add documentation for grid recovery scenarios in UNet, detailing causes of empty rows/columns and how `compress_grid` addresses them.
|
2 dagar sedan |
zhch158_admin
|
6b063ced58
fix: Enhance UNet preprocessing and resizing logic in MinerUWiredTableRecognizer to ensure consistent prediction dimensions and improve debugging information
|
2 dagar sedan |
zhch158_admin
|
3a5b2ab300
chore: Add .gitignore and a script to verify GridRecovery module import and cell computation with mocked dependencies.
|
3 dagar sedan |
zhch158_admin
|
76f8e864a8
feat: Add .gitignore, implement grid recovery syntax verification, and enhance HuggingFace model loading with local cache prioritization.
|
3 dagar sedan |
zhch158_admin
|
e355727495
feat: Add wired table processing modules, `wired_table` adapter, and enhance HuggingFace model caching in `docling_layout_adapter`.
|
3 dagar sedan |
zhch158_admin
|
a4ad1d803a
feat: Implement wired table processing with grid recovery and skew detection, and improve HuggingFace model caching.
|
3 dagar sedan |
zhch158_admin
|
4f32495604
feat: Introduce new wired table processing module with enhanced skew detection, grid recovery, and output capabilities, and update pipeline to utilize it.
|
3 dagar sedan |
zhch158_admin
|
3b3c3c9c5a
feat: Introduce wired table parsing adapter with grid recovery, OCR formatting, and enhanced region cropping.
|
3 dagar sedan |
zhch158_admin
|
ce29ee3458
feat: Implement `mineru_wired_table_v2` adapter with enhanced table OCR preprocessing, grid recovery, and visualization utilities.
|
3 dagar sedan |
zhch158_admin
|
6477e9183b
feat: Add wired table adapter components, update Mineru wired table adapter, and improve HuggingFace model caching logic.
|
3 dagar sedan |
zhch158_admin
|
f7da730070
fix: 增强错误日志记录,添加详细的堆栈跟踪信息以便于调试
|
5 dagar sedan |