Merge pull request #2064 from myhloli/dev

update released note
Xiaomeng Zhao, 7 months ago
parent
commit 96ab0ad8fb
2 files changed, with 32 additions and 0 deletions
  1. README.md (+16, -0)
  2. README_zh-CN.md (+16, -0)

+ 16 - 0
README.md

@@ -47,6 +47,20 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
 </div>
 
 # Changelog
+- 2025/04/03 Release of version 1.3.0, with many changes in this version:
+  - Installation and compatibility optimization
+    - Replaced the paddle framework and paddleocr throughout the project with paddleocr2torch, resolving conflicts between paddle and torch (single-process OCR is slightly slower than with the paddle framework).
+    - Removed the use of layoutlmv3 in layout, solving compatibility issues caused by `detectron2`.
+    - Extended torch version compatibility to 2.2~2.6.
+    - CUDA compatibility extended to 11.8~12.6 (CUDA version determined by torch), addressing compatibility issues for some users with 50-series and H-series GPUs.
+    - Python compatible versions extended to 3.10~3.12, resolving the issue of automatic downgrade to 0.6.1 during installation in non-3.10 environments.
+  - Performance optimization (compared to version 1.0.1, formula parsing speed improved by over 1400%, and overall parsing speed improved by over 500%)
+    - Supported batch processing for multiple PDF files, enhancing the parsing speed of batch files.
+    - Optimized the loading and usage of the mfr model, reducing GPU memory usage and improving parsing speed.
+    - Optimized GPU memory usage, allowing the project to run with as little as 6GB.
+    - Improved running speed on mps devices.
+  - Parsing effect optimization
+    - Updated the mfr model to unimernet(2503), solving the issue of missing line breaks in multi-line formulas.
 - 2025/03/03 1.2.1 released, fixed several bugs:
   - Fixed the impact on punctuation marks during full-width to half-width conversion of letters and numbers
   - Fixed caption matching inaccuracies in certain scenarios
@@ -414,6 +428,8 @@ This project currently uses PyMuPDF to achieve advanced functionality. However,
 - [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy)
 - [RapidTable](https://github.com/RapidAI/RapidTable)
 - [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
+- [RapidOCR](https://github.com/RapidAI/RapidOCR)
+- [PaddleOCR2Pytorch](https://github.com/frotms/PaddleOCR2Pytorch)
 - [PyMuPDF](https://github.com/pymupdf/PyMuPDF)
 - [layoutreader](https://github.com/ppaanngggg/layoutreader)
 - [fast-langdetect](https://github.com/LlmKira/fast-langdetect)

+ 16 - 0
README_zh-CN.md

@@ -46,6 +46,20 @@
 </div>
 
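 # Changelog
+- 2025/04/03 1.3.0 released; we made many changes in this version:
+  - Installation and compatibility optimization
+    - By using paddleocr2torch, fully replaced the paddle framework and paddleocr in the project, resolving the conflict between paddle and torch (single-process OCR speed drops slightly compared with the paddle framework)
+    - Removed the use of layoutlmv3 in layout, resolving the compatibility issues caused by `detectron2`
+    - torch version compatibility extended to 2.2~2.6
+    - CUDA compatibility extended to 11.8~12.6 (the CUDA version is determined by torch), resolving compatibility issues for some users with 50-series and H-series GPUs
+    - Python version compatibility extended to 3.10~3.12, resolving the automatic downgrade to 0.6.1 when installing in non-3.10 environments
+  - Performance optimization (compared with version 1.0.1, formula parsing speed improved by more than 1400% at best, and overall parsing speed improved by more than 500%)
+    - Added batching across multiple PDF files, improving the parsing speed of batch files
+    - Optimized the loading and usage of the mfr model, reducing GPU memory usage and improving parsing speed
+    - Optimized GPU memory usage; the project now runs with as little as 6GB
+    - Improved running speed on mps devices
+  - Parsing quality optimization
+    - Updated the mfr model to unimernet(2503), resolving the loss of line breaks in multi-line formulas
 - 2025/03/03 1.2.1 released, fixed several issues:
   - Fixed the impact on punctuation marks during full-width to half-width conversion of letters and numbers
   - Fixed inaccurate caption matching in certain cases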
@@ -418,6 +432,8 @@ TODO
 - [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy)
 - [RapidTable](https://github.com/RapidAI/RapidTable)
 - [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
+- [RapidOCR](https://github.com/RapidAI/RapidOCR)
+- [PaddleOCR2Pytorch](https://github.com/frotms/PaddleOCR2Pytorch)
 - [PyMuPDF](https://github.com/pymupdf/PyMuPDF)
 - [layoutreader](https://github.com/ppaanngggg/layoutreader)
 - [fast-langdetect](https://github.com/LlmKira/fast-langdetect)
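
The percentage figures in the 1.3.0 performance note can be read as speedup factors. The sketch below assumes that "improved by X%" means throughput rose by X% over the 1.0.1 baseline; the changelog does not define the measurement, so this reading and the 60-second baseline in the usage lines are illustrative assumptions, and both function names are hypothetical.

```python
def speedup_factor(percent_improvement: float) -> float:
    """Convert a 'speed improved by X%' figure into a multiplicative speedup.

    Assumes the percentage is a throughput gain relative to the baseline,
    so an improvement of 100% means twice as fast.
    """
    return 1.0 + percent_improvement / 100.0


def new_duration(old_seconds: float, percent_improvement: float) -> float:
    """Estimate the new run time for a job that used to take `old_seconds`."""
    return old_seconds / speedup_factor(percent_improvement)


if __name__ == "__main__":
    print(speedup_factor(1400))    # formula parsing: 15.0x the baseline speed
    print(speedup_factor(500))     # overall parsing: 6.0x the baseline speed
    print(new_duration(60, 500))   # a hypothetical 60 s job would take 10.0 s
```

Under this reading, "over 1400%" corresponds to formula parsing running at more than 15 times the 1.0.1 speed, and "over 500%" to overall parsing at more than 6 times.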