cuicheng01 6 meses atrás
pai
commit
f8fbd6b341
7 arquivos alterados com 38 adições e 112 exclusões
  1. 2 28
      README.md
  2. 1 3
      README_en.md
  3. 16 0
      docs/CHANGELOG.en.md
  4. 12 0
      docs/CHANGELOG.md
  5. 2 39
      docs/index.en.md
  6. 2 39
      docs/index.md
  7. 3 3
      docs/installation/installation.en.md

+ 2 - 28
README.md

@@ -39,7 +39,7 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
 - **重要能力发布:**
   - **重磅发布文字识别模型 PP-OCRv5**:全场景 OCR 识别精度跃升13%,单模型同时支持 5 种文字类型(简体中文、繁体中文、中文拼音、英文和日文),在中英文手写字体、竖直文本、生僻字等提升非常明显。可在 [在线Demo](https://aistudio.baidu.com/community/app/91660/webUI?source=appCenter) 中立即体验。
   - **重磅发布文档解析方案 PP-StructureV3**:强化了版面区域检测、表格识别、中英文公式识别、多栏阅读顺序的恢复能力,增加了图表理解能力,在 OmniDocBench 榜单上,PP-StructureV3 的整体中文和英文的编辑距离均达到 SOTA 水平。可在 [在线Demo](https://aistudio.baidu.com/community/app/518494/webUI?source=appCenter) 中立即体验。
-  - **优化PP-ChatOCRv4**:原生支持文心大模型4.5T,结合PP-DocBee2,关键信息抽取精度相比上一代提升15.7个百分点。可在 [在线Demo](https://aistudio.baidu.com/community/app/518493/webUI?source=appCenter) 中立即体验。
+  - **优化PP-ChatOCRv4**:原生支持文心大模型4.5T,结合新增的PP-DocBee2,关键信息抽取精度相比上一代提升15.7个百分点。可在 [在线Demo](https://aistudio.baidu.com/community/app/518493/webUI?source=appCenter) 中立即体验。
 - **推理能力优化:**
   - 通用OCR、通用版面解析v3、公式识别、印章文本识别、文档图像预处理产线支持设置batch size>1,一次处理多个页面。
   - 通用OCR、通用版面解析v3等17条产线支持多卡并行推理;新增产线多进程并行推理示例代码。
@@ -59,32 +59,6 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
   - **GCU:燧原正式纳入飞桨例行发版体系,完成了 PaddleX 生态适配。支持 90 个模型的训练和推理。**
 
 
-🔥 **2025.2.14,PaddleX v3.0.0rc0 重磅升级。** 本次版本全面适配 PaddlePaddle 3.0rc0及以上版本,核心升级如下:
-
-- **新增 12 条高价值产线,重磅推出自研 [通用版面解析v3产线](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/PP-StructureV3.html)、[PP-ChatOCRv4-doc产线](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction_v4.html)、[表格识别v2产线](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/table_recognition_v2.html)**。此外新增了文档处理、旋转框检测、开放词汇检测/分割、视频分析、多语种语音识别、3D 等场景的产线。
-
-- **扩充 48 个前沿模型,包括重磅推出的 OCR 领域的版面区域检测模型 [PP-DocLayout](https://paddlepaddle.github.io/PaddleX/latest/module_usage/tutorials/ocr_modules/layout_detection.html)、公式识别模型 [PP-FormulaNet](https://paddlepaddle.github.io/PaddleX/latest/module_usage/tutorials/ocr_modules/formula_recognition.html),表格结构识别模型 [SLANeXt](https://paddlepaddle.github.io/PaddleX/latest/module_usage/tutorials/ocr_modules/table_structure_recognition.html),文本识别模型 [PP-OCRv4_server_rec_doc](https://paddlepaddle.github.io/PaddleX/latest/module_usage/tutorials/ocr_modules/text_recognition.html)**。CV 领域的 3D 检测、人体关键点、开放词汇检测/分割模型,以及语音识别领域的 Whisper 系列等模型。
-
-- **优化和升级模型和产线的推理 API:** 支持更多参数的配置,提升模型和产线推理的灵活性,[详情](docs/API_change_log/v3.0.0rc.md)。
-
-- **多硬件支持扩展:** 新增燧原 GCU 支持(90+模型),昇腾 NPU/昆仑芯 XPU/寒武纪 MLU/海光 DCU 模型数量显著提升。
-
-- **全场景部署能力升级:**
-  - **高性能推理支持一键安装、Windows 系统及 220+ 模型,核心库 ultra-infer 开源;**
-  - **服务化部署新增高稳定性方案,支持动态配置优化。**
-
-- **系统兼容性增强:** 适配 Windows 训练/推理,全面支持 Python 3.11/3.12。
-
-🔥 **2024.11.15**,PaddleX 3.0 Beta2 开源版正式发布,全面适配 PaddlePaddle 3.0b2 版本。**新增通用图像识别、人脸识别、车辆属性识别和行人属性识别产线,同时新增 42 个模型开发全流程适配昇腾 910B,并全面支持[GitHub 站点文档](https://paddlepaddle.github.io/PaddleX/latest/index.html)。**
-
-🔥 **2024.9.30**,PaddleX 3.0 Beta1 开源版正式发布,提供 **200+ 模型** 通过极简的 Python API 一键调用;实现基于统一命令的模型全流程开发,并开源 **PP-ChatOCRv3** 特色模型产线基础能力;支持 **100+ 模型高性能推理和服务化部署**(持续迭代中),**4条模型产线8个重点视觉模型端侧部署**;**100+ 模型开发全流程适配昇腾 910B**,**39+ 模型开发全流程适配昆仑芯和寒武纪**。
-
-
-🔥 **2024.6.27**,PaddleX 3.0 Beta 开源版正式发布,支持以低代码的方式在本地端使用多种主流硬件进行产线和模型开发。
-
-🔥 **2024.3.25**,PaddleX 3.0 云端发布,支持在 AI Studio 星河社区 以零代码的方式【创建产线】使用。
-
-
  ## 🔠 模型产线说明
 
  **PaddleX 致力于实现产线级别的模型训练、推理与部署。模型产线是指一系列预定义好的、针对特定AI任务的开发流程,其中包含能够独立完成某类任务的单模型(单功能模块)组合。**
@@ -591,7 +565,7 @@ python -m pip install paddlepaddle==3.0.0 -i https://www.paddlepaddle.org.cn/pac
 * **安装PaddleX**
 
 ```bash
-pip install paddlex==3.0rc1
+pip install paddlex[base]==3.0.0
 ```
 
 > ❗ 更多安装方式参考 [PaddleX 安装教程](https://paddlepaddle.github.io/PaddleX/latest/installation/installation.html)

+ 1 - 3
README_en.md

@@ -574,9 +574,7 @@ python -m pip install paddlepaddle==3.0.0 -i https://www.paddlepaddle.org.cn/pac
 * **Installing PaddleX**
 
 ```bash
-pip install paddlex==3.0.0[base]
-# You can also install the sub-package for specific pipeline, such as:
-# pip install paddlex==3.0.0[ocr]
+pip install paddlex[base]==3.0.0
 ```
 
 > ❗For more installation methods, refer to the [PaddleX Installation Guide](https://paddlepaddle.github.io/PaddleX/latest/en/installation/installation.html).

+ 16 - 0
docs/CHANGELOG.en.md

@@ -6,6 +6,22 @@ comments: true
 
 ## Latest Version Information
 
+### PaddleX v3.0.0(5.20/2025) 
+
+Core upgrades are as follows:
+
+- **Major Capability Releases:**
+  - **Launch of the groundbreaking text recognition model PP-OCRv5**: Achieves a 13% improvement in OCR accuracy across all scenarios. A single model now supports 5 types of text (Simplified Chinese, Traditional Chinese, Chinese Pinyin, English, and Japanese), with significant enhancements in recognizing handwritten fonts, vertical text, and rare characters in both Chinese and English. You can experience it immediately in the [online demo](https://aistudio.baidu.com/community/app/91660/webUI?source=appCenter).
+  
+  - **Launch of the groundbreaking document parsing solution PP-StructureV3**: Enhanced capabilities in layout area detection, table recognition, Chinese and English formula recognition, and restoration of multi-column reading order, with added abilities for chart understanding. PP-StructureV3 achieves state-of-the-art (SOTA) levels in both Chinese and English editing distances on the OmniDocBench leaderboard. Experience it in the [online demo](https://aistudio.baidu.com/community/app/518494/webUI?source=appCenter).
+  
+  - **Optimization of PP-ChatOCRv4**: Supports the Ernie 4.5T. Combined with PP-DocBee2, it shows a 15.7 percentage point improvement in key information extraction accuracy compared to the previous generation. Experience it in the [online demo](https://aistudio.baidu.com/community/app/518493/webUI?source=appCenter).
+
+- **Inference Capability Optimization:**
+  - The general OCR, PP-StructureV3, formula recognition, seal text recognition, and document image preprocessing pipelines support setting batch size >1, allowing multiple pages to be processed at once.
+  
+  - 17 pipelines, including general OCR and PP-StructureV3, now support multi-GPU parallel inference. Sample code for multi-process parallel inference has been added.
+
 ### PaddleX v3.0.0rc1(4.22/2025)
 
 This version fully adapts to PaddlePaddle 3.0.0, with the following core upgrades:

+ 12 - 0
docs/CHANGELOG.md

@@ -5,6 +5,18 @@ comments: true
 # 版本更新信息
 
 ## 最新版本信息
+
+
+### PaddleX v3.0.0(5.20/2025) 
+
+- **重要能力发布:**
+  - **重磅发布文字识别模型 PP-OCRv5**:全场景 OCR 识别精度跃升13%,单模型同时支持 5 种文字类型(简体中文、繁体中文、中文拼音、英文和日文),在中英文手写字体、竖直文本、生僻字等提升非常明显。
+  - **重磅发布文档解析方案 PP-StructureV3**:强化了版面区域检测、表格识别、中英文公式识别、多栏阅读顺序的恢复能力,增加了图表理解能力,在 OmniDocBench 榜单上,PP-StructureV3 的整体中文和英文的编辑距离均达到 SOTA 水平。
+  - **优化PP-ChatOCRv4**:原生支持文心大模型4.5T,结合新增的PP-DocBee2,关键信息抽取精度相比上一代提升15.7个百分点。
+- **推理能力优化:**
+  - 通用OCR、通用版面解析v3、公式识别、印章文本识别、文档图像预处理产线支持设置batch size>1,一次处理多个页面。
+  - 通用OCR、通用版面解析v3等17条产线支持多卡并行推理;新增产线多进程并行推理示例代码。
+
 ### PaddleX v3.0.0rc1(4.22/2025)
 
 本次版本全面适配 PaddlePaddle 3.0 正式版,核心升级如下:

+ 2 - 39
docs/index.en.md

@@ -57,44 +57,7 @@ PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePadd
         }
 </style>
 
-<table class="img-table">
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/image_classification.html"><strong>Image Classification</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.html"><strong>Multi-label Image Classification</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/object_detection.html"><strong>Object Detection</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.html"><strong>Instance Segmentation</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/b302cd7e-e027-4ea6-86d0-8a4dd6d61f39"></td>
-            <td><img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/multilabel_cls.png"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/099e2b00-0bbe-4b20-9c5a-96b69e473bd2"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/09f683b4-27df-4c24-b8a7-84da20fdd182"></td>
-        </tr>
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.html"><strong>Semantic Segmentation</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.html"><strong>Image Anomaly Detection</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/OCR.html"><strong>OCR</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/table_recognition.html"><strong>Table Recognition</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/02637f8c-f248-415b-89ab-1276505f198c"></td>
-            <td><img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/image_anomaly_detection.png"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/1ef48536-48d4-484b-a6fb-0d6631ba2386"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/1e798e05-dee7-4b41-9cc4-6708b6014efa"></td>
-        </tr>
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction_v3.html"><strong>PP-ChatOCRv3-doc</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.html"><strong>Time Series Forecasting</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.html"><strong>Time Series Anomaly Detection</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.html"><strong>Time Series Classification</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/e3d97f4e-ab46-411c-8155-494c61492b0a"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/6e897bf6-35fe-45e6-a040-e9a1a20cfdf2"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/c54c66cc-da4f-4631-877b-43b0fbb192a6"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/0ce925b2-3776-4dde-8ce0-5156d5a2476e"></td>
-        </tr>
-    </table>
+![PaddleX](https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/PaddleX_en.png)
 
 ## 🛠️ Installation
 
@@ -124,7 +87,7 @@ PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePadd
 ### Installing PaddleX
 
 ```bash
-pip install paddlex==3.0.0rc1
+pip install paddlex[base]==3.0.0
 ```
 
 > ❗ For more installation methods, please refer to the [PaddleX Installation Guide](https://paddlepaddle.github.io/PaddleX/latest/installation/installation.html)

+ 2 - 39
docs/index.md

@@ -57,44 +57,7 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
         }
 </style>
 
-<table class="img-table">
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/image_classification.html"><strong>通用图像分类</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.html"><strong>图像多标签分类</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/object_detection.html"><strong>通用目标检测</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.html"><strong>通用实例分割</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/b302cd7e-e027-4ea6-86d0-8a4dd6d61f39"></td>
-            <td><img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/multilabel_cls.png"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/099e2b00-0bbe-4b20-9c5a-96b69e473bd2"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/09f683b4-27df-4c24-b8a7-84da20fdd182"></td>
-        </tr>
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.html"><strong>通用语义分割</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.html"><strong>图像异常检测</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/OCR.html"><strong>通用OCR</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/table_recognition.html"><strong>通用表格识别</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/02637f8c-f248-415b-89ab-1276505f198c"></td>
-            <td><img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/image_anomaly_detection.png"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/1ef48536-48d4-484b-a6fb-0d6631ba2386"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/1e798e05-dee7-4b41-9cc4-6708b6014efa"></td>
-        </tr>
-        <tr>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction_v3.html"><strong>文本图像智能分析</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.html"><strong>时序预测</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.html"><strong>时序异常检测</strong></a></th>
-            <th><a href="https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.html"><strong>时序分类</strong></a></th>
-        </tr>
-        <tr>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/e3d97f4e-ab46-411c-8155-494c61492b0a"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/6e897bf6-35fe-45e6-a040-e9a1a20cfdf2"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/c54c66cc-da4f-4631-877b-43b0fbb192a6"></td>
-            <td><img src="https://github.com/PaddlePaddle/PaddleX/assets/142379845/0ce925b2-3776-4dde-8ce0-5156d5a2476e"></td>
-        </tr>
-    </table>
+![PaddleX](https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/PaddleX_ch.png)
 
 
 ## 🛠️ 安装
@@ -125,7 +88,7 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
 ### 安装PaddleX
 
 ```bash
-pip install paddlex==3.0.0rc1
+pip install paddlex[base]==3.0.0
 ```
 
 > ❗ 更多安装方式参考 [PaddleX 安装教程](https://paddlepaddle.github.io/PaddleX/latest/installation/installation.html)

+ 3 - 3
docs/installation/installation.en.md

@@ -18,7 +18,7 @@ After installing PaddlePaddle (refer to the [PaddlePaddle Local Installation Tut
 
 ```bash
 # Only install the required dependencies (optional dependencies can be installed later as needed)
-pip install paddlex==3.0.0rc1
+pip install paddlex==3.0.0
 ```
 
 You can install the optional dependencies as needed using the following method (For more details, please refer to [2.3 Selective Installation of Dependencies](#23-selective-installation-of-dependencies)):
@@ -26,13 +26,13 @@ You can install the optional dependencies as needed using the following method (
 Install all dependencies required for PaddleX "basic features":
 
 ```bash
-pip install "paddlex[base]==3.0.0rc1"
+pip install "paddlex[base]==3.0.0"
 ```
 
 Only install the dependencies required for a specific feature:
 
 ```bash
-pip install "paddlex[ocr]==3.0.0rc1"
+pip install "paddlex[ocr]==3.0.0"
 ```
 
 ### 1.2 Plugin Installation Mode