PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePaddle framework. It integrates numerous ready-to-use pre-trained models, enabling full-process development from model training to inference, supporting a variety of mainstream hardware both domestic and international, and aiding AI developers in industrial practice.
๐จ Rich Models One-click Call: Integrate over 200 PaddlePaddle models covering multiple key areas such as OCR, object detection, and time series forecasting into 19 pipelines. Experience the model effects quickly through easy Python API calls. Also supports more than 20 modules for easy model combination use by developers.
๐ High Efficiency and Low barrier of entry: Achieve model full-process development based on graphical interfaces and unified commands, creating 8 featured model pipelines that combine large and small models, semi-supervised learning of large models, and multi-model fusion, greatly reducing the cost of iterating models.
๐ Flexible Deployment in Various Scenarios: Support various deployment methods such as high-performance inference, serving, and lite deployment to ensure efficient operation and rapid response of models in different application scenarios.
๐ง Efficient Support for Mainstream Hardware: Support seamless switching of various mainstream hardware such as NVIDIA GPUs, Kunlun XPU, Ascend NPU, and Cambricon MLU to ensure efficient operation.
๐ฅ๐ฅ 2025.2.14, PaddleX v3.0.0rc0 major upgrade. This version fully adapts to PaddlePaddle 3.0rc0, with the following core upgrades:
Added 12 high-value pipelines, launching self-developed Layout Parsing v2 Pipeline, PP-ChatOCRv4-doc Pipeline, Table Recognition v2 Pipeline. Additionally, new pipelines for document processing, rotated box detection, open vocabulary detection/segmentation, video analysis, multilingual speech recognition, 3D, and other scenarios have been added.
Expanded 48 cutting-edge models, including the major releases in the OCR field such as Document Layout Detection Model PP-DocLayout, Formula Recognition Model PP-FormulaNet, Table Structure Recognition Model SLANeXt, Text Recognition Model PP-OCRv4_server_rec_doc. In the CV field, models for 3D detection, human keypoints, open vocabulary detection/segmentation, and in the speech recognition field, models from the Whisper series, among others.
Optimized and upgraded the inference APIs for models and pipelines, supporting more parameter configurations to enhance the flexibility of model and pipeline inference. Details.
Expanded hardware support: added support for Suoyuan GCU (90+ models), and significantly increased the number of models for Ascend NPU/Kunlunxin XPU/Cambricon MLU/Hygon DCU.
Upgraded full-scenario deployment capabilities:
Enhanced system compatibility: adapted to Windows training/inference, fully supporting Python 3.11/3.12.
๐ฅ๐ฅ 11.15, 2024, PaddleX 3.0 Beta2 open source version is officially released, PaddleX 3.0 Beta2 is fully compatible with the PaddlePaddle 3.0b2 version. This update introduces new pipelines for general image recognition, face recognition, vehicle attribute recognition, and pedestrian attribute recognition. We have also developed 42 new models to fully support the Ascend 910B, with extensive documentation available on GitHub Pages.
๐ฅ๐ฅ 9.30, 2024, PaddleX 3.0 Beta1 open source version is officially released, providing more than 200 models that can be called with a simple Python API; achieve model full-process development based on unified commands, and open source the basic capabilities of the PP-ChatOCRv3 pipeline; support more than 100 models for high-performance inference and serving (iterating continuously), more than 7 key visual models for edge-deployment; more than 70 models have been adapted for the full development process of Ascend 910B, more than 15 models have been adapted for the full development process of Kunlun chips and Cambricon
๐ฅ 6.27, 2024, PaddleX 3.0 Beta open source version is officially released, supporting the use of various mainstream hardware for pipeline and model development in a low-code manner on the local side.
๐ฅ 3.25, 2024, PaddleX 3.0 cloud release, supporting the creation of pipelines in the AI Studio Galaxy Community in a zero-code manner.
PaddleX is dedicated to achieving pipeline-level model training, inference, and deployment. A pipeline refers to a series of predefined development processes for specific AI tasks, which includes a combination of single models (single-function modules) capable of independently completing a certain type of task.
All pipelines of PaddleX support online experience on AI Studio and local fast inference. You can quickly experience the effects of each pre-trained pipeline. If you are satisfied with the effects of the pre-trained pipeline, you can directly perform high-performance inference / serving / edge deployment on the pipeline. If not satisfied, you can also Custom Development to improve the pipeline effect. For the complete pipeline development process, please refer to the PaddleX pipeline Development Tool Local Use Tutorial.
In addition, PaddleX provides developers with a full-process efficient model training and deployment tool based on a cloud-based GUI. Developers do not need code development, just need to prepare a dataset that meets the pipeline requirements to quickly start model training. For details, please refer to the tutorial "Developing Industrial-level AI Models with Zero Barrier".
| Pipeline | Online Experience | Local Inference | High-Performance Inference | Serving | Edge Deployment | Custom Development | Zero-Code Development On AI Studio |
|---|---|---|---|---|---|---|---|
| OCR | Link | โ | โ | โ | โ | โ | โ |
| PP-ChatOCRv3 | Link | โ | โ | โ | ๐ง | โ | โ |
| Table Recognition | Link | โ | โ | โ | ๐ง | โ | โ |
| Object Detection | Link | โ | โ | โ | โ | โ | โ |
| Instance Segmentation | Link | โ | โ | โ | ๐ง | โ | โ |
| Image Classification | Link | โ | โ | โ | โ | โ | โ |
| Semantic Segmentation | Link | โ | โ | โ | โ | โ | โ |
| Time Series Forecasting | Link | โ | ๐ง | โ | ๐ง | โ | โ |
| Time Series Anomaly Detection | Link | โ | ๐ง | โ | ๐ง | โ | โ |
| Time Series Classification | Link | โ | ๐ง | โ | ๐ง | โ | โ |
| Small Object Detection | Link | โ | โ | โ | ๐ง | โ | โ |
| Multi-label Image Classification | Link | โ | โ | โ | ๐ง | โ | โ |
| Image Anomaly Detection | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Human Keypoint Detection | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Open Vocabulary Detection | ๐ง | โ | โ | โ | ๐ง | ๐ง | ๐ง |
| Open Vocabulary Segmentation | ๐ง | โ | โ | โ | ๐ง | ๐ง | ๐ง |
| Rotated Object Detection | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| 3D Bev Detection | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Table Recognition v2 | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Layout Parsing | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Layout Parsing v2 | ๐ง | โ | โ | โ | ๐ง | ๐ง | ๐ง |
| Formula Recognition | Link | โ | โ | โ | ๐ง | โ | โ |
| Seal Recognition | Link | โ | โ | โ | ๐ง | โ | โ |
| Document Image Preprocessing | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Image Recognition | ๐ง | โ | ๐ง | โ | ๐ง | โ | ๐ง |
| Pedestrian Attribute Recognition | Link | โ | ๐ง | โ | ๐ง | โ | โ |
| Vehicle Attribute Recognition | Link | โ | ๐ง | โ | ๐ง | โ | โ |
| Face Recognition | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Multilingual Speech Recognition | ๐ง | โ | โ | โ | ๐ง | ๐ง | ๐ง |
| Video Classification | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
| Video Detection | ๐ง | โ | โ | โ | ๐ง | โ | ๐ง |
โNote: The above capabilities are implemented based on GPU/CPU. PaddleX can also perform local inference and custom development on mainstream hardware such as Kunlunxin, Ascend, Cambricon, and Haiguang. The table below details the support status of the pipelines. For specific supported model lists, please refer to the Model List (Kunlunxin XPU)/Model List (Ascend NPU)/Model List (Cambricon MLU)/Model List (Haiguang DCU). We are continuously adapting more models and promoting the implementation of high-performance and serving on mainstream hardware.
๐ฅ๐ฅ Support for Domestic Hardware Capabilities
| Pipeline | Ascend 910B | Kunlunxin R200/R300 | Cambricon MLU370X8 | Haiguang Z100 |
|---|---|---|---|---|
| OCR | โ | โ | โ | ๐ง |
| Table Recognition | โ | ๐ง | ๐ง | ๐ง |
| Object Detection | โ | โ | โ | ๐ง |
| Instance Segmentation | โ | ๐ง | โ | ๐ง |
| Image Classification | โ | โ | โ | โ |
| Semantic Segmentation | โ | โ | โ | โ |
| Time Series Forecasting | โ | โ | โ | ๐ง |
| Time Series Anomaly Detection | โ | ๐ง | ๐ง | ๐ง |
| Time Series Classification | โ | ๐ง | ๐ง | ๐ง |
| Multi-label Image Classification | โ | ๐ง | ๐ง | โ |
| Pedestrian Attribute Recognition | โ | ๐ง | ๐ง | ๐ง |
| Vehicle Attribute Recognition | โ | ๐ง | ๐ง | ๐ง |
| Image Recognition | โ | ๐ง | โ | โ |
| Seal Recognition | โ | ๐ง | ๐ง | ๐ง |
| Image Anomaly Detection | โ | โ | โ | โ |
| Face Recognition | โ | โ | โ | โ |
โBefore installing PaddleX, please ensure you have a basic Python environment (Note: Currently supports Python 3.8 to Python 3.10, with more Python versions being adapted). The PaddleX 3.0-beta2 version depends on PaddlePaddle version 3.0.0b2.
Installing PaddlePaddle
# cpu
python -m pip install paddlepaddle==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cpu/
# gpu๏ผ่ฏฅๅฝไปคไป
้็จไบ CUDA ็ๆฌไธบ 11.8 ็ๆบๅจ็ฏๅข
python -m pip install paddlepaddle-gpu==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
# gpu๏ผ่ฏฅๅฝไปคไป
้็จไบ CUDA ็ๆฌไธบ 12.3 ็ๆบๅจ็ฏๅข
python -m pip install paddlepaddle-gpu==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cu123/
โFor more PaddlePaddle versions, please refer to the PaddlePaddle official website.
Installing PaddleX
pip install https://paddle-model-ecology.bj.bcebos.com/paddlex/whl/paddlex-3.0.0rc0-py3-none-any.whl
โFor more installation methods, refer to the PaddleX Installation Guide.
One command can quickly experience the pipeline effect, the unified CLI format is:
paddlex --pipeline [Pipeline Name] --input [Input Image] --device [Running Device]
You only need to specify three parameters:
pipeline: The name of the pipelineinput: The local path or URL of the input image to be processeddevice: The GPU number used (for example, gpu:0 means using the 0th GPU), you can also choose to use the CPU (cpu)For example, using the OCR pipeline:
paddlex --pipeline OCR --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png --device gpu:0
To use the command line for other pipelines, simply adjust the pipeline parameter to the name of the corresponding pipeline. Below are the commands for each pipeline:
A few lines of code can complete the quick inference of the pipeline, the unified Python script format is as follows:
from paddlex import create_pipeline
pipeline = create_pipeline(pipeline=[Pipeline Name])
output = pipeline.predict([Input Image Name])
for res in output:
res.print()
res.save_to_img("./output/")
res.save_to_json("./output/")
The following steps are executed:
create_pipeline() instantiates the pipeline objectpredict method of the pipeline object for inference predictionFor other pipelines in Python scripts, just adjust the pipeline parameter of the create_pipeline() method to the corresponding name of the pipeline. Below is a list of each pipeline's corresponding parameter name and detailed usage explanation:
๐ OCR
๐ฅ Computer Vision
โฑ๏ธ Time Series Analysis
๐ค Speech Analysis
๐ฅ Video Processing
๐ง Related Instructions
๐ผ๏ธ Image Classification
๐๏ธ Image Features
๐ฏ Object Detection
๐ผ๏ธ Image Segmentation
โฑ๏ธ Time Series Analysis
๐ 3D
๐ค Speech
๐ฅ Video
๐ Related Instructions
For answers to some common questions about our project, please refer to the FAQ. If your question has not been answered, please feel free to raise it in Issues.
We warmly welcome and encourage community members to raise questions, share ideas, and feedback in the Discussions section. Whether you want to report a bug, discuss a feature request, seek help, or just want to keep up with the latest project news, this is a great platform.
The release of this project is licensed under the Apache 2.0 license.