|
|
@@ -4,13 +4,13 @@
|
|
|
|
|
|
<p align="center">
|
|
|
<a href="./LICENSE"><img src="https://img.shields.io/badge/License-Apache%202-red.svg"></a>
|
|
|
- <a href=""><img src="https://img.shields.io/badge/Python-3.8%2C%203.9%2C%203.10-blue.svg"></a>
|
|
|
+ <a href=""><img src="https://img.shields.io/badge/Python-3.8~3.12-blue.svg"></a>
|
|
|
<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Windows%2C%20Mac-orange.svg"></a>
|
|
|
<a href=""><img src="https://img.shields.io/badge/hardware-CPU%2C%20GPU%2C%20XPU%2C%20NPU%2C%20MLU%2C%20DCU-yellow.svg"></a>
|
|
|
</p>
|
|
|
|
|
|
<h4 align="center">
|
|
|
- <a href=#-why-paddlex->🌟 Features</a> | <a href=https://aistudio.baidu.com/pipeline/mine>🌐 Online Experience</a>|<a href=#️-quick-start>🚀 Quick Start</a> | <a href=https://addlepaddle.github.io/PaddleX/latest/en/index.html> 📖 Documentation</a> | <a href=#-what-can-paddlex-do> 🔥Capabilities</a> | <a href=https://paddlepaddle.github.io/PaddleX/latest/en/support_list/models_list.html> 📋 Models</a>
|
|
|
+ <a href=#-why-paddlex->🌟 Features</a> | <a href=https://aistudio.baidu.com/application/center/app?tag=%E5%85%A8%E9%83%A8&flod=86503>>🌐 Online Experience</a>|<a href=#️-quick-start>🚀 Quick Start</a> | <a href=https://addlepaddle.github.io/PaddleX/latest/en/index.html> 📖 Documentation</a> | <a href=#-what-can-paddlex-do> 🔥Capabilities</a> | <a href=https://paddlepaddle.github.io/PaddleX/latest/en/support_list/models_list.html> 📋 Models</a>
|
|
|
</h4>
|
|
|
|
|
|
<h5 align="center">
|
|
|
@@ -210,47 +210,57 @@ In addition, PaddleX provides developers with a full-process efficient model tra
|
|
|
<td>✅</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.html">Image Anomaly Detection</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute.html">Pedestrian Attribute Recognition</a></td>
|
|
|
+ <td><a href="https://aistudio.baidu.com/community/app/387978/webUI?source=appCenter">Link</a></td>
|
|
|
+ <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
+ </tr>
|
|
|
+ <tr>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute.html">Vehicle Attribute Recognition</a></td>
|
|
|
+ <td><a href="https://aistudio.baidu.com/community/app/387979/webUI?source=appCenter">Link</a></td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
+ <td>✅</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/human_keypoint_detection.html">Human Keypoint Detection</a></td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.html">Formula Recognition</a></td>
|
|
|
+ <td><a href="https://aistudio.baidu.com/community/app/387976/webUI?source=appCenter">Link</a></td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_detection.html">Open Vocabulary Detection</a></td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.html">Seal Recognition</a></td>
|
|
|
+ <td><a href="https://aistudio.baidu.com/community/app/387977/webUI?source=appCenter">Link</a></td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>🚧</td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
+ <td>✅</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_segmentation.html">Open Vocabulary Segmentation</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.html">Image Anomaly Detection</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/rotated_object_detection.html">Rotated Object Detection</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/human_keypoint_detection.html">Human Keypoint Detection</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
@@ -260,27 +270,27 @@ In addition, PaddleX provides developers with a full-process efficient model tra
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/3d_bev_detection.html">3D Bev Detection</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_detection.html">Open Vocabulary Detection</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/table_recognition_v2.html">Table Recognition v2</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_segmentation.html">Open Vocabulary Segmentation</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.html">Layout Parsing</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/rotated_object_detection.html">Rotated Object Detection</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
@@ -290,74 +300,64 @@ In addition, PaddleX provides developers with a full-process efficient model tra
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/layout_parsing_v2.html">Layout Parsing v2</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/3d_bev_detection.html">3D Bev Detection</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.html">Formula Recognition</a></td>
|
|
|
- <td><a href="https://aistudio.baidu.com/community/app/387976/webUI?source=appCenter">Link</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/table_recognition_v2.html">Table Recognition v2</a></td>
|
|
|
+ <td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
- <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.html">Seal Recognition</a></td>
|
|
|
- <td><a href="https://aistudio.baidu.com/community/app/387977/webUI?source=appCenter">Link</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.html">Layout Parsing</a></td>
|
|
|
+ <td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
- <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/doc_preprocessor.html">Document Image Preprocessing</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/layout_parsing_v2.html">Layout Parsing v2</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>✅</td>
|
|
|
+ <td>🚧</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.html">Image Recognition</a></td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/doc_preprocessor.html">Document Image Preprocessing</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
- <td>🚧</td>
|
|
|
+ <td>✅</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute.html">Pedestrian Attribute Recognition</a></td>
|
|
|
- <td><a href="https://aistudio.baidu.com/community/app/387978/webUI?source=appCenter">Link</a></td>
|
|
|
- <td>✅</td>
|
|
|
+ <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.html">Image Recognition</a></td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
- <td>✅</td>
|
|
|
- </tr>
|
|
|
- <tr>
|
|
|
- <td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute.html">Vehicle Attribute Recognition</a></td>
|
|
|
- <td><a href="https://aistudio.baidu.com/community/app/387979/webUI?source=appCenter">Link</a></td>
|
|
|
- <td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
<td>✅</td>
|
|
|
<td>🚧</td>
|
|
|
- <td>✅</td>
|
|
|
- <td>✅</td>
|
|
|
</tr>
|
|
|
<tr>
|
|
|
<td><a href="https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/face_recognition.html">Face Recognition</a></td>
|
|
|
@@ -533,21 +533,21 @@ In addition, PaddleX provides developers with a full-process efficient model tra
|
|
|
|
|
|
### 🛠️ Installation
|
|
|
|
|
|
-> ❗Before installing PaddleX, please ensure you have a basic **Python environment** (Note: Currently supports Python 3.8 to Python 3.10, with more Python versions being adapted). The PaddleX 3.0-beta2 version depends on PaddlePaddle version 3.0.0b2.
|
|
|
+> ❗Before installing PaddleX, please ensure you have a basic **Python runtime environment** (Note: Currently supports running under Python 3.8 to Python 3.10, with more Python versions under adaptation). The PaddlePaddle version required by PaddleX
|
|
|
|
|
|
* **Installing PaddlePaddle**
|
|
|
|
|
|
```bash
|
|
|
-# cpu
|
|
|
+# CPU
|
|
|
python -m pip install paddlepaddle==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cpu/
|
|
|
|
|
|
-# gpu,该命令仅适用于 CUDA 版本为 11.8 的机器环境
|
|
|
+# gpu,requires GPU driver version ≥450.80.02 (Linux) or ≥452.39 (Windows)
|
|
|
python -m pip install paddlepaddle-gpu==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
|
|
|
|
|
|
-# gpu,该命令仅适用于 CUDA 版本为 12.3 的机器环境
|
|
|
+# gpu,requires GPU driver version ≥545.23.06 (Linux) or ≥545.84 (Windows)
|
|
|
python -m pip install paddlepaddle-gpu==3.0.0rc0 -i https://www.paddlepaddle.org.cn/packages/stable/cu123/
|
|
|
```
|
|
|
-> ❗For more PaddlePaddle versions, please refer to the [PaddlePaddle official website](https://www.paddlepaddle.org.cn/install/quick?docurl=/documentation./docs/zh/install/pip/linux-pip.html).
|
|
|
+> ❗No need to focus on the CUDA version on the physical machine, only the GPU driver version needs attention. For more information on PaddlePaddle Wheel versions, please refer to the [PaddlePaddle Official Website](https://www.paddlepaddle.org.cn/install/quick?docurl=/documentation./docs/en/install/pip/linux-pip.html).
|
|
|
|
|
|
* **Installing PaddleX**
|
|
|
|
|
|
@@ -566,38 +566,32 @@ One command can quickly experience the pipeline effect, the unified CLI format i
|
|
|
paddlex --pipeline [Pipeline Name] --input [Input Image] --device [Running Device]
|
|
|
```
|
|
|
|
|
|
-You only need to specify three parameters:
|
|
|
-* `pipeline`: The name of the pipeline
|
|
|
-* `input`: The local path or URL of the input image to be processed
|
|
|
-* `device`: The GPU number used (for example, `gpu:0` means using the 0th GPU), you can also choose to use the CPU (`cpu`)
|
|
|
+Each Pipeline in PaddleX corresponds to specific parameters, which you can view in the respective Pipeline documentation for detailed explanations. Each Pipeline requires specifying three necessary parameters:
|
|
|
+* `pipeline`: The name of the Pipeline or the configuration file of the Pipeline
|
|
|
+* `input`: The local path, directory, or URL of the input file (e.g., an image) to be processed
|
|
|
+* `device`: The hardware device and its index to use (e.g., `gpu:0` indicates using the 0th GPU), or you can choose to use NPU (`npu:0`), XPU (`xpu:0`), CPU (`cpu`), etc.
|
|
|
|
|
|
For example, using the OCR pipeline:
|
|
|
```bash
|
|
|
-paddlex --pipeline OCR --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png --device gpu:0
|
|
|
+paddlex --pipeline OCR \
|
|
|
+ --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png \
|
|
|
+ --use_doc_orientation_classify False \
|
|
|
+ --use_doc_unwarping False \
|
|
|
+ --use_textline_orientation False \
|
|
|
+ --save_path ./output \
|
|
|
+ --device gpu:0
|
|
|
```
|
|
|
<details>
|
|
|
<summary><b>👉 Click to view the running result</b></summary>
|
|
|
|
|
|
```bash
|
|
|
-{
|
|
|
-'input_path': '/root/.paddlex/predict_input/general_ocr_002.png',
|
|
|
-'dt_polys': [array([[161, 27],
|
|
|
- [353, 22],
|
|
|
- [354, 69],
|
|
|
- [162, 74]], dtype=int16), array([[426, 26],
|
|
|
- [657, 21],
|
|
|
- [657, 58],
|
|
|
- [426, 62]], dtype=int16), array([[702, 18],
|
|
|
- [822, 13],
|
|
|
- [824, 57],
|
|
|
- [704, 62]], dtype=int16), array([[341, 106],
|
|
|
- [405, 106],
|
|
|
- [405, 128],
|
|
|
- [341, 128]], dtype=int16)
|
|
|
- ...],
|
|
|
-'dt_scores': [0.758478200014338, 0.7021546472698513, 0.8536622648391111, 0.8619181462164781, 0.8321051217096188, 0.8868756173427551, 0.7982964727675609, 0.8289939036796322, 0.8289428877522524, 0.8587063317632897, 0.7786755892491615, 0.8502032769081344, 0.8703346500042997, 0.834490931790065, 0.908291103353393, 0.7614978661708064, 0.8325774055997542, 0.7843421347676149, 0.8680889482955594, 0.8788859304537682, 0.8963341277518075, 0.9364654810069546, 0.8092413027028257, 0.8503743089091863, 0.7920740420391101, 0.7592224394793805, 0.7920547400069311, 0.6641757962457888, 0.8650289477605955, 0.8079483304467047, 0.8532207681055275, 0.8913377034754717],
|
|
|
-'rec_text': ['登机牌', 'BOARDING', 'PASS', '舱位', 'CLASS', '序号 SERIALNO.', '座位号', '日期 DATE', 'SEAT NO', '航班 FLIGHW', '035', 'MU2379', '始发地', 'FROM', '登机口', 'GATE', '登机时间BDT', '目的地TO', '福州', 'TAIYUAN', 'G11', 'FUZHOU', '身份识别IDNO', '姓名NAME', 'ZHANGQIWEI', 票号TKTNO', '张祺伟', '票价FARE', 'ETKT7813699238489/1', '登机口于起飞前10分钟关闭GATESCLOSE10MINUTESBEFOREDEPARTURETIME'],
|
|
|
-'rec_score': [0.9985831379890442, 0.999696917533874512, 0.9985735416412354, 0.9842517971992493, 0.9383274912834167, 0.9943678975105286, 0.9419361352920532, 0.9221674799919128, 0.9555020928382874, 0.9870321154594421, 0.9664073586463928, 0.9988052248954773, 0.9979352355003357, 0.9985110759735107, 0.9943482875823975, 0.9991195797920227, 0.9936401844024658, 0.9974591135978699, 0.9743705987930298, 0.9980487823486328, 0.9874696135520935, 0.9900962710380554, 0.9952947497367859, 0.9950481653213501, 0.989926815032959, 0.9915552139282227, 0.9938777685165405, 0.997239887714386, 0.9963340759277344, 0.9936134815216064, 0.97223961353302]}
|
|
|
+{'res': {'input_path': 'general_ocr_002.png', 'page_index': None, 'model_settings': {'use_doc_preprocessor': False, 'use_textline_orientation': False}, 'doc_preprocessor_res': {'input_path': None, 'model_settings': {'use_doc_orientation_classify': True, 'use_doc_unwarping': False}, 'angle': 0},'dt_polys': [array([[ 3, 10],
|
|
|
+ [82, 10],
|
|
|
+ [82, 33],
|
|
|
+ [ 3, 33]], dtype=int16), ...], 'text_det_params': {'limit_side_len': 960, 'limit_type': 'max', 'thresh': 0.3, 'box_thresh': 0.6, 'unclip_ratio': 2.0}, 'text_type': 'general', 'textline_orientation_angles': [-1, ...], 'text_rec_score_thresh': 0.0, 'rec_texts': ['www.99*', ...], 'rec_scores': [0.8980069160461426, ...], 'rec_polys': [array([[ 3, 10],
|
|
|
+ [82, 10],
|
|
|
+ [82, 33],
|
|
|
+ [ 3, 33]], dtype=int16), ...], 'rec_boxes': array([[ 3, 10, 82, 33], ...], dtype=int16)}}
|
|
|
```
|
|
|
|
|
|
The visualization result is as follows:
|
|
|
@@ -606,30 +600,41 @@ The visualization result is as follows:
|
|
|
|
|
|
</details>
|
|
|
|
|
|
-To use the command line for other pipelines, simply adjust the `pipeline` parameter to the name of the corresponding pipeline. Below are the commands for each pipeline:
|
|
|
+To use the command line for other pipelines, simply adjust the `pipeline` parameter to the name of the corresponding pipeline and modify the parameters accordingly. Below are the commands for each pipeline:
|
|
|
|
|
|
<details>
|
|
|
<summary><b>👉 More CLI usage for pipelines</b></summary>
|
|
|
|
|
|
| Pipeline Name | Command |
|
|
|
|------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
|
|
-| Image Classification | `paddlex --pipeline image_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg --device gpu:0` |
|
|
|
-| Object Detection | `paddlex --pipeline object_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_object_detection_002.png --device gpu:0` |
|
|
|
-| Instance Segmentation| `paddlex --pipeline instance_segmentation --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_instance_segmentation_004.png --device gpu:0` |
|
|
|
-| Semantic Segmentation| `paddlex --pipeline semantic_segmentation --input https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/application/semantic_segmentation/makassaridn-road_demo.png --device gpu:0` |
|
|
|
-| Image Multi-label Classification | `paddlex --pipeline multi_label_image_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg --device gpu:0` |
|
|
|
-| Small Object Detection | `paddlex --pipeline small_object_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/small_object_detection.jpg --device gpu:0` |
|
|
|
-| Image Anomaly Detection | `paddlex --pipeline anomaly_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/uad_grid.png --device gpu:0` |
|
|
|
-| Pedestrian Attribute Recognition | `paddlex --pipeline pedestrian_attribute --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/pedestrian_attribute_002.jpg --device gpu:0` |
|
|
|
-| Vehicle Attribute Recognition | `paddlex --pipeline vehicle_attribute --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/vehicle_attribute_002.jpg --device gpu:0` |
|
|
|
-| OCR | `paddlex --pipeline OCR --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png --device gpu:0` |
|
|
|
-| Table Recognition | `paddlex --pipeline table_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/table_recognition.jpg --device gpu:0` |
|
|
|
-| Layout Parsing | `paddlex --pipeline layout_parsing --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/demo_paper.png --device gpu:0` |
|
|
|
-| Formula Recognition | `paddlex --pipeline formula_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/demo_image/general_formula_recognition.png --device gpu:0` |
|
|
|
-| Seal Recognition | `paddlex --pipeline seal_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/seal_text_det.png --device gpu:0` |
|
|
|
-| Time Series Forecasting | `paddlex --pipeline ts_fc --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_fc.csv --device gpu:0` |
|
|
|
-| Time Series Anomaly Detection | `paddlex --pipeline ts_ad --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_ad.csv --device gpu:0` |
|
|
|
-| Time Series Classification | `paddlex --pipeline ts_cls --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_cls.csv --device gpu:0` |
|
|
|
+| General Image Classification | `paddlex --pipeline image_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg --device gpu:0 --save_path ./output/` |
|
|
|
+| General Object Detection | `paddlex --pipeline object_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_object_detection_002.png --threshold 0.5 --save_path ./output/ --device gpu:0` |
|
|
|
+| General Instance Segmentation | `paddlex --pipeline instance_segmentation --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_instance_segmentation_004.png --threshold 0.5 --save_path ./output --device gpu:0` |
|
|
|
+| General Semantic Segmentation | `paddlex --pipeline semantic_segmentation --input https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/application/semantic_segmentation/makassaridn-road_demo.png --target_size -1 --save_path ./output --device gpu:0` |
|
|
|
+| Image Multi-label Classification | `paddlex --pipeline image_multilabel_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg --save_path ./output --device gpu:0` |
|
|
|
+| Small Object Detection | `paddlex --pipeline small_object_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/small_object_detection.jpg --threshold 0.5 --save_path ./output --device gpu:0` |
|
|
|
+| Image Anomaly Detection | `paddlex --pipeline anomaly_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/uad_grid.png --save_path ./output --device gpu:0` |
|
|
|
+| Pedestrian Attribute Recognition | `paddlex --pipeline pedestrian_attribute_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/pedestrian_attribute_002.jpg --save_path ./output/ --device gpu:0` |
|
|
|
+| Vehicle Attribute Recognition | `paddlex --pipeline vehicle_attribute_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/vehicle_attribute_002.jpg --save_path ./output/ --device gpu:0` |
|
|
|
+| 3D Multi-modal Fusion Detection | `paddlex --pipeline 3d_bev_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/det_3d/demo_det_3d/nuscenes_demo_infer.tar --device gpu:0 --save_path ./output/` |
|
|
|
+| Human Keypoint Detection | `paddlex --pipeline human_keypoint_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/keypoint_detection_001.jpg --det_threshold 0.5 --save_path ./output/ --device gpu:0` |
|
|
|
+| Open Vocabulary Detection | `paddlex --pipeline open_vocabulary_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/open_vocabulary_detection.jpg --prompt "bus . walking man . rearview mirror ." --thresholds "{'text_threshold': 0.25, 'box_threshold': 0.3}" --save_path ./output --device gpu:0` |
|
|
|
+| Open Vocabulary Segmentation | `paddlex --pipeline open_vocabulary_segmentation --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/open_vocabulary_segmentation.jpg --prompt_type box --prompt "[[112.9,118.4,513.8,382.1],[4.6,263.6,92.2,336.6],[592.4,260.9,607.2,294.2]]" --save_path ./output --device gpu:0` |
|
|
|
+| Rotated Object Detection | `paddlex --pipeline rotated_object_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/rotated_object_detection_001.png --threshold 0.5 --save_path ./output --device gpu:0` |
|
|
|
+| General OCR | `paddlex --pipeline OCR --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_002.png --use_doc_orientation_classify False --use_doc_unwarping False --use_textline_orientation False --save_path ./output --device gpu:0` |
|
|
|
+| Document Image Preprocessor | `paddlex --pipeline doc_preprocessor --input https://paddle-model-ecology.bj.bcebos.com/paddlex/demo_image/doc_test_rotated.jpg --use_doc_orientation_classify True --use_doc_unwarping True --save_path ./output --device gpu:0` |
|
|
|
+| General Table Recognition | `paddlex --pipeline table_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/table_recognition.jpg --save_path ./output --device gpu:0` |
|
|
|
+| General Table Recognition v2 | `paddlex --pipeline table_recognition_v2 --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/table_recognition.jpg --save_path ./output --device gpu:0` |
|
|
|
+| General Layout Parsing | `paddlex --pipeline layout_parsing --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/demo_paper.png --use_doc_orientation_classify False --use_doc_unwarping False --use_textline_orientation False --save_path ./output --device gpu:0` |
|
|
|
+| General Layout Parsing v2 | `paddlex --pipeline layout_parsing_v2 --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/layout_parsing_v2_demo.png --use_doc_orientation_classify False --use_doc_unwarping False --use_textline_orientation False --save_path ./output --device gpu:0` |
|
|
|
+| Formula Recognition | `paddlex --pipeline formula_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/demo_image/general_formula_recognition.png --use_layout_detection True --use_doc_orientation_classify False --use_doc_unwarping False --layout_threshold 0.5 --layout_nms True --layout_unclip_ratio 1.0 --layout_merge_bboxes_mode large --save_path ./output --device gpu:0` |
|
|
|
+| Seal Text Recognition | `paddlex --pipeline seal_recognition --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/seal_text_det.png --use_doc_orientation_classify False --use_doc_unwarping False --device gpu:0 --save_path ./output` |
|
|
|
+| Time Series Forecasting | `paddlex --pipeline ts_forecast --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_fc.csv --device gpu:0 --save_path ./output` |
|
|
|
+| Time Series Anomaly Detection | `paddlex --pipeline ts_anomaly_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_ad.csv --device gpu:0 --save_path ./output` |
|
|
|
+| Time Series Classification | `paddlex --pipeline ts_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/ts/demo_ts/ts_cls.csv --device gpu:0 --save_path ./output` |
|
|
|
+| Multilingual Speech Recognition | `paddlex --pipeline multilingual_speech_recognition --input https://paddlespeech.bj.bcebos.com/PaddleAudio/zh.wav --save_path ./output --device gpu:0` |
|
|
|
+| General Video Classification | `paddlex --pipeline video_classification --input https://paddle-model-ecology.bj.bcebos.com/paddlex/videos/demo_video/general_video_classification_001.mp4 --topk 5 --save_path ./output --device gpu:0` |
|
|
|
+| General Video Detection | `paddlex --pipeline video_detection --input https://paddle-model-ecology.bj.bcebos.com/paddlex/videos/demo_video/HorseRiding.avi --device gpu:0 --save_path ./output` |
|
|
|
|
|
|
</details>
|
|
|
|
|
|
@@ -649,10 +654,10 @@ for res in output:
|
|
|
The following steps are executed:
|
|
|
|
|
|
* `create_pipeline()` instantiates the pipeline object
|
|
|
-* Passes the image and calls the `predict` method of the pipeline object for inference prediction
|
|
|
+* Passes the image and calls the `predict()` method of the pipeline object for inference prediction
|
|
|
* Processes the prediction results
|
|
|
|
|
|
-For other pipelines in Python scripts, just adjust the `pipeline` parameter of the `create_pipeline()` method to the corresponding name of the pipeline. Below is a list of each pipeline's corresponding parameter name and detailed usage explanation:
|
|
|
+To use the Python script for other pipelines, simply adjust the `pipeline` parameter in the `create_pipeline()` method to the name of the corresponding pipeline and modify the parameters accordingly. Below are the parameter names and detailed usage explanations for each pipeline:
|
|
|
|
|
|
<details>
|
|
|
<summary>👉 More Python script usage for pipelines</summary>
|
|
|
@@ -671,14 +676,25 @@ For other pipelines in Python scripts, just adjust the `pipeline` parameter of t
|
|
|
| Face Recognition | `face_recognition` | [Face Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/face_recognition.html) |
|
|
|
| Pedestrian Attribute Recognition | `pedestrian_attribute` | [Pedestrian Attribute Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute.html) |
|
|
|
|Vehicle Attribute Recognition | `vehicle_attribute` | [Vehicle Attribute Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute.html) |
|
|
|
-| OCR | `OCR` | [ OCR Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/OCR.html) |
|
|
|
-| Table Recognition | `table_recognition` | [Table Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/table_recognition.html) |
|
|
|
-| Layout Parsing | `layout_parsing` | [Layout Parsing Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.html) |
|
|
|
-| Formula Recognition | `formula_recognition` | [Formula Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.html) |
|
|
|
-| Seal Recognition | `seal_recognition` | [Seal Recognition Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.html) |
|
|
|
-| Time Series Forecast | `ts_forecast` | [ Time Series Forecast Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.html) |
|
|
|
-| Time Series Anomaly Detection | `ts_anomaly_detection` | [ Time Series Anomaly Detection Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.html) |
|
|
|
-| Time Series Classification | `ts_cls` | [ Time Series Classification Pipeline Python Script Usage Instructions](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.html) |
|
|
|
+| 3D Multi-modal Fusion Detection | `3d_bev_detection` | [Instructions for Using the 3D Multi-modal Fusion Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/3d_bev_detection.html#222-python-script-integration) |
|
|
|
+| Human Keypoint Detection | `human_keypoint_detection` | [Instructions for Using the Human Keypoint Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/human_keypoint_detection.html#222-python-script-integration) |
|
|
|
+| Open Vocabulary Detection | `open_vocabulary_detection` | [Instructions for Using the Open Vocabulary Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_detection.html#212-python-script-integration) |
|
|
|
+| Open Vocabulary Segmentation | `open_vocabulary_segmentation` | [Instructions for Using the Open Vocabulary Segmentation Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/open_vocabulary_segmentation.html#212-python-script-integration) |
|
|
|
+| Rotated Object Detection | `rotated_object_detection` | [Instructions for Using the Rotated Object Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/cv_pipelines/rotated_object_detection.html#212-python-script-integration) |
|
|
|
+| OCR | `OCR` | [Instructions for Using the General OCR Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/OCR.html#222-python-script-integration) |
|
|
|
+| Document Image Preprocessing | `doc_preprocessor` | [Instructions for Using the Document Image Preprocessing Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/doc_preprocessor.html#212-python-script-integration) |
|
|
|
+| General Table Recognition | `table_recognition` | [Instructions for Using the General Table Recognition Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/table_recognition.html#22-python-script-integration) |
|
|
|
+| General Table Recognition v2 | `table_recognition_v2` | [Instructions for Using the General Table Recognition v2 Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/table_recognition_v2.html#22-python-script-integration) |
|
|
|
+| General Layout Parsing | `layout_parsing` | [Instructions for Using the General Layout Parsing Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.html#22-python-script-integration) |
|
|
|
+| General Layout Parsing v2 | `layout_parsing_v2` | [Instructions for Using the General Layout Parsing v2 Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/layout_parsing_v2.html#22-python-script-integration) |
|
|
|
+| Formula Recognition | `formula_recognition` | [Instructions for Using the Formula Recognition Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.html#22-python-script-integration) |
|
|
|
+| Seal Text Recognition | `seal_recognition` | [Instructions for Using the Seal Text Recognition Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.html#22-python-script-integration) |
|
|
|
+| Time Series Forecasting | `ts_forecast` | [Instructions for Using the Time Series Forecasting Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.html#222-python-script-integration) |
|
|
|
+| Time Series Anomaly Detection | `ts_anomaly_detection` | [Instructions for Using the Time Series Anomaly Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.html#222-python-script-integration) |
|
|
|
+| Time Series Classification | `ts_classification` | [Instructions for Using the Time Series Classification Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.html#222-python-script-integration) |
|
|
|
+| Multilingual Speech Recognition | `multilingual_speech_recognition` | [Instructions for Using the Multilingual Speech Recognition Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/multilingual_speech_recognition.html#212-python-script-integration) |
|
|
|
+| General Video Classification | `video_classification` | [Instructions for Using the General Video Classification Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/video_classification.html#22-python-script-integration) |
|
|
|
+| General Video Detection | `video_detection` | [Instructions for Using the General Video Detection Pipeline Python Script](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/tutorials/time_series_pipelines/video_detection.html#212-python-script-integration) |
|
|
|
</details>
|
|
|
|
|
|
## 📖 Documentation
|
|
|
@@ -744,12 +760,12 @@ For other pipelines in Python scripts, just adjust the `pipeline` parameter of t
|
|
|
</details>
|
|
|
|
|
|
* <details open>
|
|
|
- <summary> <b> 🎤 Speech Analysis</b> </summary>
|
|
|
+ <summary> <b> 🎤 Speech Recognition</b> </summary>
|
|
|
|
|
|
* [🌐 Multilingual Speech Recognition Pipeline Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/speech_pipelines/multilingual_speech_recognition.html)
|
|
|
|
|
|
* <details open>
|
|
|
- <summary> <b> 🎥 Video Processing</b> </summary>
|
|
|
+ <summary> <b> 🎥 Video Recognition</b> </summary>
|
|
|
|
|
|
* [📈 General Video Classification Pipeline Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/video_pipelines/video_classification.html)
|
|
|
* [🔍 General Video Detection Pipeline Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/tutorials/video_pipelines/video_detection.html)
|
|
|
@@ -809,18 +825,36 @@ For other pipelines in Python scripts, just adjust the `pipeline` parameter of t
|
|
|
* [🔍 Mainbody Detection Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/mainbody_detection.html)
|
|
|
* [🚶 Pedestrian Detection Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/human_detection.html)
|
|
|
* [🚗 Vehicle Detection Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/vehicle_detection.html)
|
|
|
- * [🚶♂️ Human Keypoint Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/human_keypoint_detection.html)
|
|
|
- * [🌐 Open-Vocabulary Object Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/open_vocabulary_detection.html)
|
|
|
* [🔄 Rotated Object Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/rotated_object_detection.html)
|
|
|
|
|
|
</details>
|
|
|
|
|
|
* <details open>
|
|
|
+ <summary> <b> 🌐 Open-Vocabulary Object Detection </b></summary>
|
|
|
+
|
|
|
+ * [🌐 Open-Vocabulary Object Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/open_vocabulary_detection.html)
|
|
|
+</details>
|
|
|
+
|
|
|
+* <details open>
|
|
|
+ <summary> <b> 🎯 Keypoint Detection </b></summary>
|
|
|
+
|
|
|
+ * [🚶♂️ Human Keypoint Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/human_keypoint_detection.html)
|
|
|
+ </details>
|
|
|
+
|
|
|
+
|
|
|
+
|
|
|
+
|
|
|
+* <details open>
|
|
|
<summary> <b> 🖼️ Image Segmentation </b></summary>
|
|
|
|
|
|
* [🗺️ Semantic Segmentation Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/semantic_segmentation.html)
|
|
|
* [🔍 Instance Segmentation Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/instance_segmentation.html)
|
|
|
* [🚨 Image Anomaly Detection Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/anomaly_detection.html)
|
|
|
+ </details>
|
|
|
+
|
|
|
+* <details open>
|
|
|
+ <summary> <b> 🌐 Open-Vocabulary Segmentation </b></summary>
|
|
|
+
|
|
|
* [🌐 Open-Vocabulary Segmentation Module Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/open_vocabulary_segmentation.html)
|
|
|
</details>
|
|
|
|
|
|
@@ -833,17 +867,17 @@ For other pipelines in Python scripts, just adjust the `pipeline` parameter of t
|
|
|
</details>
|
|
|
|
|
|
* <details open>
|
|
|
- <summary> <b> 🌐 3D </b></summary>
|
|
|
+ <summary> <b> 📦 3D </b></summary>
|
|
|
|
|
|
- * [🚗 3D Multimodal Fusion Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/3d_bev_detection.html)
|
|
|
+ * [📦 3D Multimodal Fusion Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/cv_modules/3d_bev_detection.html)
|
|
|
|
|
|
* <details open>
|
|
|
- <summary> <b> 🎤 Speech </b></summary>
|
|
|
+ <summary> <b> 🎤 Speech Recognition </b></summary>
|
|
|
|
|
|
* [🌐 Multilingual Speech Recognition Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/speech_modules/multilingual_speech_recognition.html)
|
|
|
|
|
|
* <details open>
|
|
|
- <summary> <b> 🎥 Video </b></summary>
|
|
|
+ <summary> <b> 🎥 Video Recognition </b></summary>
|
|
|
|
|
|
* [📈 Video Classification Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/module_usage/tutorials/video_modules/video_classification.html)
|
|
|
* [🔍 Video Detection Module Usage Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/video_modules/video_detection.html)
|