[Docs] Add high-stability serving docs (#2681)

* Add high-stability serving docs

* Add notice

* Add return_img_urls option

* Fix typos

* main operation->primary operation

* Update docs

* Revert code change

* Fix

* Fix invalid links

* Remove serial_number and update_license

* Fix dead links

* Fix typos

* Fix
Lin Manhui committed 10 months ago
Commit 75bd49fca0
88 changed files with 1167 additions and 2303 deletions
  1. 1 1
      .github/ISSUE_TEMPLATE/3_deploy.md
  2. 3 3
      README.md
  3. 6 6
      README_en.md
  4. 3 3
      docs/pipeline_deploy/high_performance_inference.en.md
  5. 2 2
      docs/pipeline_deploy/high_performance_inference.md
  6. 0 118
      docs/pipeline_deploy/service_deploy.en.md
  7. 0 118
      docs/pipeline_deploy/service_deploy.md
  8. 332 0
      docs/pipeline_deploy/serving.en.md
  9. 333 0
      docs/pipeline_deploy/serving.md
  10. 1 1
      docs/pipeline_usage/pipeline_develop_guide.en.md
  11. 1 1
      docs/pipeline_usage/pipeline_develop_guide.md
  12. 10 10
      docs/pipeline_usage/tutorials/cv_pipelines/face_recognition.en.md
  13. 7 7
      docs/pipeline_usage/tutorials/cv_pipelines/face_recognition.md
  14. 9 9
      docs/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.en.md
  15. 7 7
      docs/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.md
  16. 4 4
      docs/pipeline_usage/tutorials/cv_pipelines/human_keypoint_detection.md
  17. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.en.md
  18. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.md
  19. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/image_classification.en.md
  20. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/image_classification.md
  21. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.en.md
  22. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.md
  23. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.en.md
  24. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.md
  25. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/object_detection.en.md
  26. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/object_detection.md
  27. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute_recognition.en.md
  28. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute_recognition.md
  29. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.en.md
  30. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.md
  31. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/small_object_detection.en.md
  32. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/small_object_detection.md
  33. 5 5
      docs/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute_recognition.en.md
  34. 3 3
      docs/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute_recognition.md
  35. 30 26
      docs/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction.en.md
  36. 33 29
      docs/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction.md
  37. 45 379
      docs/pipeline_usage/tutorials/ocr_pipelines/OCR.en.md
  38. 45 400
      docs/pipeline_usage/tutorials/ocr_pipelines/OCR.md
  39. 4 4
      docs/pipeline_usage/tutorials/ocr_pipelines/doc_preprocessor.md
  40. 6 36
      docs/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.en.md
  41. 4 34
      docs/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.md
  42. 6 31
      docs/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.en.md
  43. 4 29
      docs/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.md
  44. 10 35
      docs/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.en.md
  45. 8 33
      docs/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.md
  46. 36 402
      docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition.en.md
  47. 35 421
      docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition.md
  48. 5 5
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.en.md
  49. 3 3
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.md
  50. 5 5
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.en.md
  51. 3 3
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.md
  52. 5 5
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.en.md
  53. 3 3
      docs/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.md
  54. 6 6
      docs/pipeline_usage/tutorials/video_pipelines/video_classification.en.md
  55. 3 3
      docs/pipeline_usage/tutorials/video_pipelines/video_classification.md
  56. 22 8
      docs/pipeline_usage/tutorials/video_pipelines/video_detection.en.md
  57. 18 4
      docs/pipeline_usage/tutorials/video_pipelines/video_detection.md
  58. 1 1
      docs/practical_tutorials/anomaly_detection_tutorial.en.md
  59. 1 1
      docs/practical_tutorials/anomaly_detection_tutorial.md
  60. 2 2
      docs/practical_tutorials/deployment_tutorial.en.md
  61. 2 2
      docs/practical_tutorials/deployment_tutorial.md
  62. 1 1
      docs/practical_tutorials/document_scene_information_extraction(layout_detection)_tutorial.en.md
  63. 1 1
      docs/practical_tutorials/document_scene_information_extraction(layout_detection)_tutorial.md
  64. 1 1
      docs/practical_tutorials/document_scene_information_extraction(seal_recognition)_tutorial.en.md
  65. 1 1
      docs/practical_tutorials/document_scene_information_extraction(seal_recognition)_tutorial.md
  66. 1 1
      docs/practical_tutorials/image_classification_garbage_tutorial.en.md
  67. 1 1
      docs/practical_tutorials/image_classification_garbage_tutorial.md
  68. 1 1
      docs/practical_tutorials/instance_segmentation_remote_sensing_tutorial.en.md
  69. 1 1
      docs/practical_tutorials/instance_segmentation_remote_sensing_tutorial.md
  70. 1 1
      docs/practical_tutorials/object_detection_fall_tutorial.en.md
  71. 1 1
      docs/practical_tutorials/object_detection_fall_tutorial.md
  72. 1 1
      docs/practical_tutorials/object_detection_fashion_pedia_tutorial.en.md
  73. 1 1
      docs/practical_tutorials/object_detection_fashion_pedia_tutorial.md
  74. 1 1
      docs/practical_tutorials/ocr_det_license_tutorial.en.md
  75. 1 1
      docs/practical_tutorials/ocr_det_license_tutorial.md
  76. 1 1
      docs/practical_tutorials/ocr_rec_chinese_tutorial.en.md
  77. 1 1
      docs/practical_tutorials/ocr_rec_chinese_tutorial.md
  78. 1 1
      docs/practical_tutorials/semantic_segmentation_road_tutorial.en.md
  79. 1 1
      docs/practical_tutorials/semantic_segmentation_road_tutorial.md
  80. 3 3
      docs/practical_tutorials/small_object_detection_tutorial.en.md
  81. 1 1
      docs/practical_tutorials/small_object_detection_tutorial.md
  82. 1 1
      docs/practical_tutorials/ts_anomaly_detection.en.md
  83. 1 1
      docs/practical_tutorials/ts_anomaly_detection.md
  84. 1 1
      docs/practical_tutorials/ts_classification.en.md
  85. 1 1
      docs/practical_tutorials/ts_classification.md
  86. 1 1
      docs/practical_tutorials/ts_forecast.en.md
  87. 1 1
      docs/practical_tutorials/ts_forecast.md
  88. 3 3
      mkdocs.yml

+ 1 - 1
.github/ISSUE_TEMPLATE/3_deploy.md

@@ -26,7 +26,7 @@ assignees: ''
 
 2. 服务化部署
 
-    * 您是否完全按照[服务化部署文档教程](https://paddlepaddle.github.io/PaddleX/main/pipeline_deploy/service_deploy.html)跑通了流程?
+    * 您是否完全按照[服务化部署文档教程](https://paddlepaddle.github.io/PaddleX/main/pipeline_deploy/serving.html)跑通了流程?
 
     * 您在服务化部署中是否有使用高性能推理插件,如果是,您使用的是离线激活方式还是在线激活方式?
 

+ 3 - 3
README.md

@@ -35,7 +35,7 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
 
   🚀 **提高效率降低门槛**:实现基于统一命令和图形界面的模型**全流程开发**,打造大小模型结合、大模型半监督学习和多模型融合的[**8 条特色模型产线**](https://aistudio.baidu.com/intro/paddlex),大幅度降低迭代模型的成本。
 
-  🌐 **多种场景灵活部署**:支持**高性能部署**、**服务化部署**和**端侧部署**等多种部署方式,确保不同应用场景下模型的高效运行和快速响应。
+  🌐 **多种场景灵活部署**:支持**高性能推理**、**服务化部署**和**端侧部署**等多种部署方式,确保不同应用场景下模型的高效运行和快速响应。
 
   🔧 **主流硬件高效支持**:支持英伟达 GPU、昆仑芯、昇腾和寒武纪等**多种主流硬件**的无缝切换,确保高效运行。
 
@@ -59,7 +59,7 @@ PaddleX 3.0 是基于飞桨框架构建的低代码开发工具,它集成了
  ## 📊 能力支持
 
 
-PaddleX的各个产线均支持本地**快速推理**,部分模型支持在[AI Studio星河社区](https://aistudio.baidu.com/overview)上进行**在线体验**,您可以快速体验各个产线的预训练模型效果,如果您对产线的预训练模型效果满意,可以直接对产线进行[高性能推理](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/high_performance_inference.html)/[服务化部署](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/service_deploy.html)/[端侧部署](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/edge_deploy.html),如果不满意,您也可以使用产线的**二次开发**能力,提升效果。完整的产线开发流程请参考[PaddleX产线使用概览](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/pipeline_develop_guide.html)或各产线使用[教程](#-文档)。
+PaddleX的各个产线均支持本地**快速推理**,部分模型支持在[AI Studio星河社区](https://aistudio.baidu.com/overview)上进行**在线体验**,您可以快速体验各个产线的预训练模型效果,如果您对产线的预训练模型效果满意,可以直接对产线进行[高性能推理](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/high_performance_inference.html)/[服务化部署](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/serving.html)/[端侧部署](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/edge_deploy.html),如果不满意,您也可以使用产线的**二次开发**能力,提升效果。完整的产线开发流程请参考[PaddleX产线使用概览](https://paddlepaddle.github.io/PaddleX/latest/pipeline_usage/pipeline_develop_guide.html)或各产线使用[教程](#-文档)。
 
 
 此外,PaddleX在[AI Studio星河社区](https://aistudio.baidu.com/overview)为开发者提供了基于[云端图形化开发界面](https://aistudio.baidu.com/pipeline/mine)的全流程开发工具, 点击【创建产线】,选择对应的任务场景和模型产线,就可以开启全流程开发。详细请参考[教程《零门槛开发产业级AI模型》](https://aistudio.baidu.com/practical/introduce/546656605663301)
@@ -652,7 +652,7 @@ for res in output:
   <summary> <b> 🏗️ 模型产线部署 </b></summary>
 
   * [🚀 PaddleX 高性能推理指南](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/high_performance_inference.html)
-  * [🖥️ PaddleX 服务化部署指南](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/service_deploy.html)
+  * [🖥️ PaddleX 服务化部署指南](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/serving.html)
   * [📱 PaddleX 端侧部署指南](https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/edge_deploy.html)
 
 </details>

+ 6 - 6
README_en.md

@@ -38,7 +38,7 @@ PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePadd
 
   🚀 **High Efficiency and Low barrier of entry**: Achieve model **full-process development** based on graphical interfaces and unified commands, creating **8 featured model pipelines** that combine large and small models, semi-supervised learning of large models, and multi-model fusion, greatly reducing the cost of iterating models.
 
-  🌐 **Flexible Deployment in Various Scenarios**: Support various deployment methods such as **high-performance inference**, **service deployment**, and **lite deployment** to ensure efficient operation and rapid response of models in different application scenarios.
+  🌐 **Flexible Deployment in Various Scenarios**: Support various deployment methods such as **high-performance inference**, **serving**, and **lite deployment** to ensure efficient operation and rapid response of models in different application scenarios.
 
   🔧 **Efficient Support for Mainstream Hardware**: Support seamless switching of various mainstream hardware such as NVIDIA GPUs, Kunlun XPU, Ascend NPU, and Cambricon MLU to ensure efficient operation.
 
@@ -50,7 +50,7 @@ PaddleX 3.0 is a low-code development tool for AI models built on the PaddlePadd
 
 🔥🔥 **11.15, 2024**, PaddleX 3.0 Beta2 open source version is officially released, PaddleX 3.0 Beta2 is fully compatible with the PaddlePaddle 3.0b2 version. <b>This update introduces new pipelines for general image recognition, face recognition, vehicle attribute recognition, and pedestrian attribute recognition. We have also developed 42 new models to fully support the Ascend 910B, with extensive documentation available on [GitHub Pages](https://paddlepaddle.github.io/PaddleX/latest/en/index.html).</b>
 
-🔥🔥 **9.30, 2024**, PaddleX 3.0 Beta1 open source version is officially released, providing **more than 200 models** that can be called with a simple Python API; achieve model full-process development based on unified commands, and open source the basic capabilities of the **PP-ChatOCRv3** pipeline; support **more than 100 models for high-performance inference and service-oriented deployment** (iterating continuously), **more than 7 key visual models for edge-deployment**; **more than 70 models have been adapted for the full development process of Ascend 910B**, **more than 15 models have been adapted for the full development process of Kunlun chips and Cambricon**
+🔥🔥 **9.30, 2024**, PaddleX 3.0 Beta1 open source version is officially released, providing **more than 200 models** that can be called with a simple Python API; achieve model full-process development based on unified commands, and open source the basic capabilities of the **PP-ChatOCRv3** pipeline; support **more than 100 models for high-performance inference and serving** (iterating continuously), **more than 7 key visual models for edge-deployment**; **more than 70 models have been adapted for the full development process of Ascend 910B**, **more than 15 models have been adapted for the full development process of Kunlun chips and Cambricon**
 
 🔥 **6.27, 2024**, PaddleX 3.0 Beta open source version is officially released, supporting the use of various mainstream hardware for pipeline and model development in a low-code manner on the local side.
 
@@ -62,7 +62,7 @@ PaddleX is dedicated to achieving pipeline-level model training, inference, and
 ## 📊 What can PaddleX do?
 
 
-All pipelines of PaddleX support **online experience** on [AI Studio]((https://aistudio.baidu.com/overview)) and local **fast inference**. You can quickly experience the effects of each pre-trained pipeline. If you are satisfied with the effects of the pre-trained pipeline, you can directly perform [high-performance inference](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/high_performance_inference.html) / [serving deployment](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/service_deploy.html) / [edge deployment](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/edge_deploy.html) on the pipeline. If not satisfied, you can also **Custom Development** to improve the pipeline effect. For the complete pipeline development process, please refer to the [PaddleX pipeline Development Tool Local Use Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/pipeline_develop_guide.html).
+All pipelines of PaddleX support **online experience** on [AI Studio]((https://aistudio.baidu.com/overview)) and local **fast inference**. You can quickly experience the effects of each pre-trained pipeline. If you are satisfied with the effects of the pre-trained pipeline, you can directly perform [high-performance inference](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/high_performance_inference.html) / [serving](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/serving.html) / [edge deployment](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/edge_deploy.html) on the pipeline. If not satisfied, you can also **Custom Development** to improve the pipeline effect. For the complete pipeline development process, please refer to the [PaddleX pipeline Development Tool Local Use Tutorial](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_usage/pipeline_develop_guide.html).
 
 In addition, PaddleX provides developers with a full-process efficient model training and deployment tool based on a [cloud-based GUI](https://aistudio.baidu.com/pipeline/mine). Developers **do not need code development**, just need to prepare a dataset that meets the pipeline requirements to **quickly start model training**. For details, please refer to the tutorial ["Developing Industrial-level AI Models with Zero Barrier"](https://aistudio.baidu.com/practical/introduce/546656605663301).
 
@@ -72,7 +72,7 @@ In addition, PaddleX provides developers with a full-process efficient model tra
         <th>Online Experience</th>
         <th>Local Inference</th>
         <th>High-Performance Inference</th>
-        <th>Service-Oriented Deployment</th>
+        <th>Serving</th>
         <th>Edge Deployment</th>
         <th>Custom Development</th>
         <th><a href="https://aistudio.baidu.com/pipeline/mine">Zero-Code Development On AI Studio</a></td>
@@ -279,7 +279,7 @@ In addition, PaddleX provides developers with a full-process efficient model tra
     </tr>
 </table>
 
-> ❗Note: The above capabilities are implemented based on GPU/CPU. PaddleX can also perform local inference and custom development on mainstream hardware such as Kunlunxin, Ascend, Cambricon, and Haiguang. The table below details the support status of the pipelines. For specific supported model lists, please refer to the [Model List (Kunlunxin XPU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_xpu.html)/[Model List (Ascend NPU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_npu.html)/[Model List (Cambricon MLU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_mlu.html)/[Model List (Haiguang DCU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_dcu.html). We are continuously adapting more models and promoting the implementation of high-performance and service-oriented deployment on mainstream hardware.
+> ❗Note: The above capabilities are implemented based on GPU/CPU. PaddleX can also perform local inference and custom development on mainstream hardware such as Kunlunxin, Ascend, Cambricon, and Haiguang. The table below details the support status of the pipelines. For specific supported model lists, please refer to the [Model List (Kunlunxin XPU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_xpu.html)/[Model List (Ascend NPU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_npu.html)/[Model List (Cambricon MLU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_mlu.html)/[Model List (Haiguang DCU)](https://paddlepaddle.github.io/PaddleX/latest/en/support_list/model_list_dcu.html). We are continuously adapting more models and promoting the implementation of high-performance and serving on mainstream hardware.
 
 🔥🔥 **Support for Domestic Hardware Capabilities**
 
@@ -648,7 +648,7 @@ For other pipelines in Python scripts, just adjust the `pipeline` parameter of t
   <summary> <b> 🏗️ Pipeline Deployment </b></summary>
 
   * [🚀 PaddleX High-Performance Inference Guide](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/high_performance_inference.html)
-  * [🖥️ PaddleX Service Deployment Guide](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/service_deploy.html)
+  * [🖥️ PaddleX Serving Guide](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/serving.html)
   * [📱 PaddleX Edge Deployment Guide](https://paddlepaddle.github.io/PaddleX/latest/en/pipeline_deploy/edge_deploy.html)
 
 </details>

+ 3 - 3
docs/pipeline_deploy/high_performance_inference.en.md

@@ -87,7 +87,7 @@ paddlex \
     --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg \
     --device gpu:0 \
     --use_hpip \
-    --serial_number {serial_number}
+    --serial_number {serial number}
 
 # If you wish to perform online activation
 paddlex \
@@ -95,7 +95,7 @@ paddlex \
     --input https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg \
     --device gpu:0 \
     --use_hpip \
-    --serial_number {serial_number} \
+    --serial_number {serial number} \
     --update_license
 ```
 
@@ -107,7 +107,7 @@ from paddlex import create_pipeline
 pipeline = create_pipeline(
     pipeline="image_classification",
     use_hpip=True,
-    hpi_params={"serial_number": "{serial_number}"},
+    hpi_params={"serial_number": "{serial number}"},
 )
 
 output = pipeline.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg")
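
For context, the `predict` call shown above returns an iterable of results; a minimal sketch of consuming it, assuming the standard PaddleX result helpers (`print`, `save_to_img`) used throughout the pipeline tutorials:

```python
# A minimal sketch, assuming the standard PaddleX result helpers
# documented in the pipeline tutorials.
for res in output:
    res.print()                   # print the structured prediction
    res.save_to_img("./output/")  # save the visualized image
```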

+ 2 - 2
docs/pipeline_deploy/high_performance_inference.md

@@ -6,7 +6,7 @@ comments: true
 
 在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速。本文档将首先介绍高性能推理插件的安装和使用方式,然后列举目前支持使用高性能推理插件的产线与模型。
 
-## 1.高性能推理插件的安装与使用
+## 1. 高性能推理插件的安装与使用
 
 使用高性能推理插件前,请确保您已经按照[PaddleX本地安装教程](../installation/installation.md) 完成了PaddleX的安装,且按照PaddleX产线命令行使用说明或PaddleX产线Python脚本使用说明跑通了产线的快速推理。
 
@@ -172,7 +172,7 @@ PaddleX 结合模型信息与运行环境信息为每个模型提供默认的高
 
     在完成修改后,请删除模型目录中的缓存文件(`shape_range_info.pbtxt` 与 `trt_serialized` 开头的文件)。
 
-## 2支持使用高性能推理插件的产线与模型
+## 2. 支持使用高性能推理插件的产线与模型
 
 <table>
   <tr>

+ 0 - 118
docs/pipeline_deploy/service_deploy.en.md

@@ -1,118 +0,0 @@
----
-comments: true
----
-
-# PaddleX Serving Deployment Guide
-
-Serving deployment is a common form of deployment in real-world production environments. By encapsulating inference capabilities as services, clients can access these services through network requests to obtain inference results. PaddleX enables users to achieve low-cost serving deployment for production lines. This document will first introduce the basic process of serving deployment using PaddleX, followed by considerations and potential operations when using the service in a production environment.
-
-<b>Note</b>
-- <b>Serving deployment provides services for model pipelines, not specific to individual pipeline modules.</b>
-
-Serving Deployment Example Diagram:
-
-<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/serving_en.png"  width="300" />
-
-## 1. Basic Process
-
-### 1.1 Install the Serving Deployment Plugin
-
-Execute the following command to install the serving deployment plugin:
-
-```shell
-paddlex --install serving
-```
-
-### 1.2 Start the Service
-
-Start the service through the PaddleX CLI with the following command format:
-
-```shell
-paddlex --serve --pipeline {pipeline_name_or_path} [{other_command_line_options}]
-```
-
-Taking the General Image Classification Pipeline as an example:
-
-```shell
-paddlex --serve --pipeline image_classification
-```
-
-After the service starts successfully, you will see information similar to the following:
-
-```
-INFO:     Started server process [63108]
-INFO:     Waiting for application startup.
-INFO:     Application startup complete.
-INFO:     Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
-```
-
-`--pipeline` can be specified as an official pipeline name or the path to a local pipeline configuration file. PaddleX uses this to build the pipeline and deploy it as a service. To adjust configurations (such as model path, batch_size, deployment device), please refer to the <b>"Model Application"</b> section in the [General Image Classification Pipeline Tutorial](../pipeline_usage/tutorials/cv_pipelines/image_classification.en.md).
-
-Command-line options related to serving deployment are as follows:
-
-<table>
-<thead>
-<tr>
-<th>Name</th>
-<th>Description</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>--pipeline</code></td>
-<td>Pipeline name or pipeline configuration file path.</td>
-</tr>
-<tr>
-<td><code>--device</code></td>
-<td>Deployment device for the pipeline. Defaults to <code>cpu</code> (If GPU is unavailable) or <code>gpu</code> (If GPU is available).</td>
-</tr>
-<tr>
-<td><code>--host</code></td>
-<td>Hostname or IP address bound to the server. Defaults to 0.0.0.0.</td>
-</tr>
-<tr>
-<td><code>--port</code></td>
-<td>Port number listened to by the server. Defaults to 8080.</td>
-</tr>
-<tr>
-<td><code>--use_hpip</code></td>
-<td>Enables the high-performance inference plugin if specified.</td>
-</tr>
-<tr>
-<td><code>--serial_number</code></td>
-<td>Serial number used by the high-performance inference plugin. Only valid when the high-performance inference plugin is enabled. Note that not all pipelines and models support the use of the high-performance inference plugin. For detailed support, please refer to the <a href="./high_performance_inference.en.md">PaddleX High-Performance Inference Guide</a>.</td>
-</tr>
-<tr>
-<td><code>--update_license</code></td>
-<td>Activates the license online if specified. Only valid when the high-performance inference plugin is enabled.</td>
-</tr>
-</tbody>
-</table>
-</table>
-
-### 1.3 Call the Service
-
-Please refer to the <b>"Development Integration/Deployment"</b> section in the usage tutorials for each pipeline. You can find the pipeline tutorials [here](../pipeline_usage/pipeline_develop_guide.en.md).
-
-## 2. Deploy Services for Production
-
-When deploying services into production environments, the stability, efficiency, and security of the services are of paramount importance. Below are some recommendations for deploying services into production.
-
-### 2.1 Utilize PaddleX high-performance inference Plugin
-
-In scenarios where strict response time requirements are imposed on applications, the PaddleX high-performance inference Plugin can be used to accelerate model inference and pre/post-processing, thereby reducing response time and increasing throughput.
-
-To use the PaddleX high-performance inference Plugin, please refer to the [PaddleX High-Performance Inference Guide](./high_performance_inference.en.md) for installing the high-performance inference plugin, obtaining serial numbers, and activating the plugin. Additionally, not all pipelines, models, and environments support the use of the high-performance inference plugin. For detailed support information, please refer to the section on pipelines and models that support the high-performance inference plugin.
-
-When starting the PaddleX pipeline service, you can specify `--use_hpip` along with the serial number to use the high-performance inference plugin. If you wish to perform online activation, you should also specify `--update_license`. Example usage:
-
-```bash
-paddlex --serve --pipeline image_classification --use_hpip --serial_number {serial_number}
-
-# If you wish to perform online activation
-paddlex --serve --pipeline image_classification --use_hpip --serial_number {serial_number} --update_license
-```
-
-### 2.2 Consider Security
-
-A typical scenario involves an application accepting inputs from the network, with the PaddleX pipeline service acting as a module within the application, interacting with other modules through APIs. In this case, the position of the PaddleX pipeline service within the application is crucial. The service-oriented deployment solution provided by PaddleX focuses on efficiency and ease of use but does not perform sufficient security checks on request bodies. Malicious requests from the network, such as excessively large images or carefully crafted data, can lead to severe consequences like service crashes. Therefore, it is recommended to place the PaddleX pipeline service within the application's internal network, avoiding direct processing of external inputs, and ensuring it only processes trustworthy requests. Appropriate protective measures, such as input validation and authentication, should be added at the application's outer layer.

+ 0 - 118
docs/pipeline_deploy/service_deploy.md

@@ -1,118 +0,0 @@
----
-comments: true
----
-
-# PaddleX 服务化部署指南
-
-服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署。本文档将首先介绍使用 PaddleX 进行服务化部署的基本流程,然后介绍将服务用于生产环境时的注意事项以及可能需要执行的操作。
-
-<b>注意</b>
-- <b>服务化部署提供模型产线的服务,而不具体到某个产线模块。</b>
-
-服务化部署示例图:
-
-<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/serving.png"  width="300" />
-
-## 1、基本流程
-
-### 1.1 安装服务化部署插件
-
-执行如下指令,安装服务化部署插件:
-
-```
-paddlex --install serving
-```
-
-### 1.2 启动服务
-
-通过 PaddleX CLI 启动服务,指令格式为:
-
-```shell
-paddlex --serve --pipeline {产线名称或产线配置文件路径} [{其他命令行选项}]
-```
-
-以通用图像分类产线为例:
-
-```shell
-paddlex --serve --pipeline image_classifcation
-```
-
-服务启动成功后,可以看到类似以下展示的信息:
-
-```
-INFO:     Started server process [63108]
-INFO:     Waiting for application startup.
-INFO:     Application startup complete.
-INFO:     Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
-```
-
---pipeline可指定为官方产线名称或本地产线配置文件路径。PaddleX 以此构建产线并部署为服务。如需调整配置(如模型路径、batch_size、部署设备等),请参考[通用图像分类产线使用教程](../pipeline_usage/tutorials/cv_pipelines/image_classification.md)中的 <b>“模型应用</b> 部分。
-
-与服务化部署相关的命令行选项如下:
-
-<table>
-<thead>
-<tr>
-<th>名称</th>
-<th>说明</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>--pipeline</code></td>
-<td>产线名称或产线配置文件路径。</td>
-</tr>
-<tr>
-<td><code>--device</code></td>
-<td>产线部署设备。默认为 <code>cpu</code>(如 GPU 不可用)或 <code>gpu</code>(如 GPU 可用)。</td>
-</tr>
-<tr>
-<td><code>--host</code></td>
-<td>服务器绑定的主机名或 IP 地址。默认为0.0.0.0。</td>
-</tr>
-<tr>
-<td><code>--port</code></td>
-<td>服务器监听的端口号。默认为8080。</td>
-</tr>
-<tr>
-<td><code>--use_hpip</code></td>
-<td>如果指定,则启用高性能推理插件。</td>
-</tr>
-<tr>
-<td><code>--serial_number</code></td>
-<td>高性能推理插件使用的序列号。只在启用高性能推理插件时生效。 请注意,并非所有产线、模型都支持使用高性能推理插件,详细的支持情况请参考<a href="./high_performance_inference.md">PaddleX 高性能推理指南</a>。</td>
-</tr>
-<tr>
-<td><code>--update_license</code></td>
-<td>如果指定,则进行联网激活。只在启用高性能推理插件时生效。</td>
-</tr>
-</tbody>
-</table>
-</table>
-
-### 1.3 调用服务
-
-请参考各产线使用教程中的 <b>“开发集成/部署”</b> 部分。在[此处](../pipeline_usage/pipeline_develop_guide.md)可以找到各产线的使用教程。
-
-## 2、将服务用于生产
-
-将服务应用于生产环境中时,服务的稳定性、高效性与安全性尤为重要。以下是一些针对将服务用于生产的建议。
-
-### 2.1 使用 PaddleX 高性能推理插件
-
-在对于服务响应时间要求较严格的应用场景中,可以使用 PaddleX 高性能推理插件对模型推理及前后处理进行加速,从而降低响应时间、提升吞吐量。
-
-使用 PaddleX 高性能推理插件,请参考[PaddleX 高性能推理指南](./high_performance_inference.md)中安装高性能推理插件、获取序列号与激活部分完成插件的安装与序列号的申请。同时,不是所有的产线、模型和环境都支持使用高性能推理插件,支持的详细情况请参考支持使用高性能推理插件的产线与模型部分。
-
-在启动 PaddleX 产线服务时,可以通过指定 `--use_hpip` 及序列号以使用高性能推理插件。如果希望进行联网激活 需指定 `--update_license`。使用示例:
-
-```
-paddlex --serve --pipeline image_classfication --use_hpip --serial_number {序列号}
-
-# 如果希望进行联网激活
-paddlex --serve --pipeline image_classfication --use_hpip --serial_number {序列号} --update_license
-```
-
-### 2.2 考虑安全性
-
-一个典型的场景是,应用接受来自网络的输入,PaddleX 产线服务作为应用中的一个模块,通过 API 与其他模块交互。此时,PaddleX 产线服务在应用中所处的位置尤为重要。PaddleX 提供的服务化部署方案专注于提供高效性和易用性,但未对请求体的内容进行足够的安全检查。来自网络的恶意请求,例如超大尺寸的图像或精心构造的数据,都可能导致服务崩溃等严重后果。因此,建议将 PaddleX 产线服务放置于应用内部,不直接处理外部输入,确保其只处理可信赖的请求。在应用的外层,需要添加合适的防护措施,例如输入验证、身份验证等。

+ 332 - 0
docs/pipeline_deploy/serving.en.md

@@ -0,0 +1,332 @@
+---
+comments: true
+---
+
+# PaddleX Serving Guide
+
+Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines.
+
+Demonstration of PaddleX pipeline serving:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/serving_en.png" width="300"/>
+
+To address different user needs, PaddleX offers multiple pipeline serving solutions:
+
+- Basic serving: A simple and easy-to-use serving solution with low development costs.
+- High-stability serving: Built on [NVIDIA Triton Inference Server](https://developer.nvidia.com/triton-inference-server). Compared to basic serving, this solution offers higher stability and allows users to adjust configurations to optimize performance.
+
+**It is recommended to first use the basic serving solution for quick verification**, and then evaluate whether to try more complex solutions based on actual needs.
+
+<b>Note</b>
+
+- PaddleX serves pipelines rather than modules.
+
+## 1. Basic Serving
+
+### 1.1 Install the Serving Plugin
+
+Execute the following command to install the serving plugin:
+
+```bash
+paddlex --install serving
+```
+
+### 1.2 Run the Server
+
+Run the server via PaddleX CLI:
+
+```bash
+paddlex --serve --pipeline {pipeline name or path to pipeline config file} [{other commandline options}]
+```
+
+Take the general image classification pipeline as an example:
+
+```bash
+paddlex --serve --pipeline image_classification
+```
+
+You should see information similar to the following:
+
+```text
+INFO:     Started server process [63108]
+INFO:     Waiting for application startup.
+INFO:     Application startup complete.
+INFO:     Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
+```
+
+`--pipeline` can be specified as the official pipeline name or the path to a local pipeline configuration file. PaddleX builds the pipeline and deploys it as a service. If you need to adjust configurations (such as model paths, batch size, device for deployment, etc.), please refer to the <b>"Model Application"</b> section in [General Image Classification Pipeline Tutorial](../pipeline_usage/tutorials/cv_pipelines/image_classification.en.md).
+
+The command-line options related to serving are as follows:
+
+<table>
+<thead>
+<tr>
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td><code>--pipeline</code></td>
+<td>Pipeline name or path to the pipeline configuration file.</td>
+</tr>
+<tr>
+<td><code>--device</code></td>
+<td>Device for pipeline deployment. Defaults to <code>cpu</code> (if GPU is unavailable) or <code>gpu</code> (if GPU is available).</td>
+</tr>
+<tr>
+<td><code>--host</code></td>
+<td>Hostname or IP address the server binds to. Defaults to <code>0.0.0.0</code>.</td>
+</tr>
+<tr>
+<td><code>--port</code></td>
+<td>Port number the server listens on. Defaults to <code>8080</code>.</td>
+</tr>
+<tr>
+<td><code>--use_hpip</code></td>
+<td>If specified, enables the high-performance inference plugin.</td>
+</tr>
+</tbody>
+</table>
+
+In application scenarios where strict requirements are placed on service response time, the PaddleX high-performance inference plugin can be used to accelerate model inference and pre/post-processing, thereby reducing response time and increasing throughput.
+
+To use the PaddleX high-performance inference plugin, please refer to the [PaddleX High-Performance Inference Guide](./high_performance_inference.en.md) for instructions on installing the high-performance inference plugin, obtaining a serial number, and completing the activation process. Additionally, note that not all pipelines, models, and environments support the use of the high-performance inference plugin. For detailed information on supported pipelines and models, please refer to the section on supported pipelines and models for high-performance inference plugins.
+
+You can use the `--use_hpip` flag to enable the high-performance inference plugin. An example is as follows:
+
+```bash
+paddlex --serve --pipeline image_classification --use_hpip
+```
+
+### 1.3 Invoke the Service
+
+The "Development Integration/Deployment" section in each pipeline’s tutorial provides API references and multi-language invocation examples for the service. You can find the tutorials for each pipeline [here](../pipeline_usage/pipeline_develop_guide.en.md).
+
+## 2. High-Stability Serving
+
+**Please note that the current high-stability serving solution only supports Linux systems.**
+
+### 2.1 Download the High-Stability Serving SDK
+
+Find the high-stability serving SDK corresponding to the pipeline in the table below and download it:
+
+<details>
+<summary> 👉Click to view</summary>
+<table>
+<thead>
+<tr>
+<th>Pipeline</th>
+<th>SDK</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>PP-ChatOCRv3-doc</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_PP-ChatOCRv3-doc_sdk.tar.gz">paddlex_hps_PP-ChatOCRv3-doc_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General image classification</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_classification.tar.gz">paddlex_hps_image_classification.tar.gz</a></td>
+</tr>
+<tr>
+<td>General object detection</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_object_detection_sdk.tar.gz">paddlex_hps_object_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General instance segmentation</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_instance_segmentation_sdk.tar.gz">paddlex_hps_instance_segmentation_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General semantic segmentation</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_semantic_segmentation_sdk.tar.gz">paddlex_hps_semantic_segmentation_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General image multi-label classification</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_multi_label_classification_sdk.tar.gz">paddlex_hps_image_multi_label_classification_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General image recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_general_image_recognition_sdk.tar.gz">paddlex_hps_general_image_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Pedestrian attribute recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_pedestrian_attribute_recognition_sdk.tar.gz">paddlex_hps_pedestrian_attribute_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Vehicle attribute recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_vehicle_attribute_recognition_sdk.tar.gz">paddlex_hps_vehicle_attribute_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Face recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_face_recognition_sdk.tar.gz">paddlex_hps_face_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Small object detection</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_small_object_detection_sdk.tar.gz">paddlex_hps_small_object_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Image anomaly detection</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_anomaly_detection_sdk.tar.gz">paddlex_hps_image_anomaly_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General OCR</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_OCR_sdk.tar.gz">paddlex_hps_OCR_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>General table recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_table_recognition_sdk.tar.gz">paddlex_hps_table_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Layout parsing</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_layout_parsing_sdk.tar.gz">paddlex_hps_layout_parsing_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Formula recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_formula_recognition_sdk.tar.gz">paddlex_hps_formula_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Seal recognition</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_seal_recognition_sdk.tar.gz">paddlex_hps_seal_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Time series forecasting</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_forecasting_sdk.tar.gz">paddlex_hps_time_series_forecasting_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Time series anomaly detection</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_anomaly_detection_sdk.tar.gz">paddlex_hps_time_series_anomaly_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>Time series classification</td>
+<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_classification_sdk.tar.gz">paddlex_hps_time_series_classification_sdk.tar.gz</a></td>
+</tr>
+</tbody>
+</table>
+</details>
+
+### 2.2 Obtain the Serial Number
+
+Using the high-stability serving solution requires obtaining a serial number and completing activation on the machine used for deployment. The method to obtain the serial number is as follows:
+
+On [Baidu AI Studio Community - AI Learning and Training Platform](https://aistudio.baidu.com/paddlex/commercialization), under the "开源模型产线部署序列号咨询与获取" (open-source pipeline deployment serial number inquiry and acquisition) section, select "立即获取" (acquire now) as shown in the following image:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/image-1.png"/>
+
+Select the pipeline you wish to deploy and click "获取" (acquire). Afterwards, you can find the acquired serial number in the "开源产线部署SDK序列号管理" (open-source pipeline deployment SDK serial number management) section at the bottom of the page:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/image-2.png"/>
+
+**Please note**: Each serial number can only be bound to a unique device fingerprint and can only be bound once. This means that if a user deploys a pipeline on different machines, a separate serial number must be prepared for each machine.
+
+### 2.3 Adjust Configurations
+
+The PaddleX high-stability serving solution is built on NVIDIA Triton Inference Server, allowing users to modify the configuration files of Triton Inference Server.
+
+In the `model_repo/{endpoint name}` directory of the high-stability serving SDK, you can find one or more `config*.pbtxt` files. If a `config_{device type}.pbtxt` file exists in the directory, please modify the configuration file corresponding to the desired device type. Otherwise, please modify `config.pbtxt`.
+
+A common requirement is to adjust the number of execution instances for horizontal scaling. To achieve this, you need to modify the `instance_group` setting in the configuration file, using `count` to specify the number of instances placed on each device, `kind` to specify the device type, and `gpus` to specify the GPU IDs. An example is as follows:
+
+- Place 4 instances on GPU 0:
+
+    ```text
+    instance_group [
+    {
+        count: 4
+        kind: KIND_GPU
+        gpus: [ 0 ]
+    }
+    ]
+    ```
+
+- Place 2 instances on GPU 1, and 1 instance each on GPUs 2 and 3:
+
+    ```text
+    instance_group [
+    {
+        count: 2
+        kind: KIND_GPU
+        gpus: [ 1 ]
+    },
+    {
+        count: 1
+        kind: KIND_GPU
+        gpus: [ 2, 3 ]
+    }
+    ]
+    ```
+
+For more configuration details, please refer to the [Triton Inference Server documentation](https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/user_guide/model_configuration.html).
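
Since `config*.pbtxt` files use the protobuf text format, such edits can also be scripted. The sketch below assumes the `tritonclient` Python package (which bundles Triton's `ModelConfig` protobuf definition) is installed; the endpoint directory name is a placeholder. It scales every instance group to four instances:

```python
from google.protobuf import text_format
from tritonclient.grpc import model_config_pb2

# Placeholder path: substitute the endpoint directory you are scaling.
CONFIG_PATH = "model_repo/example-endpoint/config.pbtxt"

config = model_config_pb2.ModelConfig()
with open(CONFIG_PATH) as f:
    text_format.Parse(f.read(), config)

# Set every instance group to 4 instances per listed device.
for group in config.instance_group:
    group.count = 4

with open(CONFIG_PATH, "w") as f:
    f.write(text_format.MessageToString(config))
```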
+
+### 2.4 Run the Server
+
+The machine used for deployment needs to have Docker Engine version 19.03 or higher installed.
+
+First, pull the Docker image as needed:
+
+- Image supporting deployment with NVIDIA GPU (the machine must have NVIDIA drivers that support CUDA 11.8 installed):
+
+    ```bash
+    docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.0.0b2-gpu
+    ```
+
+- CPU-only Image:
+
+    ```bash
+    docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.0.0b2-cpu
+    ```
+
+With the image prepared, execute the following command to run the server:
+
+```bash
+docker run \
+    -it \
+    -v "$(pwd)":/workspace \
+    -v "${HOME}/.baidu/paddlex/licenses":/root/.baidu/paddlex/licenses \
+    -v /dev/disk/by-uuid:/dev/disk/by-uuid \
+    -w /workspace \
+    -e PADDLEX_HPS_DEVICE_TYPE={deployment device type} \
+    -e PADDLEX_HPS_SERIAL_NUMBER={serial number} \
+    --rm \
+    --gpus all \
+    --network host \
+    --shm-size 8g \
+    {image name} \
+    ./server.sh
+```
+
+- The deployment device type can be `cpu` or `gpu`, and the CPU-only image supports only `cpu`.
+- If CPU deployment is required, there is no need to specify `--gpus`.
+- The above commands can only be executed properly after successful activation. PaddleX offers two activation methods: online activation and offline activation. They are detailed as follows:
+
+    - Online activation: Add `-e PADDLEX_HPS_UPDATE_LICENSE=1` to the command to enable the program to complete activation automatically.
+    - Offline activation: Follow the instructions in the serial number management section to obtain the machine’s device fingerprint, bind the serial number to the device fingerprint to obtain the certificate, and complete the activation. With this activation method, you need to manually place the certificate in the `${HOME}/.baidu/paddlex/licenses` directory on the machine (create the directory if it does not exist).
+
+- Activation requires that the `/dev/disk/by-uuid` directory on the host machine exists and is not empty, and that this directory is correctly mounted into the container.
+- If you need to enter the container for debugging, you can replace `./server.sh` in the command with `/bin/bash` and then execute `./server.sh` inside the container.
+- If you want the server to run in the background, you can replace `-it` in the command with `-d`. After the container starts, you can view the container logs with `docker logs -f {container ID}`.
+
+You may observe output similar to the following:
+
+```text
+I1216 11:37:21.601943 35 grpc_server.cc:4117] Started GRPCInferenceService at 0.0.0.0:8001
+I1216 11:37:21.602333 35 http_server.cc:2815] Started HTTPService at 0.0.0.0:8000
+I1216 11:37:21.643494 35 http_server.cc:167] Started Metrics Service at 0.0.0.0:8002
+```
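
Because the server is Triton-based, its standard KServe-style HTTP health endpoints are available on the HTTP port shown in the log output (8000 above). A small readiness probe, assuming that default port:

```python
import requests

# Triton reports readiness via the KServe-style health endpoint on its
# HTTP port (8000 in the log output above; adjust if you changed it).
resp = requests.get("http://localhost:8000/v2/health/ready", timeout=5)
print("ready" if resp.status_code == 200 else f"not ready (HTTP {resp.status_code})")
```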
+
+### 2.5 Invoke the Service
+
+Currently, only the Python client is supported for calling the service. Supported Python versions are 3.8, 3.9, and 3.10.
+
+Navigate to the `client` directory of the high-stability serving SDK, and run the following command to install dependencies:
+
+```bash
+# It is recommended to install in a virtual environment
+python -m pip install paddlex_hps_client-*.whl
+python -m pip install -r requirements.txt
+```
+
+The `client.py` script in the `client` directory contains examples of how to call the service and provides a command-line interface.
+
+Services deployed with the high-stability serving solution provide the same primary operations as the basic serving solution, and for each primary operation the endpoint name and the request and response data fields are identical to those of the basic solution. Please refer to the "Development Integration/Deployment" section in the tutorial for each pipeline; the tutorials can be found [here](../pipeline_usage/pipeline_develop_guide.en.md).

+ 333 - 0
docs/pipeline_deploy/serving.md

@@ -0,0 +1,333 @@
+---
+comments: true
+---
+
+# PaddleX 服务化部署指南
+
+服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。
+
+PaddleX 产线服务化部署示意图:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/serving.png"  width="300" />
+
+针对用户的不同需求,PaddleX 提供多种产线服务化部署方案:
+
+- 基础服务化部署:简单易用的服务化部署方案,开发成本低。
+- 高稳定性服务化部署:基于 [NVIDIA Triton Inference Server](https://developer.nvidia.com/triton-inference-server) 打造。与基础服务化部署相比,该方案提供更高的稳定性,并允许用户调整配置以优化性能。
+
+**建议首先使用基础服务化部署方案进行快速验证**,然后根据实际需要,评估是否尝试更复杂的方案。
+
+**注意**
+
+- PaddleX 对产线而不是模块进行服务化部署。
+
+## 1. 基础服务化部署
+
+### 1.1 安装服务化部署插件
+
+执行如下命令,安装服务化部署插件:
+
+```bash
+paddlex --install serving
+```
+
+### 1.2 运行服务器
+
+通过 PaddleX CLI 运行服务器:
+
+```bash
+paddlex --serve --pipeline {产线名称或产线配置文件路径} [{其他命令行选项}]
+```
+
+以通用图像分类产线为例:
+
+```bash
+paddlex --serve --pipeline image_classification
+```
+
+可以看到类似以下展示的信息:
+
+```text
+INFO:     Started server process [63108]
+INFO:     Waiting for application startup.
+INFO:     Application startup complete.
+INFO:     Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
+```
+
+`--pipeline` 可指定为官方产线名称或本地产线配置文件路径。PaddleX 以此构建产线并部署为服务。如需调整配置(如模型路径、batch size、部署设备等),请参考 [通用图像分类产线使用教程](../pipeline_usage/tutorials/cv_pipelines/image_classification.md) 中的 <b>“模型应用”</b> 部分。
+
+与服务化部署相关的命令行选项如下:
+
+<table>
+<thead>
+<tr>
+<th>名称</th>
+<th>说明</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td><code>--pipeline</code></td>
+<td>产线名称或产线配置文件路径。</td>
+</tr>
+<tr>
+<td><code>--device</code></td>
+<td>产线部署设备。默认为 <code>cpu</code>(如 GPU 不可用)或 <code>gpu</code>(如 GPU 可用)。</td>
+</tr>
+<tr>
+<td><code>--host</code></td>
+<td>服务器绑定的主机名或 IP 地址。默认为 <code>0.0.0.0</code>。</td>
+</tr>
+<tr>
+<td><code>--port</code></td>
+<td>服务器监听的端口号。默认为 <code>8080</code>。</td>
+</tr>
+<tr>
+<td><code>--use_hpip</code></td>
+<td>如果指定,则启用高性能推理插件。</td>
+</tr>
+</tbody>
+</table>
+
+在对于服务响应时间要求较严格的应用场景中,可以使用 PaddleX 高性能推理插件对模型推理及前后处理进行加速,从而降低响应时间、提升吞吐量。
+
+使用 PaddleX 高性能推理插件,请参考 [PaddleX 高性能推理指南](./high_performance_inference.md) 中安装高性能推理插件、获取序列号与激活部分完成插件的安装与序列号的申请。同时,不是所有的产线、模型和环境都支持使用高性能推理插件,支持的详细情况请参考支持使用高性能推理插件的产线与模型部分。
+
+可以通过指定 `--use_hpip` 以使用高性能推理插件。示例如下:
+
+```bash
+paddlex --serve --pipeline image_classification --use_hpip
+```
+
+### 1.3 调用服务
+
+各产线使用教程中的 <b>“开发集成/部署”</b> 部分提供了服务的 API 参考与多语言调用示例。在 [此处](../pipeline_usage/pipeline_develop_guide.md) 可以找到各产线的使用教程。
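
The same invocation pattern applies to the Chinese guide; as a variation on the example in the English document above, this sketch passes a publicly accessible image URL instead of Base64 data, under the same assumptions about the endpoint and field names:

```python
import requests

# Same assumptions as the earlier example: default host/port and the
# endpoint/field names from the image classification pipeline tutorial.
# The "image" field accepts an accessible URL as well as Base64 data.
API_URL = "http://localhost:8080/image-classification"
IMAGE_URL = (
    "https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/"
    "general_image_classification_001.jpg"
)

resp = requests.post(API_URL, json={"image": IMAGE_URL}, timeout=30)
resp.raise_for_status()
for category in resp.json()["result"]["categories"]:
    print(category)
```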
+
+## 2. 高稳定性服务化部署
+
+**请注意,当前高稳定性服务化部署方案仅支持 Linux 系统。**
+
+### 2.1 下载高稳定性服务化部署 SDK
+
+在下表中找到产线对应的高稳定性服务化部署 SDK 并下载:
+
+<details>
+<summary> 👉点击查看</summary>
+<table>
+<thead>
+<tr>
+<th>产线</th>
+<th>SDK</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>文档场景信息抽取v3</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_PP-ChatOCRv3-doc_sdk.tar.gz>paddlex_hps_PP-ChatOCRv3-doc_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用图像分类</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_classification.tar.gz>paddlex_hps_image_classification.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用目标检测</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_object_detection_sdk.tar.gz>paddlex_hps_object_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用实例分割</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_instance_segmentation_sdk.tar.gz>paddlex_hps_instance_segmentation_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用语义分割</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_semantic_segmentation_sdk.tar.gz>paddlex_hps_semantic_segmentation_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用图像多标签分类</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_multi_label_classification_sdk.tar.gz>paddlex_hps_image_multi_label_classification_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用图像识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_general_image_recognition_sdk.tar.gz>paddlex_hps_general_image_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>行人属性识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_pedestrian_attribute_recognition_sdk.tar.gz>paddlex_hps_pedestrian_attribute_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>车辆属性识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_vehicle_attribute_recognition_sdk.tar.gz>paddlex_hps_vehicle_attribute_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>人脸识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_face_recognition_sdk.tar.gz>paddlex_hps_face_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>小目标检测</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_small_object_detection_sdk.tar.gz>paddlex_hps_small_object_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>图像异常检测</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_image_anomaly_detection_sdk.tar.gz>paddlex_hps_image_anomaly_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用OCR</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_OCR_sdk.tar.gz>paddlex_hps_OCR_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用表格识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_table_recognition_sdk.tar.gz>paddlex_hps_table_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>通用版面解析</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_layout_parsing_sdk.tar.gz>paddlex_hps_layout_parsing_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>公式识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_formula_recognition_sdk.tar.gz>paddlex_hps_formula_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>印章文本识别</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_seal_recognition_sdk.tar.gz>paddlex_hps_seal_recognition_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>时序预测</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_forecasting_sdk.tar.gz>paddlex_hps_time_series_forecasting_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>时序异常检测</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_anomaly_detection_sdk.tar.gz>paddlex_hps_time_series_anomaly_detection_sdk.tar.gz</a></td>
+</tr>
+<tr>
+<td>时序分类</td>
+<td><a href=https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/deploy/paddlex_hps/public/sdks/v3.0.0b2/paddlex_hps_time_series_classification_sdk.tar.gz>paddlex_hps_time_series_classification_sdk.tar.gz</a></td>
+</tr>
+</tbody>
+</table>
+</details>
+
+
+### 2.2 获取序列号
+
+使用高稳定性服务化部署方案需要获取序列号,并在用于部署的机器上完成激活。获取序列号的方式如下:
+
+在 [飞桨 AI Studio 星河社区-人工智能学习与实训社区](https://aistudio.baidu.com/paddlex/commercialization) 的“开源模型产线部署序列号咨询与获取”部分选择“立即获取”,如下图所示:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/image-1.png">
+
+选择需要部署的产线,并点击“获取”。之后,可以在页面下方的“开源产线部署SDK序列号管理”部分找到获取到的序列号:
+
+<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/pipeline_deploy/image-2.png">
+
+**请注意**:每个序列号只能绑定到唯一的设备指纹,且只能绑定一次。这意味着用户如果使用不同的机器部署产线,则必须为每台机器准备单独的序列号。
+
+### 2.3 调整配置
+
+PaddleX 高稳定性服务化部署方案基于 NVIDIA Triton Inference Server 打造,支持用户修改 Triton Inference Server 的配置文件。
+
+在高稳定性服务化部署 SDK 的 `model_repo/{端点名称}` 目录中,可以找到一个或多个 `config*.pbtxt` 文件。如果目录中存在 `config_{设备类型}.pbtxt` 文件,请修改期望使用的设备类型对应的配置文件;否则,请修改 `config.pbtxt`。
+
+一个常见的需求是调整执行实例数量,以进行水平扩展。为了实现这一点,需要修改配置文件中的 `instance_group` 配置,使用 `count` 指定每一设备上放置的实例数量,使用 `kind` 指定设备类型,使用 `gpus` 指定 GPU 编号。示例如下:
+
+- 在 GPU 0 上放置 4 个实例:
+
+    ```text
+    instance_group [
+    {
+        count: 4
+        kind: KIND_GPU
+        gpus: [ 0 ]
+    }
+    ]
+    ```
+
+- 在 GPU 1 上放置 2 个实例,在 GPU 2 和 3 上分别放置 1 个实例:
+
+    ```text
+    instance_group [
+    {
+        count: 2
+        kind: KIND_GPU
+        gpus: [ 1 ]
+    },
+    {
+        count: 1
+        kind: KIND_GPU
+        gpus: [ 2, 3 ]
+    }
+    ]
+    ```
+
+关于更多配置细节,请参阅 [Triton Inference Server 文档](https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/user_guide/model_configuration.html)。
+
+### 2.4 运行服务器
+
+用于部署的机器上需要安装有 19.03 或更高版本的 Docker Engine。
+
+首先,根据需要拉取 Docker 镜像:
+
+- 支持使用 NVIDIA GPU 部署的镜像(机器上需要安装有支持 CUDA 11.8 的 NVIDIA 驱动):
+
+    ```bash
+    docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.0.0b2-gpu
+    ```
+
+- CPU-only image:
+
+    ```bash
+    docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.0.0b2-cpu
+    ```
+
+Once the image is ready, run the following command to start the server:
+
+```bash
+docker run \
+    -it \
+    -v "$(pwd)":/workspace \
+    -v "${HOME}/.baidu/paddlex/licenses":/root/.baidu/paddlex/licenses \
+    -v /dev/disk/by-uuid:/dev/disk/by-uuid \
+    -w /workspace \
+    -e PADDLEX_HPS_DEVICE_TYPE={deployment device type} \
+    -e PADDLEX_HPS_SERIAL_NUMBER={serial number} \
+    --rm \
+    --gpus all \
+    --network host \
+    --shm-size 8g \
+    {image name} \
+    ./server.sh
+```
+
+- The deployment device type can be `cpu` or `gpu`; the CPU-only image supports `cpu` only.
+- If you want to deploy with the CPU, there is no need to specify `--gpus`.
+- The above command can be executed successfully only after activation. PaddleX provides two activation methods, online activation and offline activation, described as follows:
+
+    - Online activation: Add `-e PADDLEX_HPS_UPDATE_LICENSE=1` to the command so that the program completes activation automatically.
+    - Offline activation: Follow the instructions in the serial number management section to obtain the machine's device fingerprint, and bind the serial number to the device fingerprint to obtain a certificate and complete activation. With this method, you need to manually place the certificate in the `${HOME}/.baidu/paddlex/licenses` directory on the machine (create the directory if it does not exist).
+
+- Make sure that `/dev/disk/by-uuid` on the host exists and is not empty, and mount this directory correctly; otherwise, activation cannot be performed.
+- If you need to debug inside the container, replace `./server.sh` in the command with `/bin/bash`, and then run `./server.sh` inside the container.
+- If you want the server to run in the background, replace `-it` in the command with `-d`; after the container starts, you can view its logs with `docker logs -f {container ID}`. A filled-in sketch of this variant is shown below.
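+
+For concreteness, the following is a filled-in sketch of the detached variant: a GPU deployment using online activation (replace the serial number placeholder with your own):
+
+```bash
+docker run \
+    -d \
+    -v "$(pwd)":/workspace \
+    -v "${HOME}/.baidu/paddlex/licenses":/root/.baidu/paddlex/licenses \
+    -v /dev/disk/by-uuid:/dev/disk/by-uuid \
+    -w /workspace \
+    -e PADDLEX_HPS_DEVICE_TYPE=gpu \
+    -e PADDLEX_HPS_SERIAL_NUMBER={serial number} \
+    -e PADDLEX_HPS_UPDATE_LICENSE=1 \
+    --rm \
+    --gpus all \
+    --network host \
+    --shm-size 8g \
+    ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.0.0b2-gpu \
+    ./server.sh
+```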
+
+You should see output similar to the following:
+
+```text
+I1216 11:37:21.601943 35 grpc_server.cc:4117] Started GRPCInferenceService at 0.0.0.0:8001
+I1216 11:37:21.602333 35 http_server.cc:2815] Started HTTPService at 0.0.0.0:8000
+I1216 11:37:21.643494 35 http_server.cc:167] Started Metrics Service at 0.0.0.0:8002
+```
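+
+Since the solution is built on Triton Inference Server, the standard Triton health endpoints should be exposed on the HTTP port. As a hedged check (assuming the default port 8000 shown in the log above), readiness can presumably be verified as follows:
+
+```bash
+# Prints 200 once the server is ready to accept inference requests
+curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8000/v2/health/ready
+```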
+
+### 2.5 Invoke the Service
+
+Currently, the service can only be invoked with the Python client. Supported Python versions are 3.8, 3.9, and 3.10.
+
+Switch to the `client` directory of the high-stability serving SDK, and run the following command to install the dependencies:
+
+```bash
+# Installing in a virtual environment is recommended; see the sketch below
+python -m pip install paddlex_hps_client-*.whl
+python -m pip install -r requirements.txt
+```
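+
+If you follow the recommendation in the comment above, the virtual environment can be created and activated before running the two install commands. A minimal sketch using Python's built-in `venv` module (any environment manager works):
+
+```bash
+python -m venv .venv
+source .venv/bin/activate
+```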
+
+The `client.py` script in the `client` directory contains example invocations of the service and provides a command-line interface.
+
+A service deployed with the high-stability serving solution provides primary operations matching those of the basic serving solution. For each primary operation, the endpoint name and the request and response data fields are consistent with the basic serving solution. Please refer to the <b>"Development Integration/Deployment"</b> section in each pipeline's usage tutorial; the usage tutorials for the pipelines can be found [here](../pipeline_usage/pipeline_develop_guide.md).

+ 1 - 1
docs/pipeline_usage/pipeline_develop_guide.en.md

@@ -167,7 +167,7 @@ In addition, PaddleX also provides three other deployment methods, with detailed
 
 🚀 <b>high-performance inference</b>: In actual production environments, many applications have stringent standards for the performance metrics (especially response speed) of deployment strategies to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that aim to deeply optimize model inference and pre/post-processing for significant speedups in the end-to-end process. Refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md) for detailed high-performance inference procedures.
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. Refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md) for detailed service-oriented deployment procedures.
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../pipeline_deploy/serving.en.md).
 
 📱 <b>Edge Deployment</b>: Edge deployment is a method that places computing and data processing capabilities on user devices themselves, allowing devices to process data directly without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. Refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md) for detailed edge deployment procedures.
 

+ 1 - 1
docs/pipeline_usage/pipeline_develop_guide.md

@@ -169,7 +169,7 @@ Pipeline:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../pipeline_deploy/serving.md).

 📱 <b>Edge Deployment</b>: Edge deployment is a method that places computing and data processing capabilities on the user's device itself, so that the device can process data directly without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For the detailed edge deployment procedure, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.md).
 You can choose an appropriate method to deploy your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 10 - 10
docs/pipeline_usage/tutorials/cv_pipelines/face_recognition.en.md

@@ -393,13 +393,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -465,7 +465,7 @@ Below are the API reference and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>The main operations provided by the service are as follows:</p>
+<p>The primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>buildIndex</code></b></li>
 </ul>
@@ -505,7 +505,7 @@ Below are the API reference and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of the image file accessible to the service or the Base64 encoded content of the image file.</td>
+<td>The URL of the image file accessible to the server or the Base64 encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -566,7 +566,7 @@ Below are the API reference and multi-language service invocation examples:
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -583,7 +583,7 @@ Below are the API reference and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of the image file accessible to the service or the Base64 encoded content of the image file.</td>
+<td>The URL of the image file accessible to the server or the Base64 encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -639,7 +639,7 @@ Below are the API reference and multi-language service invocation examples:
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -683,7 +683,7 @@ Below are the API reference and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of the image file accessible to the service or the Base64 encoded content of the image file.</td>
+<td>The URL of the image file accessible to the server or the Base64 encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>
@@ -769,7 +769,7 @@ Below are the API reference and multi-language service invocation examples:
 </table>
 </details>
 
-<details><summary>Multilingual Service Invocation Examples</summary>
+<details><summary>Multi-Language Service Invocation Examples</summary>
 
 <pre><code class="language-python">import base64
 import pprint

+ 7 - 7
docs/pipeline_usage/tutorials/cv_pipelines/face_recognition.md

@@ -423,9 +423,9 @@ data_root             # Dataset root directory; the directory name can be changed

 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -535,7 +535,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -596,7 +596,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -613,7 +613,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -669,7 +669,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -713,7 +713,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 9 - 9
docs/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.en.md

@@ -362,13 +362,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -434,7 +434,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>The main operations provided by the service are as follows:</p>
+<p>The primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>buildIndex</code></b></li>
 </ul>
@@ -474,7 +474,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service, or the Base64 encoding result of the image file content.</td>
+<td>The URL of an image file accessible by the server, or the Base64 encoding result of the image file content.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -535,7 +535,7 @@ Below are the API references and multi-language service invocation examples:
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -552,7 +552,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service, or the Base64 encoding result of the image file content.</td>
+<td>The URL of an image file accessible by the server, or the Base64 encoding result of the image file content.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -608,7 +608,7 @@ Below are the API references and multi-language service invocation examples:
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -652,7 +652,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service, or the Base64 encoding result of the image file content.</td>
+<td>The URL of an image file accessible by the server, or the Base64 encoding result of the image file content.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 7 - 7
docs/pipeline_usage/tutorials/cv_pipelines/general_image_recognition.md

@@ -393,9 +393,9 @@ data_root             # Dataset root directory; the directory name can be changed

 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -505,7 +505,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -566,7 +566,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -583,7 +583,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 </tr>
 <tr>
 <td><code>label</code></td>
@@ -639,7 +639,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <td><code>indexKey</code></td>
 <td><code>string</code></td>
 <td>The key corresponding to the index. Provided by the <code>buildIndex</code> operation.</td>
-<td>Yes</td>
+<td>No</td>
 </tr>
 </tbody>
 </table>
@@ -683,7 +683,7 @@ data_root             # Dataset root directory; the directory name can be changed
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 4 - 4
docs/pipeline_usage/tutorials/cv_pipelines/human_keypoint_detection.md

@@ -189,13 +189,13 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>

-<p>For all operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The response body, as well as the request body of POST requests, is JSON data (a JSON object).</li>
 <li>When a request is processed successfully, the response status code is <code>200</code>, and the response body has the following properties:</li>
@@ -268,7 +268,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.en.md

@@ -211,13 +211,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -283,7 +283,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -305,7 +305,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of the image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of the image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/image_anomaly_detection.md

@@ -210,9 +210,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -304,7 +304,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/image_classification.en.md

@@ -872,13 +872,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -944,7 +944,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -966,7 +966,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/image_classification.md

@@ -864,9 +864,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -958,7 +958,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.en.md

@@ -235,13 +235,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have strict standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that aim to deeply optimize model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -307,7 +307,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -329,7 +329,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of the image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of the image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/image_multi_label_classification.md

@@ -236,9 +236,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -330,7 +330,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 <tr>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.en.md

@@ -358,13 +358,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that aim to deeply optimize model inference and pre/post-processing for significant speedups in the end-to-end process. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -430,7 +430,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -452,7 +452,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/instance_segmentation.md

@@ -353,9 +353,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -447,7 +447,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/object_detection.en.md

@@ -544,13 +544,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies, especially response speed, to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. Refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md) for detailed high-performance inference procedures.
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. Refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.md) for detailed service-oriented deployment procedures.
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -616,7 +616,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -638,7 +638,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/object_detection.md

@@ -543,9 +543,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -637,7 +637,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute_recognition.en.md

@@ -243,13 +243,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -315,7 +315,7 @@ Below are the API reference and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -337,7 +337,7 @@ Below are the API reference and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/pedestrian_attribute_recognition.md

@@ -243,9 +243,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -337,7 +337,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.en.md

@@ -384,13 +384,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that aim to deeply optimize model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -456,7 +456,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -478,7 +478,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/semantic_segmentation.md

@@ -383,9 +383,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -477,7 +477,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/small_object_detection.en.md

@@ -231,14 +231,14 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -304,7 +304,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -326,7 +326,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/small_object_detection.md

@@ -230,9 +230,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -324,7 +324,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute_recognition.en.md

@@ -240,13 +240,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -312,7 +312,7 @@ Below are the API reference and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -334,7 +334,7 @@ Below are the API reference and multi-language service invocation examples:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file accessible by the server or the Base64 encoded result of the image file content.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 3 - 3
docs/pipeline_usage/tutorials/cv_pipelines/vehicle_attribute_recognition.md

@@ -240,9 +240,9 @@ for res in output:
 
 🚀 <b>High-Performance Inference</b>: In real-world production environments, many applications impose stringent performance requirements (especially response speed) on deployment strategies to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For the detailed high-performance inference procedure, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).

-☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX enables users to serve pipelines at low cost. For the detailed serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common form of deployment in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports multiple pipeline serving solutions. For the detailed pipeline serving procedure, please refer to the [PaddleX Serving Guide](../../../pipeline_deploy/serving.md).

-Below are the API reference and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:

 <details><summary>API Reference</summary>
 
@@ -334,7 +334,7 @@ for res in output:
 <tr>
 <td><code>image</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible to the service, or the Base64-encoded content of the image file.</td>
+<td>The URL of an image file accessible to the server, or the Base64-encoded content of the image file.</td>
 <td>Yes</td>
 </tr>
 </tbody>

+ 30 - 26
docs/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction.en.md

@@ -619,13 +619,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics (especially response speed) of deployment strategies to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing to significantly speed up the end-to-end process. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -691,12 +691,12 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>analyzeImages</code></b></li>
 </ul>
 <p>Analyze images using computer vision models to obtain OCR, table recognition results, and extract key information from the images.</p>
-<p><code>POST /chatocr-vision</code></p>
+<p><code>POST /chatocr-visual</code></p>
 <ul>
 <li>Request body properties:</li>
 </ul>
@@ -719,7 +719,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>File type. <code>0</code> represents PDF files, <code>1</code> represents image files. If this property is not present in the request body, the service will attempt to infer the file type automatically based on the URL.</td>
+<td>File type. <code>0</code> represents PDF files, <code>1</code> represents image files. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
 <td>No</td>
 </tr>
 <tr>
@@ -780,12 +780,12 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
-<td><code>visionResults</code></td>
+<td><code>visualResults</code></td>
 <td><code>array</code></td>
 <td>Analysis results obtained using the computer vision model. The array length is 1 (for image input) or the smaller of the number of document pages and 10 (for PDF input). For PDF input, each element in the array represents the processing result of each page in the PDF file in sequence.</td>
 </tr>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>Key information in the image, which can be used as input for other operations.</td>
 </tr>
@@ -796,7 +796,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Each element in <code>visionResults</code> is an <code>object</code> with the following properties:</p>
+<p>Each element in <code>visualResults</code> is an <code>object</code> with the following properties:</p>
 <table>
 <thead>
 <tr>
@@ -901,7 +901,7 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>Key information from the image. Provided by the <code>analyzeImages</code> operation.</td>
 <td>Yes</td>
@@ -1063,7 +1063,7 @@ Below are the API references and multi-language service invocation examples:
 <td>Yes</td>
 </tr>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>Key information from images. Provided by the <code>analyzeImages</code> operation.</td>
 <td>Yes</td>
@@ -1218,30 +1218,31 @@ payload = {
     &quot;useImgUnwarping&quot;: True,
     &quot;useSealTextDet&quot;: True,
 }
-resp_vision = requests.post(url=f&quot;{API_BASE_URL}/chatocr-vision&quot;, json=payload)
-if resp_vision.status_code != 200:
+resp_visual = requests.post(url=f&quot;{API_BASE_URL}/chatocr-visual&quot;, json=payload)
+if resp_visual.status_code != 200:
     print(
-        f&quot;Request to chatocr-vision failed with status code {resp_vision.status_code}.&quot;
+        f&quot;Request to chatocr-visual failed with status code {resp_visual.status_code}.&quot;,
+        file=sys.stderr,
     )
-    pprint.pp(resp_vision.json())
+    pprint.pp(resp_visual.json())
     sys.exit(1)
-result_vision = resp_vision.json()[&quot;result&quot;]
+result_visual = resp_visual.json()[&quot;result&quot;]
 
-for i, res in enumerate(result_vision[&quot;visionResults&quot;]):
+for i, res in enumerate(result_visual[&quot;visualResults&quot;]):
     print(&quot;Texts:&quot;)
     pprint.pp(res[&quot;texts&quot;])
     print(&quot;Tables:&quot;)
     pprint.pp(res[&quot;tables&quot;])
-    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
-    with open(ocr_img_path, &quot;wb&quot;) as f:
-        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
     layout_img_path = f&quot;layout_{i}.jpg&quot;
     with open(layout_img_path, &quot;wb&quot;) as f:
         f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
-    print(f&quot;Output images saved at {ocr_img_path} and {layout_img_path}&quot;)
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 
 payload = {
-    &quot;visionInfo&quot;: result_vision[&quot;visionInfo&quot;],
+    &quot;visualInfo&quot;: result_visual[&quot;visualInfo&quot;],
     &quot;minChars&quot;: 200,
     &quot;llmRequestInterval&quot;: 1000,
     &quot;llmName&quot;: LLM_NAME,
@@ -1250,7 +1251,8 @@ payload = {
 resp_vector = requests.post(url=f&quot;{API_BASE_URL}/chatocr-vector&quot;, json=payload)
 if resp_vector.status_code != 200:
     print(
-        f&quot;Request to chatocr-vector failed with status code {resp_vector.status_code}.&quot;
+        f&quot;Request to chatocr-vector failed with status code {resp_vector.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_vector.json())
     sys.exit(1)
@@ -1265,7 +1267,8 @@ payload = {
 resp_retrieval = requests.post(url=f&quot;{API_BASE_URL}/chatocr-retrieval&quot;, json=payload)
 if resp_retrieval.status_code != 200:
     print(
-        f&quot;Request to chatocr-retrieval failed with status code {resp_retrieval.status_code}.&quot;
+        f&quot;Request to chatocr-retrieval failed with status code {resp_retrieval.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_retrieval.json())
     sys.exit(1)
@@ -1273,7 +1276,7 @@ result_retrieval = resp_retrieval.json()[&quot;result&quot;]
 
 payload = {
     &quot;keys&quot;: keys,
-    &quot;visionInfo&quot;: result_vision[&quot;visionInfo&quot;],
+    &quot;visualInfo&quot;: result_visual[&quot;visualInfo&quot;],
     &quot;vectorStore&quot;: result_vector[&quot;vectorStore&quot;],
     &quot;retrievalResult&quot;: result_retrieval[&quot;retrievalResult&quot;],
     &quot;taskDescription&quot;: &quot;&quot;,
@@ -1286,7 +1289,8 @@ payload = {
 resp_chat = requests.post(url=f&quot;{API_BASE_URL}/chatocr-chat&quot;, json=payload)
 if resp_chat.status_code != 200:
     print(
-        f&quot;Request to chatocr-chat failed with status code {resp_chat.status_code}.&quot;
+        f&quot;Request to chatocr-chat failed with status code {resp_chat.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_chat.json())
     sys.exit(1)
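
Each of the four requests above repeats the same status check, now writing to `sys.stderr`. A small helper, sketched under the same assumptions as the full script (its `requests`, `pprint`, and `sys` imports), removes the duplication:

<pre><code class="language-python">import pprint
import sys

import requests


def post_or_exit(url: str, payload: dict) -> dict:
    # POST JSON to a pipeline operation; on failure, report to stderr and exit.
    resp = requests.post(url=url, json=payload)
    if resp.status_code != 200:
        print(f"Request to {url} failed with status code {resp.status_code}.", file=sys.stderr)
        pprint.pp(resp.json())
        sys.exit(1)
    return resp.json()["result"]


# e.g.: result_visual = post_or_exit(f"{API_BASE_URL}/chatocr-visual", payload)
</code></pre>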

+ 33 - 29
docs/pipeline_usage/tutorials/information_extraction_pipelines/document_scene_information_extraction.md

@@ -588,9 +588,9 @@ chat_result.print()
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -665,7 +665,7 @@ chat_result.print()
 <li><b><code>analyzeImages</code></b></li>
 </ul>
 <p>使用计算机视觉模型对图像进行分析,获得OCR、表格识别结果等,并提取图像中的关键信息。</p>
-<p><code>POST /chatocr-vision</code></p>
+<p><code>POST /chatocr-visual</code></p>
 <ul>
 <li>请求体的属性如下:</li>
 </ul>
@@ -682,13 +682,13 @@ chat_result.print()
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
+<td>服务器可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则服务尝试根据URL自动推断文件类型。</td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 <tr>
@@ -749,12 +749,12 @@ chat_result.print()
 </thead>
 <tbody>
 <tr>
-<td><code>visionResults</code></td>
+<td><code>visualResults</code></td>
 <td><code>array</code></td>
 <td>使用计算机视觉模型得到的分析结果。数组长度为1(对于图像输入)或文档页数与10中的较小者(对于PDF输入)。对于PDF输入,数组中的每个元素依次表示PDF文件中每一页的处理结果。</td>
 </tr>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>图像中的关键信息,可用作其他操作的输入。</td>
 </tr>
@@ -765,7 +765,7 @@ chat_result.print()
 </tr>
 </tbody>
 </table>
-<p><code>visionResults</code>中的每个元素为一个<code>object</code>,具有如下属性:</p>
+<p><code>visualResults</code>中的每个元素为一个<code>object</code>,具有如下属性:</p>
 <table>
 <thead>
 <tr>
@@ -791,14 +791,14 @@ chat_result.print()
 <td>输入图像。图像为JPEG格式,使用Base64编码。</td>
 </tr>
 <tr>
-<td><code>ocrImage</code></td>
+<td><code>layoutImage</code></td>
 <td><code>string</code></td>
-<td>OCR结果图。图像为JPEG格式,使用Base64编码。</td>
+<td>版面区域检测结果图。图像为JPEG格式,使用Base64编码。</td>
 </tr>
 <tr>
-<td><code>layoutImage</code></td>
+<td><code>ocrImage</code></td>
 <td><code>string</code></td>
-<td>版面区域检测结果图。图像为JPEG格式,使用Base64编码。</td>
+<td>OCR结果图。图像为JPEG格式,使用Base64编码。</td>
 </tr>
 </tbody>
 </table>
@@ -870,7 +870,7 @@ chat_result.print()
 </thead>
 <tbody>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>图像中的关键信息。由<code>analyzeImages</code>操作提供。</td>
 <td>是</td>
@@ -1032,7 +1032,7 @@ chat_result.print()
 <td>是</td>
 </tr>
 <tr>
-<td><code>visionInfo</code></td>
+<td><code>visualInfo</code></td>
 <td><code>object</code></td>
 <td>图像中的关键信息。由<code>analyzeImages</code>操作提供。</td>
 <td>是</td>
@@ -1188,30 +1188,31 @@ payload = {
     &quot;useImgUnwarping&quot;: True,
     &quot;useSealTextDet&quot;: True,
 }
-resp_vision = requests.post(url=f&quot;{API_BASE_URL}/chatocr-vision&quot;, json=payload)
-if resp_vision.status_code != 200:
+resp_visual = requests.post(url=f&quot;{API_BASE_URL}/chatocr-visual&quot;, json=payload)
+if resp_visual.status_code != 200:
     print(
-        f&quot;Request to chatocr-vision failed with status code {resp_vision.status_code}.&quot;
+        f&quot;Request to chatocr-visual failed with status code {resp_visual.status_code}.&quot;,
+        file=sys.stderr,
     )
-    pprint.pp(resp_vision.json())
+    pprint.pp(resp_visual.json())
     sys.exit(1)
-result_vision = resp_vision.json()[&quot;result&quot;]
+result_visual = resp_visual.json()[&quot;result&quot;]
 
-for i, res in enumerate(result_vision[&quot;visionResults&quot;]):
+for i, res in enumerate(result_visual[&quot;visualResults&quot;]):
     print(&quot;Texts:&quot;)
     pprint.pp(res[&quot;texts&quot;])
     print(&quot;Tables:&quot;)
     pprint.pp(res[&quot;tables&quot;])
-    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
-    with open(ocr_img_path, &quot;wb&quot;) as f:
-        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
     layout_img_path = f&quot;layout_{i}.jpg&quot;
     with open(layout_img_path, &quot;wb&quot;) as f:
         f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
-    print(f&quot;Output images saved at {ocr_img_path} and {layout_img_path}&quot;)
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 
 payload = {
-    &quot;visionInfo&quot;: result_vision[&quot;visionInfo&quot;],
+    &quot;visualInfo&quot;: result_visual[&quot;visualInfo&quot;],
     &quot;minChars&quot;: 200,
     &quot;llmRequestInterval&quot;: 1000,
     &quot;llmName&quot;: LLM_NAME,
@@ -1220,7 +1221,8 @@ payload = {
 resp_vector = requests.post(url=f&quot;{API_BASE_URL}/chatocr-vector&quot;, json=payload)
 if resp_vector.status_code != 200:
     print(
-        f&quot;Request to chatocr-vector failed with status code {resp_vector.status_code}.&quot;
+        f&quot;Request to chatocr-vector failed with status code {resp_vector.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_vector.json())
     sys.exit(1)
@@ -1235,7 +1237,8 @@ payload = {
 resp_retrieval = requests.post(url=f&quot;{API_BASE_URL}/chatocr-retrieval&quot;, json=payload)
 if resp_retrieval.status_code != 200:
     print(
-        f&quot;Request to chatocr-retrieval failed with status code {resp_retrieval.status_code}.&quot;
+        f&quot;Request to chatocr-retrieval failed with status code {resp_retrieval.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_retrieval.json())
     sys.exit(1)
@@ -1243,7 +1246,7 @@ result_retrieval = resp_retrieval.json()[&quot;result&quot;]
 
 payload = {
     &quot;keys&quot;: keys,
-    &quot;visionInfo&quot;: result_vision[&quot;visionInfo&quot;],
+    &quot;visualInfo&quot;: result_visual[&quot;visualInfo&quot;],
     &quot;vectorStore&quot;: result_vector[&quot;vectorStore&quot;],
     &quot;retrievalResult&quot;: result_retrieval[&quot;retrievalResult&quot;],
     &quot;taskDescription&quot;: &quot;&quot;,
@@ -1256,7 +1259,8 @@ payload = {
 resp_chat = requests.post(url=f&quot;{API_BASE_URL}/chatocr-chat&quot;, json=payload)
 if resp_chat.status_code != 200:
     print(
-        f&quot;Request to chatocr-chat failed with status code {resp_chat.status_code}.&quot;
+        f&quot;Request to chatocr-chat failed with status code {resp_chat.status_code}.&quot;,
+        file=sys.stderr,
     )
     pprint.pp(resp_chat.json())
     sys.exit(1)
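
Clients written against the old names break twice here: the endpoint rename (`chatocr-vision` to `chatocr-visual`) requires a URL change, and the `vision*` result keys become `visual*`. For illustration only, a hypothetical adapter that lets legacy result-handling code keep working after the URL is updated:

<pre><code class="language-python">def adapt_visual_result(result: dict) -> dict:
    # Hypothetical shim: map the renamed `visual*` response keys back to the
    # old `vision*` names expected by pre-rename client code.
    adapted = dict(result)
    for new_key, old_key in (("visualResults", "visionResults"),
                             ("visualInfo", "visionInfo")):
        if new_key in adapted:
            adapted[old_key] = adapted.pop(new_key)
    return adapted


# Usage: legacy_result = adapt_visual_result(resp_visual.json()["result"])
</code></pre>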

+ 45 - 379
docs/pipeline_usage/tutorials/ocr_pipelines/OCR.en.md

@@ -301,13 +301,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -373,7 +373,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -393,12 +393,18 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
-<td><code>image</code></td>
+<td><code>file</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file or PDF file accessible by the server, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
 <td>Yes</td>
 </tr>
 <tr>
+<td><code>fileType</code></td>
+<td><code>integer</code></td>
+<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
+<td>No</td>
+</tr>
+<tr>
 <td><code>inferenceParams</code></td>
 <td><code>object</code></td>
 <td>Inference parameters.</td>
@@ -437,6 +443,28 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </thead>
 <tbody>
+<tr>
+<td><code>ocrResults</code></td>
+<td><code>array</code></td>
+<td>OCR results. The array length is 1 (for image input) or the smaller of the number of document pages and 10 (for PDF input). For PDF input, each element in the array represents the processing result of each page in the PDF file.</td>
+</tr>
+<tr>
+<td><code>dataInfo</code></td>
+<td><code>object</code></td>
+<td>Information about the input data.</td>
+</tr>
+</tbody>
+</table>
+<p>Each element in <code>ocrResults</code> is an <code>object</code> with the following properties:</p>
+<table>
+<thead>
+<tr>
+<th>Name</th>
+<th>Type</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
 <tr>
 <td><code>texts</code></td>
 <td><code>array</code></td>
@@ -476,57 +503,6 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Example of <code>result</code>:</p>
-<pre><code class="language-json">{
-&quot;texts&quot;: [
-{
-&quot;poly&quot;: [
-[
-444,
-244
-],
-[
-705,
-244
-],
-[
-705,
-311
-],
-[
-444,
-311
-]
-],
-&quot;text&quot;: &quot;Beijing South Railway Station&quot;,
-&quot;score&quot;: 0.9
-},
-{
-&quot;poly&quot;: [
-[
-992,
-248
-],
-[
-1263,
-251
-],
-[
-1263,
-318
-],
-[
-992,
-315
-]
-],
-&quot;text&quot;: &quot;Tianjin Railway Station&quot;,
-&quot;score&quot;: 0.5
-}
-],
-&quot;image&quot;: &quot;xxxxxx&quot;
-}
-</code></pre></details>
 
 <details><summary>Multi-Language Service Invocation Examples</summary>
 
@@ -538,335 +514,25 @@ Below are the API references and multi-language service invocation examples:
 import requests
 
 API_URL = &quot;http://localhost:8080/ocr&quot;
-image_path = &quot;./demo.jpg&quot;
-output_image_path = &quot;./out.jpg&quot;
+file_path = &quot;./demo.jpg&quot;
 
-with open(image_path, &quot;rb&quot;) as file:
-    image_bytes = file.read()
-    image_data = base64.b64encode(image_bytes).decode(&quot;ascii&quot;)
+with open(file_path, &quot;rb&quot;) as file:
+    file_bytes = file.read()
+    file_data = base64.b64encode(file_bytes).decode(&quot;ascii&quot;)
 
-payload = {&quot;image&quot;: image_data}
+payload = {&quot;file&quot;: file_data, &quot;fileType&quot;: 1}
 
 response = requests.post(API_URL, json=payload)
 
 assert response.status_code == 200
 result = response.json()[&quot;result&quot;]
-with open(output_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;image&quot;]))
-print(f&quot;Output image saved at {output_image_path}&quot;)
-print(&quot;\nDetected texts:&quot;)
-print(result[&quot;texts&quot;])
-</code></pre></details>
-
-<details><summary>C++</summary>
-
-<pre><code class="language-cpp">#include &lt;iostream&gt;
-#include &quot;cpp-httplib/httplib.h&quot; // https://github.com/Huiyicc/cpp-httplib
-#include &quot;nlohmann/json.hpp&quot; // https://github.com/nlohmann/json
-#include &quot;base64.hpp&quot; // https://github.com/tobiaslocker/base64
-
-int main() {
-    httplib::Client client(&quot;localhost:8080&quot;);
-    const std::string imagePath = &quot;./demo.jpg&quot;;
-    const std::string outputImagePath = &quot;./out.jpg&quot;;
-
-    httplib::Headers headers = {
-        {&quot;Content-Type&quot;, &quot;application/json&quot;}
-    };
-
-    std::ifstream file(imagePath, std::ios::binary | std::ios::ate);
-    std::streamsize size = file.tellg();
-    file.seekg(0, std::ios::beg);
-
-    std::vector&lt;char&gt; buffer(size);
-    if (!file.read(buffer.data(), size)) {
-        std::cerr &lt;&lt; &quot;Error reading file.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-    std::string bufferStr(reinterpret_cast&lt;const char*&gt;(buffer.data()), buffer.size());
-    std::string encodedImage = base64::to_base64(bufferStr);
-
-    nlohmann::json jsonObj;
-    jsonObj[&quot;image&quot;] = encodedImage;
-    std::string body = jsonObj.dump();
-
-    auto response = client.Post(&quot;/ocr&quot;, headers, body, &quot;application/json&quot;);
-    if (response &amp;&amp; response-&gt;status == 200) {
-        nlohmann::json jsonResponse = nlohmann::json::parse(response-&gt;body);
-        auto result = jsonResponse[&quot;result&quot;];
-
-        encodedImage = result[&quot;image&quot;];
-        std::string decodedString = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedImage(decodedString.begin(), decodedString.end());
-        std::ofstream outputImage(outPutImagePath, std::ios::binary | std::ios::out);
-        if (outputImage.is_open()) {
-            outputImage.write(reinterpret_cast&lt;char*&gt;(decodedImage.data()), decodedImage.size());
-            outputImage.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; outPutImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; outPutImagePath &lt;&lt; std::endl;
-        }
-
-        auto texts = result[&quot;texts&quot;];
-        std::cout &lt;&lt; &quot;\nDetected texts:&quot; &lt;&lt; std::endl;
-        for (const auto&amp; text : texts) {
-            std::cout &lt;&lt; text &lt;&lt; std::endl;
-        }
-    } else {
-        std::cout &lt;&lt; &quot;Failed to send HTTP request.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-
-    return 0;
-}
-</code></pre></details>
-
-<details><summary>Java</summary>
-
-<pre><code class="language-java">import okhttp3.*;
-import com.fasterxml.jackson.databind.ObjectMapper;
-import com.fasterxml.jackson.databind.JsonNode;
-import com.fasterxml.jackson.databind.node.ObjectNode;
-
-import java.io.File;
-import java.io.FileOutputStream;
-import java.io.IOException;
-import java.util.Base64;
-
-public class Main {
-    public static void main(String[] args) throws IOException {
-        String API_URL = &quot;http://localhost:8080/ocr&quot;;
-        String imagePath = &quot;./demo.jpg&quot;;
-        String outputImagePath = &quot;./out.jpg&quot;;
-
-        File file = new File(imagePath);
-        byte[] fileContent = java.nio.file.Files.readAllBytes(file.toPath());
-        String imageData = Base64.getEncoder().encodeToString(fileContent);
-
-        ObjectMapper objectMapper = new ObjectMapper();
-        ObjectNode params = objectMapper.createObjectNode();
-        params.put(&quot;image&quot;, imageData);
-
-        OkHttpClient client = new OkHttpClient();
-        MediaType JSON = MediaType.Companion.get(&quot;application/json; charset=utf-8&quot;);
-        RequestBody body = RequestBody.Companion.create(params.toString(), JSON);
-        Request request = new Request.Builder()
-                .url(API_URL)
-                .post(body)
-                .build();
-
-        try (Response response = client.newCall(request).execute()) {
-            if (response.isSuccessful()) {
-                String responseBody = response.body().string();
-                JsonNode resultNode = objectMapper.readTree(responseBody);
-                JsonNode result = resultNode.get(&quot;result&quot;);
-                String base64Image = result.get(&quot;image&quot;).asText();
-                JsonNode texts = result.get(&quot;texts&quot;);
-
-                byte[] imageBytes = Base64.getDecoder().decode(base64Image);
-                try (FileOutputStream fos = new FileOutputStream(outputImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + outputImagePath);
-                System.out.println(&quot;\nDetected texts: &quot; + texts.toString());
-            } else {
-                System.err.println(&quot;Request failed with code: &quot; + response.code());
-            }
-        }
-    }
-}
-</code></pre></details>
-
-<details><summary>Go</summary>
-
-<pre><code class="language-go">package main
-
-import (
-    &quot;bytes&quot;
-    &quot;encoding/base64&quot;
-    &quot;encoding/json&quot;
-    &quot;fmt&quot;
-    &quot;io/ioutil&quot;
-    &quot;net/http&quot;
-)
-
-func main() {
-    API_URL := &quot;http://localhost:8080/ocr&quot;
-    imagePath := &quot;./demo.jpg&quot;
-    outputImagePath := &quot;./out.jpg&quot;
-
-    imageBytes, err := ioutil.ReadFile(imagePath)
-    if err != nil {
-        fmt.Println(&quot;Error reading image file:&quot;, err)
-        return
-    }
-    imageData := base64.StdEncoding.EncodeToString(imageBytes)
-
-    payload := map[string]string{&quot;image&quot;: imageData}
-    payloadBytes, err := json.Marshal(payload)
-    if err != nil {
-        fmt.Println(&quot;Error marshaling payload:&quot;, err)
-        return
-    }
-
-    client := &amp;http.Client{}
-    req, err := http.NewRequest(&quot;POST&quot;, API_URL, bytes.NewBuffer(payloadBytes))
-    if err != nil {
-        fmt.Println(&quot;Error creating request:&quot;, err)
-        return
-    }
-
-    res, err := client.Do(req)
-    if err != nil {
-        fmt.Println(&quot;Error sending request:&quot;, err)
-        return
-    }
-    defer res.Body.Close()
-
-    body, err := ioutil.ReadAll(res.Body)
-    if err != nil {
-        fmt.Println(&quot;Error reading response body:&quot;, err)
-        return
-    }
-    type Response struct {
-        Result struct {
-            Image      string   `json:&quot;image&quot;`
-            Texts []map[string]interface{} `json:&quot;texts&quot;`
-        } `json:&quot;result&quot;`
-    }
-    var respData Response
-    err = json.Unmarshal([]byte(string(body)), &amp;respData)
-    if err != nil {
-        fmt.Println(&quot;Error unmarshaling response body:&quot;, err)
-        return
-    }
-
-    outputImageData, err := base64.StdEncoding.DecodeString(respData.Result.Image)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(outputImagePath, outputImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, outputImagePath)
-    fmt.Println(&quot;\nDetected texts:&quot;)
-    for _, text := range respData.Result.Texts {
-        fmt.Println(text)
-    }
-}
-</code></pre></details>
-
-<details><summary>C#</summary>
-
-<pre><code class="language-csharp">using System;
-using System.IO;
-using System.Net.Http;
-using System.Net.Http.Headers;
-using System.Text;
-using System.Threading.Tasks;
-using Newtonsoft.Json.Linq;
-
-class Program
-{
-    static readonly string API_URL = &quot;http://localhost:8080/ocr&quot;;
-    static readonly string imagePath = &quot;./demo.jpg&quot;;
-    static readonly string outputImagePath = &quot;./out.jpg&quot;;
-
-    static async Task Main(string[] args)
-    {
-        var httpClient = new HttpClient();
-
-        byte[] imageBytes = File.ReadAllBytes(imagePath);
-        string image_data = Convert.ToBase64String(imageBytes);
-
-        var payload = new JObject{ { &quot;image&quot;, image_data } };
-        var content = new StringContent(payload.ToString(), Encoding.UTF8, &quot;application/json&quot;);
-
-        HttpResponseMessage response = await httpClient.PostAsync(API_URL, content);
-        response.EnsureSuccessStatusCode();
-
-        string responseBody = await response.Content.ReadAsStringAsync();
-        JObject jsonResponse = JObject.Parse(responseBody);
-
-        string base64Image = jsonResponse[&quot;result&quot;][&quot;image&quot;].ToString();
-        byte[] outputImageBytes = Convert.FromBase64String(base64Image);
-
-        File.WriteAllBytes(outputImagePath, outputImageBytes);
-        Console.WriteLine($&quot;Output image saved at {outputImagePath}&quot;);
-        Console.WriteLine(&quot;\nDetected texts:&quot;);
-        Console.WriteLine(jsonResponse[&quot;result&quot;][&quot;texts&quot;].ToString());
-    }
-}
-</code></pre></details>
-
-<details><summary>Node.js</summary>
-
-<pre><code class="language-js">const axios = require('axios');
-const fs = require('fs');
-
-const API_URL = 'http://localhost:8080/ocr'
-const imagePath = './demo.jpg'
-const outputImagePath = &quot;./out.jpg&quot;;
-
-let config = {
-   method: 'POST',
-   maxBodyLength: Infinity,
-   url: API_URL,
-   data: JSON.stringify({
-    'image': encodeImageToBase64(imagePath)
-  })
-};
-
-function encodeImageToBase64(filePath) {
-  const bitmap = fs.readFileSync(filePath);
-  return Buffer.from(bitmap).toString('base64');
-}
-
-axios.request(config)
-.then((response) =&gt; {
-    const result = response.data[&quot;result&quot;];
-    const imageBuffer = Buffer.from(result[&quot;image&quot;], 'base64');
-    fs.writeFile(outputImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${outputImagePath}`);
-    });
-    console.log(&quot;\nDetected texts:&quot;);
-    console.log(result[&quot;texts&quot;]);
-})
-.catch((error) =&gt; {
-  console.log(error);
-});
-</code></pre></details>
-
-<details><summary>PHP</summary>
-
-<pre><code class="language-php">&lt;?php
-
-$API_URL = &quot;http://localhost:8080/ocr&quot;;
-$image_path = &quot;./demo.jpg&quot;;
-$output_image_path = &quot;./out.jpg&quot;;
-
-$image_data = base64_encode(file_get_contents($image_path));
-$payload = array(&quot;image&quot; =&gt; $image_data);
-
-$ch = curl_init($API_URL);
-curl_setopt($ch, CURLOPT_POST, true);
-curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($payload));
-curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-Type: application/json'));
-curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
-$response = curl_exec($ch);
-curl_close($ch);
-
-$result = json_decode($response, true)[&quot;result&quot;];
-file_put_contents($output_image_path, base64_decode($result[&quot;image&quot;]));
-echo &quot;Output image saved at &quot; . $output_image_path . &quot;\n&quot;;
-echo &quot;\nDetected texts:\n&quot;;
-print_r($result[&quot;texts&quot;]);
-
-?&gt;
+for i, res in enumerate(result[&quot;ocrResults&quot;]):
+    print(&quot;Detected texts:&quot;)
+    print(res[&quot;texts&quot;])
+    output_img_path = f&quot;out_{i}.jpg&quot;
+    with open(output_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;image&quot;]))
+    print(f&quot;Output image saved at {output_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>
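
The updated client above sends an image (`fileType: 1`). A companion sketch for PDF input, assuming the same local deployment; per the schema, only the first 10 pages are processed and `ocrResults` holds one entry per page:

<pre><code class="language-python">import base64

import requests

API_URL = "http://localhost:8080/ocr"
file_path = "./demo.pdf"

with open(file_path, "rb") as f:
    file_data = base64.b64encode(f.read()).decode("ascii")

# fileType 0 marks the payload as a PDF.
payload = {"file": file_data, "fileType": 0}

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
result = response.json()["result"]

for i, res in enumerate(result["ocrResults"]):
    print(f"Page {i}: {len(res['texts'])} text regions")
    with open(f"out_{i}.jpg", "wb") as f:
        f.write(base64.b64decode(res["image"]))
</code></pre>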

+ 45 - 400
docs/pipeline_usage/tutorials/ocr_pipelines/OCR.md

@@ -835,9 +835,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -927,12 +927,18 @@ for res in output:
 </thead>
 <tbody>
 <tr>
-<td><code>image</code></td>
+<td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件的URL或图像文件内容的Base64编码结果。</td>
+<td>服务器可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
+<td><code>fileType</code></td>
+<td><code>integer</code></td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
+<td>否</td>
+</tr>
+<tr>
 <td><code>inferenceParams</code></td>
 <td><code>object</code></td>
 <td>推理参数。</td>
@@ -972,6 +978,28 @@ for res in output:
 </thead>
 <tbody>
 <tr>
+<td><code>ocrResults</code></td>
+<td><code>array</code></td>
+<td>OCR结果。数组长度为1(对于图像输入)或文档页数与10中的较小者(对于PDF输入)。对于PDF输入,数组中的每个元素依次表示PDF文件中每一页的处理结果。</td>
+</tr>
+<tr>
+<td><code>dataInfo</code></td>
+<td><code>object</code></td>
+<td>输入数据信息。</td>
+</tr>
+</tbody>
+</table>
+<p><code>ocrResults</code>中的每个元素为一个<code>object</code>,具有如下属性:</p>
+<table>
+<thead>
+<tr>
+<th>名称</th>
+<th>类型</th>
+<th>含义</th>
+</tr>
+</thead>
+<tbody>
+<tr>
 <td><code>texts</code></td>
 <td><code>array</code></td>
 <td>文本位置、内容和得分。</td>
@@ -1010,418 +1038,35 @@ for res in output:
 </tr>
 </tbody>
 </table>
-<p><code>result</code>示例如下:</p>
-<pre><code class="language-json">{
-&quot;texts&quot;: [
-{
-&quot;poly&quot;: [
-[
-444,
-244
-],
-[
-705,
-244
-],
-[
-705,
-311
-],
-[
-444,
-311
-]
-],
-&quot;text&quot;: &quot;北京南站&quot;,
-&quot;score&quot;: 0.9
-},
-{
-&quot;poly&quot;: [
-[
-992,
-248
-],
-[
-1263,
-251
-],
-[
-1263,
-318
-],
-[
-992,
-315
-]
-],
-&quot;text&quot;: &quot;天津站&quot;,
-&quot;score&quot;: 0.5
-}
-],
-&quot;image&quot;: &quot;xxxxxx&quot;
-}
-</code></pre></details>
 
 <details><summary>多语言调用服务示例</summary>
 
 <details>
 <summary>Python</summary>
 
-
 <pre><code class="language-python">import base64
 import requests
 
-API_URL = &quot;http://localhost:8080/ocr&quot; # 服务URL
-image_path = &quot;./demo.jpg&quot;
-output_image_path = &quot;./out.jpg&quot;
+API_URL = &quot;http://localhost:8080/ocr&quot;
+file_path = &quot;./demo.jpg&quot;
 
-# 对本地图像进行Base64编码
-with open(image_path, &quot;rb&quot;) as file:
-    image_bytes = file.read()
-    image_data = base64.b64encode(image_bytes).decode(&quot;ascii&quot;)
+with open(file_path, &quot;rb&quot;) as file:
+    file_bytes = file.read()
+    file_data = base64.b64encode(file_bytes).decode(&quot;ascii&quot;)
 
-payload = {&quot;image&quot;: image_data}  # Base64编码的文件内容或者图像URL
+payload = {&quot;file&quot;: file_data, &quot;fileType&quot;: 1}
 
-# 调用API
 response = requests.post(API_URL, json=payload)
 
-# 处理接口返回数据
 assert response.status_code == 200
 result = response.json()[&quot;result&quot;]
-with open(output_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;image&quot;]))
-print(f&quot;Output image saved at {output_image_path}&quot;)
-print(&quot;\nDetected texts:&quot;)
-print(result[&quot;texts&quot;])
-</code></pre></details>
-
-<details><summary>C++</summary>
-
-<pre><code class="language-cpp">#include &lt;iostream&gt;
-#include &quot;cpp-httplib/httplib.h&quot; // https://github.com/Huiyicc/cpp-httplib
-#include &quot;nlohmann/json.hpp&quot; // https://github.com/nlohmann/json
-#include &quot;base64.hpp&quot; // https://github.com/tobiaslocker/base64
-
-int main() {
-    httplib::Client client(&quot;localhost:8080&quot;);
-    const std::string imagePath = &quot;./demo.jpg&quot;;
-    const std::string outputImagePath = &quot;./out.jpg&quot;;
-
-    httplib::Headers headers = {
-        {&quot;Content-Type&quot;, &quot;application/json&quot;}
-    };
-
-    // 对本地图像进行Base64编码
-    std::ifstream file(imagePath, std::ios::binary | std::ios::ate);
-    std::streamsize size = file.tellg();
-    file.seekg(0, std::ios::beg);
-
-    std::vector&lt;char&gt; buffer(size);
-    if (!file.read(buffer.data(), size)) {
-        std::cerr &lt;&lt; &quot;Error reading file.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-    std::string bufferStr(reinterpret_cast&lt;const char*&gt;(buffer.data()), buffer.size());
-    std::string encodedImage = base64::to_base64(bufferStr);
-
-    nlohmann::json jsonObj;
-    jsonObj[&quot;image&quot;] = encodedImage;
-    std::string body = jsonObj.dump();
-
-    // 调用API
-    auto response = client.Post(&quot;/ocr&quot;, headers, body, &quot;application/json&quot;);
-    // 处理接口返回数据
-    if (response &amp;&amp; response-&gt;status == 200) {
-        nlohmann::json jsonResponse = nlohmann::json::parse(response-&gt;body);
-        auto result = jsonResponse[&quot;result&quot;];
-
-        encodedImage = result[&quot;image&quot;];
-        std::string decodedString = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedImage(decodedString.begin(), decodedString.end());
-        std::ofstream outputImage(outPutImagePath, std::ios::binary | std::ios::out);
-        if (outputImage.is_open()) {
-            outputImage.write(reinterpret_cast&lt;char*&gt;(decodedImage.data()), decodedImage.size());
-            outputImage.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; outPutImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; outPutImagePath &lt;&lt; std::endl;
-        }
-
-        auto texts = result[&quot;texts&quot;];
-        std::cout &lt;&lt; &quot;\nDetected texts:&quot; &lt;&lt; std::endl;
-        for (const auto&amp; text : texts) {
-            std::cout &lt;&lt; text &lt;&lt; std::endl;
-        }
-    } else {
-        std::cout &lt;&lt; &quot;Failed to send HTTP request.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-
-    return 0;
-}
-</code></pre></details>
-
-<details><summary>Java</summary>
-
-<pre><code class="language-java">import okhttp3.*;
-import com.fasterxml.jackson.databind.ObjectMapper;
-import com.fasterxml.jackson.databind.JsonNode;
-import com.fasterxml.jackson.databind.node.ObjectNode;
-
-import java.io.File;
-import java.io.FileOutputStream;
-import java.io.IOException;
-import java.util.Base64;
-
-public class Main {
-    public static void main(String[] args) throws IOException {
-        String API_URL = &quot;http://localhost:8080/ocr&quot;; // 服务URL
-        String imagePath = &quot;./demo.jpg&quot;; // 本地图像
-        String outputImagePath = &quot;./out.jpg&quot;; // 输出图像
-
-        // 对本地图像进行Base64编码
-        File file = new File(imagePath);
-        byte[] fileContent = java.nio.file.Files.readAllBytes(file.toPath());
-        String imageData = Base64.getEncoder().encodeToString(fileContent);
-
-        ObjectMapper objectMapper = new ObjectMapper();
-        ObjectNode params = objectMapper.createObjectNode();
-        params.put(&quot;image&quot;, imageData); // Base64编码的文件内容或者图像URL
-
-        // 创建 OkHttpClient 实例
-        OkHttpClient client = new OkHttpClient();
-        MediaType JSON = MediaType.Companion.get(&quot;application/json; charset=utf-8&quot;);
-        RequestBody body = RequestBody.Companion.create(params.toString(), JSON);
-        Request request = new Request.Builder()
-                .url(API_URL)
-                .post(body)
-                .build();
-
-        // 调用API并处理接口返回数据
-        try (Response response = client.newCall(request).execute()) {
-            if (response.isSuccessful()) {
-                String responseBody = response.body().string();
-                JsonNode resultNode = objectMapper.readTree(responseBody);
-                JsonNode result = resultNode.get(&quot;result&quot;);
-                String base64Image = result.get(&quot;image&quot;).asText();
-                JsonNode texts = result.get(&quot;texts&quot;);
-
-                byte[] imageBytes = Base64.getDecoder().decode(base64Image);
-                try (FileOutputStream fos = new FileOutputStream(outputImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + outputImagePath);
-                System.out.println(&quot;\nDetected texts: &quot; + texts.toString());
-            } else {
-                System.err.println(&quot;Request failed with code: &quot; + response.code());
-            }
-        }
-    }
-}
-</code></pre></details>
-
-<details><summary>Go</summary>
-
-<pre><code class="language-go">package main
-
-import (
-    &quot;bytes&quot;
-    &quot;encoding/base64&quot;
-    &quot;encoding/json&quot;
-    &quot;fmt&quot;
-    &quot;io/ioutil&quot;
-    &quot;net/http&quot;
-)
-
-func main() {
-    API_URL := &quot;http://localhost:8080/ocr&quot;
-    imagePath := &quot;./demo.jpg&quot;
-    outputImagePath := &quot;./out.jpg&quot;
-
-    // 对本地图像进行Base64编码
-    imageBytes, err := ioutil.ReadFile(imagePath)
-    if err != nil {
-        fmt.Println(&quot;Error reading image file:&quot;, err)
-        return
-    }
-    imageData := base64.StdEncoding.EncodeToString(imageBytes)
-
-    payload := map[string]string{&quot;image&quot;: imageData} // Base64编码的文件内容或者图像URL
-    payloadBytes, err := json.Marshal(payload)
-    if err != nil {
-        fmt.Println(&quot;Error marshaling payload:&quot;, err)
-        return
-    }
-
-    // 调用API
-    client := &amp;http.Client{}
-    req, err := http.NewRequest(&quot;POST&quot;, API_URL, bytes.NewBuffer(payloadBytes))
-    if err != nil {
-        fmt.Println(&quot;Error creating request:&quot;, err)
-        return
-    }
-
-    res, err := client.Do(req)
-    if err != nil {
-        fmt.Println(&quot;Error sending request:&quot;, err)
-        return
-    }
-    defer res.Body.Close()
-
-    // 处理接口返回数据
-    body, err := ioutil.ReadAll(res.Body)
-    if err != nil {
-        fmt.Println(&quot;Error reading response body:&quot;, err)
-        return
-    }
-    type Response struct {
-        Result struct {
-            Image      string   `json:&quot;image&quot;`
-            Texts []map[string]interface{} `json:&quot;texts&quot;`
-        } `json:&quot;result&quot;`
-    }
-    var respData Response
-    err = json.Unmarshal([]byte(string(body)), &amp;respData)
-    if err != nil {
-        fmt.Println(&quot;Error unmarshaling response body:&quot;, err)
-        return
-    }
-
-    outputImageData, err := base64.StdEncoding.DecodeString(respData.Result.Image)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(outputImagePath, outputImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, outputImagePath)
-    fmt.Println(&quot;\nDetected texts:&quot;)
-    for _, text := range respData.Result.Texts {
-        fmt.Println(text)
-    }
-}
-</code></pre></details>
-
-<details><summary>C#</summary>
-
-<pre><code class="language-csharp">using System;
-using System.IO;
-using System.Net.Http;
-using System.Net.Http.Headers;
-using System.Text;
-using System.Threading.Tasks;
-using Newtonsoft.Json.Linq;
-
-class Program
-{
-    static readonly string API_URL = &quot;http://localhost:8080/ocr&quot;;
-    static readonly string imagePath = &quot;./demo.jpg&quot;;
-    static readonly string outputImagePath = &quot;./out.jpg&quot;;
-
-    static async Task Main(string[] args)
-    {
-        var httpClient = new HttpClient();
-
-        // 对本地图像进行Base64编码
-        byte[] imageBytes = File.ReadAllBytes(imagePath);
-        string image_data = Convert.ToBase64String(imageBytes);
-
-        var payload = new JObject{ { &quot;image&quot;, image_data } }; // Base64编码的文件内容或者图像URL
-        var content = new StringContent(payload.ToString(), Encoding.UTF8, &quot;application/json&quot;);
-
-        // 调用API
-        HttpResponseMessage response = await httpClient.PostAsync(API_URL, content);
-        response.EnsureSuccessStatusCode();
-
-        // 处理接口返回数据
-        string responseBody = await response.Content.ReadAsStringAsync();
-        JObject jsonResponse = JObject.Parse(responseBody);
-
-        string base64Image = jsonResponse[&quot;result&quot;][&quot;image&quot;].ToString();
-        byte[] outputImageBytes = Convert.FromBase64String(base64Image);
-
-        File.WriteAllBytes(outputImagePath, outputImageBytes);
-        Console.WriteLine($&quot;Output image saved at {outputImagePath}&quot;);
-        Console.WriteLine(&quot;\nDetected texts:&quot;);
-        Console.WriteLine(jsonResponse[&quot;result&quot;][&quot;texts&quot;].ToString());
-    }
-}
-</code></pre></details>
-
-<details><summary>Node.js</summary>
-
-<pre><code class="language-js">const axios = require('axios');
-const fs = require('fs');
-
-const API_URL = 'http://localhost:8080/ocr'
-const imagePath = './demo.jpg'
-const outputImagePath = &quot;./out.jpg&quot;;
-
-let config = {
-   method: 'POST',
-   maxBodyLength: Infinity,
-   url: API_URL,
-   data: JSON.stringify({
-    'image': encodeImageToBase64(imagePath)  // Base64编码的文件内容或者图像URL
-  })
-};
-
-// 对本地图像进行Base64编码
-function encodeImageToBase64(filePath) {
-  const bitmap = fs.readFileSync(filePath);
-  return Buffer.from(bitmap).toString('base64');
-}
-
-// 调用API
-axios.request(config)
-.then((response) =&gt; {
-    // 处理接口返回数据
-    const result = response.data[&quot;result&quot;];
-    const imageBuffer = Buffer.from(result[&quot;image&quot;], 'base64');
-    fs.writeFile(outputImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${outputImagePath}`);
-    });
-    console.log(&quot;\nDetected texts:&quot;);
-    console.log(result[&quot;texts&quot;]);
-})
-.catch((error) =&gt; {
-  console.log(error);
-});
-</code></pre></details>
-
-<details><summary>PHP</summary>
-
-<pre><code class="language-php">&lt;?php
-
-$API_URL = &quot;http://localhost:8080/ocr&quot;; // 服务URL
-$image_path = &quot;./demo.jpg&quot;;
-$output_image_path = &quot;./out.jpg&quot;;
-
-// 对本地图像进行Base64编码
-$image_data = base64_encode(file_get_contents($image_path));
-$payload = array(&quot;image&quot; =&gt; $image_data); // Base64编码的文件内容或者图像URL
-
-// 调用API
-$ch = curl_init($API_URL);
-curl_setopt($ch, CURLOPT_POST, true);
-curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($payload));
-curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-Type: application/json'));
-curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
-$response = curl_exec($ch);
-curl_close($ch);
-
-// 处理接口返回数据
-$result = json_decode($response, true)[&quot;result&quot;];
-file_put_contents($output_image_path, base64_decode($result[&quot;image&quot;]));
-echo &quot;Output image saved at &quot; . $output_image_path . &quot;\n&quot;;
-echo &quot;\nDetected texts:\n&quot;;
-print_r($result[&quot;texts&quot;]);
-
-?&gt;
+for i, res in enumerate(result[&quot;ocrResults&quot;]):
+    print(&quot;Detected texts:&quot;)
+    print(res[&quot;texts&quot;])
+    output_img_path = f&quot;out_{i}.jpg&quot;
+    with open(output_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;image&quot;]))
+    print(f&quot;Output image saved at {output_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>
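
When the input is given as a URL rather than Base64, `fileType` may be omitted and is inferred from the URL, as the schema above notes. A minimal sketch with a placeholder URL (the file must be reachable from the server):

<pre><code class="language-python">import requests

API_URL = "http://localhost:8080/ocr"

# No "fileType" here: the server infers it from the URL.
payload = {"file": "https://example.com/demo.jpg"}  # placeholder URL

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
for res in response.json()["result"]["ocrResults"]:
    print(res["texts"])
</code></pre>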

+ 4 - 4
docs/pipeline_usage/tutorials/ocr_pipelines/doc_preprocessor.md

@@ -348,9 +348,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -442,13 +442,13 @@ for res in output:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
+<td>服务器可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则服务尝试根据URL自动推断文件类型。</td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 <tr>

+ 6 - 36
docs/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.en.md

@@ -283,13 +283,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -355,7 +355,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -377,38 +377,13 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file or PDF file accessible by the service, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
+<td>The URL of an image file or PDF file accessible by the server, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
 <td>Yes</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the service will attempt to infer the file type automatically based on the URL.</td>
-<td>No</td>
-</tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>Inference parameters.</td>
-<td>No</td>
-</tr>
-</tbody>
-</table>
-<p>Properties of <code>inferenceParams</code>:</p>
-<table>
-<thead>
-<tr>
-<th>Name</th>
-<th>Type</th>
-<th>Description</th>
-<th>Required</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>During inference, if the length of the longer side of the input image for the layout detection model is greater than <code>maxLongSide</code>, the image will be scaled so that the length of the longer side equals <code>maxLongSide</code>.</td>
+<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
 <td>No</td>
 </tr>
 </tbody>
@@ -462,11 +437,6 @@ Below are the API references and multi-language service invocation examples:
 <td><code>string</code></td>
 <td>Layout area detection result image. The image is in JPEG format and encoded using Base64.</td>
 </tr>
-<tr>
-<td><code>ocrImage</code></td>
-<td><code>string</code></td>
-<td>OCR result image. The image is in JPEG format and encoded using Base64.</td>
-</tr>
 </tbody>
 </table>
 <p>Each element in <code>formulas</code> is an <code>object</code> with the following properties:</p>

+ 4 - 34
docs/pipeline_usage/tutorials/ocr_pipelines/formula_recognition.md

@@ -281,9 +281,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -375,38 +375,13 @@ for res in output:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
+<td>服务器可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则服务将尝试根据URL自动推断文件类型。</td>
-<td>否</td>
-</tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>推理参数。</td>
-<td>否</td>
-</tr>
-</tbody>
-</table>
-<p><code>inferenceParams</code>的属性如下:</p>
-<table>
-<thead>
-<tr>
-<th>名称</th>
-<th>类型</th>
-<th>含义</th>
-<th>是否必填</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>推理时,若文本检测模型的输入图像较长边的长度大于<code>maxLongSide</code>,则将对图像进行缩放,使其较长边的长度等于<code>maxLongSide</code>。</td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 </tbody>
@@ -460,11 +435,6 @@ for res in output:
 <td><code>string</code></td>
 <td>版面区域检测结果图。图像为JPEG格式,使用Base64编码。</td>
 </tr>
-<tr>
-<td><code>ocrImage</code></td>
-<td><code>string</code></td>
-<td>OCR结果图。图像为JPEG格式,使用Base64编码。</td>
-</tr>
 </tbody>
 </table>
 <p><code>formulas</code>中的每个元素为一个<code>object</code>,具有如下属性:</p>

+ 6 - 31
docs/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.en.md

@@ -531,13 +531,13 @@ Additionally, PaddleX offers three other deployment methods, detailed as follows
 
 🚀 <b>High-Performance Inference</b>: In production environments, many applications require stringent performance metrics, especially response speed, to ensure efficient operation and smooth user experience. PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing for significant end-to-end speedups. For detailed instructions on high-performance inference, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service Deployment</b>: Service deployment is a common form in production environments, where reasoning functions are encapsulated as services accessible via network requests. PaddleX enables cost-effective service deployment of pipelines. For detailed instructions on service deployment, refer to the [PaddleX Service Deployment Guide](../../../pipeline_deploy/service_deploy.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -603,7 +603,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -625,13 +625,13 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file or PDF file accessible by the service, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
+<td>The URL of an image file or PDF file accessible by the server, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
 <td>Yes</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the service will attempt to infer the file type automatically based on the URL.</td>
+<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
 <td>No</td>
 </tr>
 <tr>
@@ -652,31 +652,6 @@ Below are the API references and multi-language service invocation examples:
 <td>Whether to enable seal text detection. This function is enabled by default.</td>
 <td>No</td>
 </tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>Inference parameters.</td>
-<td>No</td>
-</tr>
-</tbody>
-</table>
-<p>properties of <code>inferenceParams</code>:</p>
-<table>
-<thead>
-<tr>
-<th>Name</th>
-<th>Type</th>
-<th>Description</th>
-<th>Required</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>During inference, if the length of the longer side of the input image for the text detection model is greater than <code>maxLongSide</code>, the image will be scaled so that the length of the longer side equals <code>maxLongSide</code>.</td>
-<td>No</td>
-</tr>
 </tbody>
 </table>
 <ul>
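Against the simplified schema above, a basic invocation reduces to the following sketch. The endpoint name `/layout-parsing` is an assumption; `file` and `fileType` follow the request table, and optional toggles such as the seal text detection switch keep their defaults when omitted.

```python
import base64
import requests

# Hypothetical endpoint name; adjust to the deployed service.
API_URL = "http://localhost:8080/layout-parsing"

with open("./demo.jpg", "rb") as f:
    file_data = base64.b64encode(f.read()).decode("ascii")

# Optional switches (e.g., seal text detection) keep their default values
# when not present in the request body.
payload = {"file": file_data, "fileType": 1}

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
print(response.json()["result"])
```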

+ 4 - 29
docs/pipeline_usage/tutorials/ocr_pipelines/layout_parsing.md

@@ -534,9 +534,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -628,13 +628,13 @@ for res in output:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
+<td>服务器可访问的图像文件或PDF文件的URL，或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件，只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则服务尝试根据URL自动推断文件类型。</td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 <tr>
@@ -655,31 +655,6 @@ for res in output:
 <td>是否启用印章文本检测功能。默认启用该功能。</td>
 <td>否</td>
 </tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>推理参数。</td>
-<td>否</td>
-</tr>
-</tbody>
-</table>
-<p><code>inferenceParams</code>的属性如下:</p>
-<table>
-<thead>
-<tr>
-<th>名称</th>
-<th>类型</th>
-<th>含义</th>
-<th>是否必填</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>推理时,若文本检测模型的输入图像较长边的长度大于<code>maxLongSide</code>,则将对图像进行缩放,使其较长边的长度等于<code>maxLongSide</code>。</td>
-<td>否</td>
-</tr>
 </tbody>
 </table>
 <ul>
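For PDF input, `fileType` is set to `0`; per the schema above, only the first 10 pages of a longer document are processed. A sketch (endpoint name assumed, PDF path is a placeholder):

```python
import base64
import requests

# Hypothetical endpoint name; adjust to the deployed service.
API_URL = "http://localhost:8080/layout-parsing"

# PDF input: fileType 0. Only the first 10 pages of a longer PDF are used.
with open("./report.pdf", "rb") as f:
    payload = {"file": base64.b64encode(f.read()).decode("ascii"), "fileType": 0}

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
result = response.json()["result"]
print(result.keys())  # inspect the per-page result array and dataInfo
```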

+ 10 - 35
docs/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.en.md

@@ -462,13 +462,13 @@ In addition, PaddleX also offers three other deployment methods, detailed as fol
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have strict standards for the performance metrics of deployment strategies, especially response speed, to ensure efficient system operation and a smooth user experience. To this end, PaddleX provides a high-performance inference plugin aimed at deep performance optimization of model inference and pre/post-processing, achieving significant end-to-end acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ ** Service deployment * *: Service deployment is a common form of deployment in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users to achieve service-oriented deployment of production lines at low cost. For detailed service-oriented deployment processes, please refer to the PaddleX Service Deployment Guide (../../../ipeline_deploy/service_deploy. md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -534,7 +534,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -556,38 +556,13 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file or PDF file accessible by the service, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
+<td>The URL of an image file or PDF file accessible by the server, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
 <td>Yes</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the service will attempt to infer the file type automatically based on the URL.</td>
-<td>No</td>
-</tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>Inference parameters.</td>
-<td>No</td>
-</tr>
-</tbody>
-</table>
-<p>Properties of <code>inferenceParams</code>:</p>
-<table>
-<thead>
-<tr>
-<th>Name</th>
-<th>Type</th>
-<th>Description</th>
-<th>Required</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>During inference, if the length of the longer side of the input image for the layout detection model is greater than <code>maxLongSide</code>, the image will be scaled so that the length of the longer side equals <code>maxLongSide</code>.</td>
+<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
 <td>No</td>
 </tr>
 </tbody>
@@ -701,13 +676,13 @@ result = response.json()[&quot;result&quot;]
 for i, res in enumerate(result[&quot;sealRecResults&quot;]):
     print(&quot;Detected texts:&quot;)
     print(res[&quot;texts&quot;])
-    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
-    with open(ocr_img_path, &quot;wb&quot;) as f:
-        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
     layout_img_path = f&quot;layout_{i}.jpg&quot;
     with open(layout_img_path, &quot;wb&quot;) as f:
         f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
-    print(f&quot;Output images saved at {ocr_img_path} and {layout_img_path}&quot;)
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>

+ 8 - 33
docs/pipeline_usage/tutorials/ocr_pipelines/seal_recognition.md

@@ -472,9 +472,9 @@ for res in output:
 
 🚀 <b>高性能部署</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考[PaddleX高性能部署指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -566,38 +566,13 @@ for res in output:
 <tr>
 <td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
+<td>服务器可访问的图像文件或PDF文件的URL，或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件，只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
 <td><code>fileType</code></td>
 <td><code>integer</code></td>
-<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则服务将尝试根据URL自动推断文件类型。</td>
-<td>否</td>
-</tr>
-<tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>推理参数。</td>
-<td>否</td>
-</tr>
-</tbody>
-</table>
-<p><code>inferenceParams</code>的属性如下:</p>
-<table>
-<thead>
-<tr>
-<th>名称</th>
-<th>类型</th>
-<th>含义</th>
-<th>是否必填</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>推理时,若文本检测模型的输入图像较长边的长度大于<code>maxLongSide</code>,则将对图像进行缩放,使其较长边的长度等于<code>maxLongSide</code>。</td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 </tbody>
@@ -711,13 +686,13 @@ result = response.json()[&quot;result&quot;]
 for i, res in enumerate(result[&quot;sealRecResults&quot;]):
     print(&quot;Detected texts:&quot;)
     print(res[&quot;texts&quot;])
-    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
-    with open(ocr_img_path, &quot;wb&quot;) as f:
-        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
     layout_img_path = f&quot;layout_{i}.jpg&quot;
     with open(layout_img_path, &quot;wb&quot;) as f:
         f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
-    print(f&quot;Output images saved at {ocr_img_path} and {layout_img_path}&quot;)
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>

+ 36 - 402
docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition.en.md

@@ -349,13 +349,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that aim to deeply optimize model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -421,7 +421,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -441,41 +441,44 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
-<td><code>image</code></td>
+<td><code>file</code></td>
 <td><code>string</code></td>
-<td>The URL of an image file accessible by the service or the Base64 encoded result of the image file content.</td>
+<td>The URL of an image file or PDF file accessible by the server, or the Base64 encoded result of the content of the above-mentioned file types. For PDF files with more than 10 pages, only the content of the first 10 pages will be used.</td>
 <td>Yes</td>
 </tr>
 <tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>Inference parameters.</td>
+<td><code>fileType</code></td>
+<td><code>integer</code></td>
+<td>File type. <code>0</code> indicates a PDF file, and <code>1</code> indicates an image file. If this property is not present in the request body, the file type will be inferred based on the URL.</td>
 <td>No</td>
 </tr>
 </tbody>
 </table>
-<p>Properties of <code>inferenceParams</code>:</p>
+<ul>
+<li>When the request is processed successfully, the <code>result</code> of the response body has the following properties:</li>
+</ul>
 <table>
 <thead>
 <tr>
 <th>Name</th>
 <th>Type</th>
 <th>Description</th>
-<th>Required</th>
 </tr>
 </thead>
 <tbody>
 <tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>During inference, if the length of the longer side of the input image for the text detection model is greater than <code>maxLongSide</code>, the image will be scaled so that the length of the longer side equals <code>maxLongSide</code>.</td>
-<td>No</td>
+<td><code>tableRecResults</code></td>
+<td><code>array</code></td>
+<td>Table recognition results. The array length is 1 (for image input) or the smaller of the number of document pages and 10 (for PDF input). For PDF input, each element in the array represents the processing result of the corresponding page of the PDF file, in order.</td>
+</tr>
+<tr>
+<td><code>dataInfo</code></td>
+<td><code>object</code></td>
+<td>Information about the input data.</td>
 </tr>
 </tbody>
 </table>
-<ul>
-<li>When the request is processed successfully, the <code>result</code> of the response body has the following properties:</li>
-</ul>
+<p>Each element in <code>tableRecResults</code> is an <code>object</code> with the following properties:</p>
 <table>
 <thead>
 <tr>
@@ -535,397 +538,28 @@ Below are the API references and multi-language service invocation examples:
 import requests
 
 API_URL = &quot;http://localhost:8080/table-recognition&quot;
-image_path = &quot;./demo.jpg&quot;
-ocr_image_path = &quot;./ocr.jpg&quot;
-layout_image_path = &quot;./layout.jpg&quot;
+file_path = &quot;./demo.jpg&quot;
 
-with open(image_path, &quot;rb&quot;) as file:
-    image_bytes = file.read()
-    image_data = base64.b64encode(image_bytes).decode(&quot;ascii&quot;)
+with open(file_path, &quot;rb&quot;) as file:
+    file_bytes = file.read()
+    file_data = base64.b64encode(file_bytes).decode(&quot;ascii&quot;)
 
-payload = {&quot;image&quot;: image_data}
+payload = {&quot;file&quot;: file_data, &quot;fileType&quot;: 1}
 
 response = requests.post(API_URL, json=payload)
 
 assert response.status_code == 200
 result = response.json()[&quot;result&quot;]
-with open(ocr_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;ocrImage&quot;]))
-print(f&quot;Output image saved at {ocr_image_path}&quot;)
-with open(layout_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;layoutImage&quot;]))
-print(f&quot;Output image saved at {layout_image_path}&quot;)
-print(&quot;\nDetected tables:&quot;)
-print(result[&quot;tables&quot;])
-</code></pre></details>
-
-<details><summary>C++</summary>
-
-<pre><code class="language-cpp">#include &lt;iostream&gt;
-#include &quot;cpp-httplib/httplib.h&quot; // https://github.com/Huiyicc/cpp-httplib
-#include &quot;nlohmann/json.hpp&quot; // https://github.com/nlohmann/json
-#include &quot;base64.hpp&quot; // https://github.com/tobiaslocker/base64
-
-int main() {
-    httplib::Client client(&quot;localhost:8080&quot;);
-    const std::string imagePath = &quot;./demo.jpg&quot;;
-    const std::string ocrImagePath = &quot;./ocr.jpg&quot;;
-    const std::string layoutImagePath = &quot;./layout.jpg&quot;;
-
-    httplib::Headers headers = {
-        {&quot;Content-Type&quot;, &quot;application/json&quot;}
-    };
-
-    std::ifstream file(imagePath, std::ios::binary | std::ios::ate);
-    std::streamsize size = file.tellg();
-    file.seekg(0, std::ios::beg);
-
-    std::vector&lt;char&gt; buffer(size);
-    if (!file.read(buffer.data(), size)) {
-        std::cerr &lt;&lt; &quot;Error reading file.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-    std::string bufferStr(reinterpret_cast&lt;const char*&gt;(buffer.data()), buffer.size());
-    std::string encodedImage = base64::to_base64(bufferStr);
-
-    nlohmann::json jsonObj;
-    jsonObj[&quot;image&quot;] = encodedImage;
-    std::string body = jsonObj.dump();
-
-    auto response = client.Post(&quot;/table-recognition&quot;, headers, body, &quot;application/json&quot;);
-
-    if (response &amp;&amp; response-&gt;status == 200) {
-        nlohmann::json jsonResponse = nlohmann::json::parse(response-&gt;body);
-        auto result = jsonResponse[&quot;result&quot;];
-
-        encodedImage = result[&quot;ocrImage&quot;];
-        std::string decoded_string = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedOcrImage(decoded_string.begin(), decoded_string.end());
-        std::ofstream outputOcrFile(ocrImagePath, std::ios::binary | std::ios::out);
-        if (outputOcrFile.is_open()) {
-            outputOcrFile.write(reinterpret_cast&lt;char*&gt;(decodedOcrImage.data()), decodedOcrImage.size());
-            outputOcrFile.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; ocrImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; ocrImagePath &lt;&lt; std::endl;
-        }
-
-        encodedImage = result[&quot;layoutImage&quot;];
-        decodedString = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedLayoutImage(decodedString.begin(), decodedString.end());
-        std::ofstream outputLayoutFile(layoutImagePath, std::ios::binary | std::ios::out);
-        if (outputLayoutFile.is_open()) {
-            outputLayoutFile.write(reinterpret_cast&lt;char*&gt;(decodedLayoutImage.data()), decodedlayoutImage.size());
-            outputLayoutFile.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; layoutImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; layoutImagePath &lt;&lt; std::endl;
-        }
-
-        auto tables = result[&quot;tables&quot;];
-        std::cout &lt;&lt; &quot;\nDetected tables:&quot; &lt;&lt; std::endl;
-        for (const auto&amp; table : tables) {
-            std::cout &lt;&lt; table &lt;&lt; std::endl;
-        }
-    } else {
-        std::cout &lt;&lt; &quot;Failed to send HTTP request.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-
-    return 0;
-}
-</code></pre></details>
-
-<details><summary>Java</summary>
-
-<pre><code class="language-java">import okhttp3.*;
-import com.fasterxml.jackson.databind.ObjectMapper;
-import com.fasterxml.jackson.databind.JsonNode;
-import com.fasterxml.jackson.databind.node.ObjectNode;
-
-import java.io.File;
-import java.io.FileOutputStream;
-import java.io.IOException;
-import java.util.Base64;
-
-public class Main {
-    public static void main(String[] args) throws IOException {
-        String API_URL = &quot;http://localhost:8080/table-recognition&quot;;
-        String imagePath = &quot;./demo.jpg&quot;;
-        String ocrImagePath = &quot;./ocr.jpg&quot;;
-        String layoutImagePath = &quot;./layout.jpg&quot;;
-
-        File file = new File(imagePath);
-        byte[] fileContent = java.nio.file.Files.readAllBytes(file.toPath());
-        String imageData = Base64.getEncoder().encodeToString(fileContent);
-
-        ObjectMapper objectMapper = new ObjectMapper();
-        ObjectNode params = objectMapper.createObjectNode();
-        params.put(&quot;image&quot;, imageData);
-
-        OkHttpClient client = new OkHttpClient();
-        MediaType JSON = MediaType.Companion.get(&quot;application/json; charset=utf-8&quot;);
-        RequestBody body = RequestBody.Companion.create(params.toString(), JSON);
-        Request request = new Request.Builder()
-                .url(API_URL)
-                .post(body)
-                .build();
-
-        try (Response response = client.newCall(request).execute()) {
-            if (response.isSuccessful()) {
-                String responseBody = response.body().string();
-                JsonNode resultNode = objectMapper.readTree(responseBody);
-                JsonNode result = resultNode.get(&quot;result&quot;);
-                String ocrBase64Image = result.get(&quot;ocrImage&quot;).asText();
-                String layoutBase64Image = result.get(&quot;layoutImage&quot;).asText();
-                JsonNode tables = result.get(&quot;tables&quot;);
-
-                byte[] imageBytes = Base64.getDecoder().decode(ocrBase64Image);
-                try (FileOutputStream fos = new FileOutputStream(ocrImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + ocrBase64Image);
-
-                imageBytes = Base64.getDecoder().decode(layoutBase64Image);
-                try (FileOutputStream fos = new FileOutputStream(layoutImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + layoutImagePath);
-
-                System.out.println(&quot;\nDetected tables: &quot; + tables.toString());
-            } else {
-                System.err.println(&quot;Request failed with code: &quot; + response.code());
-            }
-        }
-    }
-}
-</code></pre></details>
-
-<details><summary>Go</summary>
-
-<pre><code class="language-go">package main
-
-import (
-    &quot;bytes&quot;
-    &quot;encoding/base64&quot;
-    &quot;encoding/json&quot;
-    &quot;fmt&quot;
-    &quot;io/ioutil&quot;
-    &quot;net/http&quot;
-)
-
-func main() {
-    API_URL := &quot;http://localhost:8080/table-recognition&quot;
-    imagePath := &quot;./demo.jpg&quot;
-    ocrImagePath := &quot;./ocr.jpg&quot;
-    layoutImagePath := &quot;./layout.jpg&quot;
-
-    imageBytes, err := ioutil.ReadFile(imagePath)
-    if err != nil {
-        fmt.Println(&quot;Error reading image file:&quot;, err)
-        return
-    }
-    imageData := base64.StdEncoding.EncodeToString(imageBytes)
-
-    payload := map[string]string{&quot;image&quot;: imageData}
-    payloadBytes, err := json.Marshal(payload)
-    if err != nil {
-        fmt.Println(&quot;Error marshaling payload:&quot;, err)
-        return
-    }
-
-    client := &amp;http.Client{}
-    req, err := http.NewRequest(&quot;POST&quot;, API_URL, bytes.NewBuffer(payloadBytes))
-    if err != nil {
-        fmt.Println(&quot;Error creating request:&quot;, err)
-        return
-    }
-
-    res, err := client.Do(req)
-    if err != nil {
-        fmt.Println(&quot;Error sending request:&quot;, err)
-        return
-    }
-    defer res.Body.Close()
-
-    body, err := ioutil.ReadAll(res.Body)
-    if err != nil {
-        fmt.Println(&quot;Error reading response body:&quot;, err)
-        return
-    }
-    type Response struct {
-        Result struct {
-            OcrImage      string   `json:&quot;ocrImage&quot;`
-            LayoutImage      string   `json:&quot;layoutImage&quot;`
-            Tables []map[string]interface{} `json:&quot;tables&quot;`
-        } `json:&quot;result&quot;`
-    }
-    var respData Response
-    err = json.Unmarshal([]byte(string(body)), &amp;respData)
-    if err != nil {
-        fmt.Println(&quot;Error unmarshaling response body:&quot;, err)
-        return
-    }
-
-    ocrImageData, err := base64.StdEncoding.DecodeString(respData.Result.OcrImage)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(ocrImagePath, ocrImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, ocrImagePath)
-
-    layoutImageData, err := base64.StdEncoding.DecodeString(respData.Result.LayoutImage)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(layoutImagePath, layoutImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, layoutImagePath)
-
-    fmt.Println(&quot;\nDetected tables:&quot;)
-    for _, table := range respData.Result.Tables {
-        fmt.Println(table)
-    }
-}
-</code></pre></details>
-
-<details><summary>C#</summary>
-
-<pre><code class="language-csharp">using System;
-using System.IO;
-using System.Net.Http;
-using System.Net.Http.Headers;
-using System.Text;
-using System.Threading.Tasks;
-using Newtonsoft.Json.Linq;
-
-class Program
-{
-    static readonly string API_URL = &quot;http://localhost:8080/table-recognition&quot;;
-    static readonly string imagePath = &quot;./demo.jpg&quot;;
-    static readonly string ocrImagePath = &quot;./ocr.jpg&quot;;
-    static readonly string layoutImagePath = &quot;./layout.jpg&quot;;
-
-    static async Task Main(string[] args)
-    {
-        var httpClient = new HttpClient();
-
-        byte[] imageBytes = File.ReadAllBytes(imagePath);
-        string image_data = Convert.ToBase64String(imageBytes);
-
-        var payload = new JObject{ { &quot;image&quot;, image_data } };
-        var content = new StringContent(payload.ToString(), Encoding.UTF8, &quot;application/json&quot;);
-
-        HttpResponseMessage response = await httpClient.PostAsync(API_URL, content);
-        response.EnsureSuccessStatusCode();
-
-        string responseBody = await response.Content.ReadAsStringAsync();
-        JObject jsonResponse = JObject.Parse(responseBody);
-
-        string ocrBase64Image = jsonResponse[&quot;result&quot;][&quot;ocrImage&quot;].ToString();
-        byte[] ocrImageBytes = Convert.FromBase64String(ocrBase64Image);
-        File.WriteAllBytes(ocrImagePath, ocrImageBytes);
-        Console.WriteLine($&quot;Output image saved at {ocrImagePath}&quot;);
-
-        string layoutBase64Image = jsonResponse[&quot;result&quot;][&quot;layoutImage&quot;].ToString();
-        byte[] layoutImageBytes = Convert.FromBase64String(layoutBase64Image);
-        File.WriteAllBytes(layoutImagePath, layoutImageBytes);
-        Console.WriteLine($&quot;Output image saved at {layoutImagePath}&quot;);
-
-        Console.WriteLine(&quot;\nDetected tables:&quot;);
-        Console.WriteLine(jsonResponse[&quot;result&quot;][&quot;tables&quot;].ToString());
-    }
-}
-</code></pre></details>
-
-<details><summary>Node.js</summary>
-
-<pre><code class="language-js">const axios = require('axios');
-const fs = require('fs');
-
-const API_URL = 'http://localhost:8080/table-recognition'
-const imagePath = './demo.jpg'
-const ocrImagePath = &quot;./ocr.jpg&quot;;
-const layoutImagePath = &quot;./layout.jpg&quot;;
-
-let config = {
-   method: 'POST',
-   maxBodyLength: Infinity,
-   url: API_URL,
-   data: JSON.stringify({
-    'image': encodeImageToBase64(imagePath)
-  })
-};
-
-function encodeImageToBase64(filePath) {
-  const bitmap = fs.readFileSync(filePath);
-  return Buffer.from(bitmap).toString('base64');
-}
-
-axios.request(config)
-.then((response) =&gt; {
-    const result = response.data[&quot;result&quot;];
-
-    const imageBuffer = Buffer.from(result[&quot;ocrImage&quot;], 'base64');
-    fs.writeFile(ocrImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${ocrImagePath}`);
-    });
-
-    imageBuffer = Buffer.from(result[&quot;layoutImage&quot;], 'base64');
-    fs.writeFile(layoutImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${layoutImagePath}`);
-    });
-
-    console.log(&quot;\nDetected tables:&quot;);
-    console.log(result[&quot;tables&quot;]);
-})
-.catch((error) =&gt; {
-  console.log(error);
-});
-</code></pre></details>
-
-<details><summary>PHP</summary>
-
-<pre><code class="language-php">&lt;?php
-
-$API_URL = &quot;http://localhost:8080/table-recognition&quot;;
-$image_path = &quot;./demo.jpg&quot;;
-$ocr_image_path = &quot;./ocr.jpg&quot;;
-$layout_image_path = &quot;./layout.jpg&quot;;
-
-$image_data = base64_encode(file_get_contents($image_path));
-$payload = array(&quot;image&quot; =&gt; $image_data);
-
-$ch = curl_init($API_URL);
-curl_setopt($ch, CURLOPT_POST, true);
-curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($payload));
-curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-Type: application/json'));
-curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
-$response = curl_exec($ch);
-curl_close($ch);
-
-$result = json_decode($response, true)[&quot;result&quot;];
-file_put_contents($ocr_image_path, base64_decode($result[&quot;ocrImage&quot;]));
-echo &quot;Output image saved at &quot; . $ocr_image_path . &quot;\n&quot;;
-
-file_put_contents($layout_image_path, base64_decode($result[&quot;layoutImage&quot;]));
-echo &quot;Output image saved at &quot; . $layout_image_path . &quot;\n&quot;;
-
-echo &quot;\nDetected tables:\n&quot;;
-print_r($result[&quot;tables&quot;]);
-
-?&gt;
+for i, res in enumerate(result[&quot;tableRecResults&quot;]):
+    print(&quot;Detected tables:&quot;)
+    print(res[&quot;tables&quot;])
+    layout_img_path = f&quot;layout_{i}.jpg&quot;
+    with open(layout_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>
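Since `tableRecResults` holds one element per processed page, PDF input can be checked end to end with a small variation of the example above. Only the endpoint path `/table-recognition` and the `tables`/`tableRecResults` fields are taken from this document; the PDF path is a placeholder.

```python
import base64
import requests

API_URL = "http://localhost:8080/table-recognition"

# PDF input: fileType 0. tableRecResults then holds one element per page,
# capped at the first 10 pages.
with open("./tables.pdf", "rb") as f:
    payload = {"file": base64.b64encode(f.read()).decode("ascii"), "fileType": 0}

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
result = response.json()["result"]
print(f"pages processed: {len(result['tableRecResults'])}")
for res in result["tableRecResults"]:
    print(res["tables"])
```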

+ 35 - 421
docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition.md

@@ -433,9 +433,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -525,41 +525,44 @@ for res in output:
 </thead>
 <tbody>
 <tr>
-<td><code>image</code></td>
+<td><code>file</code></td>
 <td><code>string</code></td>
-<td>服务可访问的图像文件的URL或图像文件内容的Base64编码结果。</td>
+<td>服务器可访问的图像文件或PDF文件的URL,或上述类型文件内容的Base64编码结果。对于超过10页的PDF文件,只有前10页的内容会被使用。</td>
 <td>是</td>
 </tr>
 <tr>
-<td><code>inferenceParams</code></td>
-<td><code>object</code></td>
-<td>推理参数。</td>
+<td><code>fileType</code></td>
+<td><code>integer</code></td>
+<td>文件类型。<code>0</code>表示PDF文件,<code>1</code>表示图像文件。若请求体无此属性,则将根据URL推断文件类型。</td>
 <td>否</td>
 </tr>
 </tbody>
 </table>
-<p><code>inferenceParams</code>的属性如下:</p>
+<ul>
+<li>请求处理成功时,响应体的<code>result</code>具有如下属性:</li>
+</ul>
 <table>
 <thead>
 <tr>
 <th>名称</th>
 <th>类型</th>
 <th>含义</th>
-<th>是否必填</th>
 </tr>
 </thead>
 <tbody>
 <tr>
-<td><code>maxLongSide</code></td>
-<td><code>integer</code></td>
-<td>推理时,若文本检测模型的输入图像较长边的长度大于<code>maxLongSide</code>,则将对图像进行缩放,使其较长边的长度等于<code>maxLongSide</code>。</td>
-<td>否</td>
+<td><code>tableRecResults</code></td>
+<td><code>array</code></td>
+<td>表格识别结果。数组长度为1(对于图像输入)或文档页数与10中的较小者(对于PDF输入)。对于PDF输入,数组中的每个元素依次表示PDF文件中每一页的处理结果。</td>
+</tr>
+<tr>
+<td><code>dataInfo</code></td>
+<td><code>object</code></td>
+<td>输入数据信息。</td>
 </tr>
 </tbody>
 </table>
-<ul>
-<li>请求处理成功时,响应体的<code>result</code>具有如下属性:</li>
-</ul>
+<p><code>tableRecResults</code>中的每个元素为一个<code>object</code>,具有如下属性:</p>
 <table>
 <thead>
 <tr>
@@ -618,418 +621,29 @@ for res in output:
 <pre><code class="language-python">import base64
 import requests
 
-API_URL = &quot;http://localhost:8080/table-recognition&quot; # 服务URL
-image_path = &quot;./demo.jpg&quot;
-ocr_image_path = &quot;./ocr.jpg&quot;
-layout_image_path = &quot;./layout.jpg&quot;
+API_URL = &quot;http://localhost:8080/table-recognition&quot;
+file_path = &quot;./demo.jpg&quot;
 
-# 对本地图像进行Base64编码
-with open(image_path, &quot;rb&quot;) as file:
-    image_bytes = file.read()
-    image_data = base64.b64encode(image_bytes).decode(&quot;ascii&quot;)
+with open(file_path, &quot;rb&quot;) as file:
+    file_bytes = file.read()
+    file_data = base64.b64encode(file_bytes).decode(&quot;ascii&quot;)
 
-payload = {&quot;image&quot;: image_data}  # Base64编码的文件内容或者图像URL
+payload = {&quot;file&quot;: file_data, &quot;fileType&quot;: 1}
 
-# 调用API
 response = requests.post(API_URL, json=payload)
 
-# 处理接口返回数据
 assert response.status_code == 200
 result = response.json()[&quot;result&quot;]
-with open(ocr_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;ocrImage&quot;]))
-print(f&quot;Output image saved at {ocr_image_path}&quot;)
-with open(layout_image_path, &quot;wb&quot;) as file:
-    file.write(base64.b64decode(result[&quot;layoutImage&quot;]))
-print(f&quot;Output image saved at {layout_image_path}&quot;)
-print(&quot;\nDetected tables:&quot;)
-print(result[&quot;tables&quot;])
-</code></pre></details>
-
-<details><summary>C++</summary>
-
-<pre><code class="language-cpp">#include &lt;iostream&gt;
-#include &quot;cpp-httplib/httplib.h&quot; // https://github.com/Huiyicc/cpp-httplib
-#include &quot;nlohmann/json.hpp&quot; // https://github.com/nlohmann/json
-#include &quot;base64.hpp&quot; // https://github.com/tobiaslocker/base64
-
-int main() {
-    httplib::Client client(&quot;localhost:8080&quot;);
-    const std::string imagePath = &quot;./demo.jpg&quot;;
-    const std::string ocrImagePath = &quot;./ocr.jpg&quot;;
-    const std::string layoutImagePath = &quot;./layout.jpg&quot;;
-
-    httplib::Headers headers = {
-        {&quot;Content-Type&quot;, &quot;application/json&quot;}
-    };
-
-    // 对本地图像进行Base64编码
-    std::ifstream file(imagePath, std::ios::binary | std::ios::ate);
-    std::streamsize size = file.tellg();
-    file.seekg(0, std::ios::beg);
-
-    std::vector&lt;char&gt; buffer(size);
-    if (!file.read(buffer.data(), size)) {
-        std::cerr &lt;&lt; &quot;Error reading file.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-    std::string bufferStr(reinterpret_cast&lt;const char*&gt;(buffer.data()), buffer.size());
-    std::string encodedImage = base64::to_base64(bufferStr);
-
-    nlohmann::json jsonObj;
-    jsonObj[&quot;image&quot;] = encodedImage;
-    std::string body = jsonObj.dump();
-
-    // 调用API
-    auto response = client.Post(&quot;/table-recognition&quot;, headers, body, &quot;application/json&quot;);
-    // 处理接口返回数据
-    if (response &amp;&amp; response-&gt;status == 200) {
-        nlohmann::json jsonResponse = nlohmann::json::parse(response-&gt;body);
-        auto result = jsonResponse[&quot;result&quot;];
-
-        encodedImage = result[&quot;ocrImage&quot;];
-        std::string decoded_string = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedOcrImage(decoded_string.begin(), decoded_string.end());
-        std::ofstream outputOcrFile(ocrImagePath, std::ios::binary | std::ios::out);
-        if (outputOcrFile.is_open()) {
-            outputOcrFile.write(reinterpret_cast&lt;char*&gt;(decodedOcrImage.data()), decodedOcrImage.size());
-            outputOcrFile.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; ocrImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; ocrImagePath &lt;&lt; std::endl;
-        }
-
-        encodedImage = result[&quot;layoutImage&quot;];
-        decodedString = base64::from_base64(encodedImage);
-        std::vector&lt;unsigned char&gt; decodedLayoutImage(decodedString.begin(), decodedString.end());
-        std::ofstream outputLayoutFile(layoutImagePath, std::ios::binary | std::ios::out);
-        if (outputLayoutFile.is_open()) {
-            outputLayoutFile.write(reinterpret_cast&lt;char*&gt;(decodedLayoutImage.data()), decodedLayoutImage.size());
-            outputLayoutFile.close();
-            std::cout &lt;&lt; &quot;Output image saved at &quot; &lt;&lt; layoutImagePath &lt;&lt; std::endl;
-        } else {
-            std::cerr &lt;&lt; &quot;Unable to open file for writing: &quot; &lt;&lt; layoutImagePath &lt;&lt; std::endl;
-        }
-
-        auto tables = result[&quot;tables&quot;];
-        std::cout &lt;&lt; &quot;\nDetected tables:&quot; &lt;&lt; std::endl;
-        for (const auto&amp; table : tables) {
-            std::cout &lt;&lt; table &lt;&lt; std::endl;
-        }
-    } else {
-        std::cout &lt;&lt; &quot;Failed to send HTTP request.&quot; &lt;&lt; std::endl;
-        return 1;
-    }
-
-    return 0;
-}
-</code></pre></details>
-
-<details><summary>Java</summary>
-
-<pre><code class="language-java">import okhttp3.*;
-import com.fasterxml.jackson.databind.ObjectMapper;
-import com.fasterxml.jackson.databind.JsonNode;
-import com.fasterxml.jackson.databind.node.ObjectNode;
-
-import java.io.File;
-import java.io.FileOutputStream;
-import java.io.IOException;
-import java.util.Base64;
-
-public class Main {
-    public static void main(String[] args) throws IOException {
-        String API_URL = &quot;http://localhost:8080/table-recognition&quot;; // 服务URL
-        String imagePath = &quot;./demo.jpg&quot;; // 本地图像
-        String ocrImagePath = &quot;./ocr.jpg&quot;;
-        String layoutImagePath = &quot;./layout.jpg&quot;;
-
-        // 对本地图像进行Base64编码
-        File file = new File(imagePath);
-        byte[] fileContent = java.nio.file.Files.readAllBytes(file.toPath());
-        String imageData = Base64.getEncoder().encodeToString(fileContent);
-
-        ObjectMapper objectMapper = new ObjectMapper();
-        ObjectNode params = objectMapper.createObjectNode();
-        params.put(&quot;image&quot;, imageData); // Base64编码的文件内容或者图像URL
-
-        // 创建 OkHttpClient 实例
-        OkHttpClient client = new OkHttpClient();
-        MediaType JSON = MediaType.Companion.get(&quot;application/json; charset=utf-8&quot;);
-        RequestBody body = RequestBody.Companion.create(params.toString(), JSON);
-        Request request = new Request.Builder()
-                .url(API_URL)
-                .post(body)
-                .build();
-
-        // 调用API并处理接口返回数据
-        try (Response response = client.newCall(request).execute()) {
-            if (response.isSuccessful()) {
-                String responseBody = response.body().string();
-                JsonNode resultNode = objectMapper.readTree(responseBody);
-                JsonNode result = resultNode.get(&quot;result&quot;);
-                String ocrBase64Image = result.get(&quot;ocrImage&quot;).asText();
-                String layoutBase64Image = result.get(&quot;layoutImage&quot;).asText();
-                JsonNode tables = result.get(&quot;tables&quot;);
-
-                byte[] imageBytes = Base64.getDecoder().decode(ocrBase64Image);
-                try (FileOutputStream fos = new FileOutputStream(ocrImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + ocrBase64Image);
-
-                imageBytes = Base64.getDecoder().decode(layoutBase64Image);
-                try (FileOutputStream fos = new FileOutputStream(layoutImagePath)) {
-                    fos.write(imageBytes);
-                }
-                System.out.println(&quot;Output image saved at &quot; + layoutImagePath);
-
-                System.out.println(&quot;\nDetected tables: &quot; + tables.toString());
-            } else {
-                System.err.println(&quot;Request failed with code: &quot; + response.code());
-            }
-        }
-    }
-}
-</code></pre></details>
-
-<details><summary>Go</summary>
-
-<pre><code class="language-go">package main
-
-import (
-    &quot;bytes&quot;
-    &quot;encoding/base64&quot;
-    &quot;encoding/json&quot;
-    &quot;fmt&quot;
-    &quot;io/ioutil&quot;
-    &quot;net/http&quot;
-)
-
-func main() {
-    API_URL := &quot;http://localhost:8080/table-recognition&quot;
-    imagePath := &quot;./demo.jpg&quot;
-    ocrImagePath := &quot;./ocr.jpg&quot;
-    layoutImagePath := &quot;./layout.jpg&quot;
-
-    // 对本地图像进行Base64编码
-    imageBytes, err := ioutil.ReadFile(imagePath)
-    if err != nil {
-        fmt.Println(&quot;Error reading image file:&quot;, err)
-        return
-    }
-    imageData := base64.StdEncoding.EncodeToString(imageBytes)
-
-    payload := map[string]string{&quot;image&quot;: imageData} // Base64编码的文件内容或者图像URL
-    payloadBytes, err := json.Marshal(payload)
-    if err != nil {
-        fmt.Println(&quot;Error marshaling payload:&quot;, err)
-        return
-    }
-
-    // 调用API
-    client := &amp;http.Client{}
-    req, err := http.NewRequest(&quot;POST&quot;, API_URL, bytes.NewBuffer(payloadBytes))
-    if err != nil {
-        fmt.Println(&quot;Error creating request:&quot;, err)
-        return
-    }
-
-    res, err := client.Do(req)
-    if err != nil {
-        fmt.Println(&quot;Error sending request:&quot;, err)
-        return
-    }
-    defer res.Body.Close()
-
-    // 处理接口返回数据
-    body, err := ioutil.ReadAll(res.Body)
-    if err != nil {
-        fmt.Println(&quot;Error reading response body:&quot;, err)
-        return
-    }
-    type Response struct {
-        Result struct {
-            OcrImage      string   `json:&quot;ocrImage&quot;`
-            LayoutImage      string   `json:&quot;layoutImage&quot;`
-            Tables []map[string]interface{} `json:&quot;tables&quot;`
-        } `json:&quot;result&quot;`
-    }
-    var respData Response
-    err = json.Unmarshal([]byte(string(body)), &amp;respData)
-    if err != nil {
-        fmt.Println(&quot;Error unmarshaling response body:&quot;, err)
-        return
-    }
-
-    ocrImageData, err := base64.StdEncoding.DecodeString(respData.Result.OcrImage)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(ocrImagePath, ocrImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, ocrImagePath)
-
-    layoutImageData, err := base64.StdEncoding.DecodeString(respData.Result.LayoutImage)
-    if err != nil {
-        fmt.Println(&quot;Error decoding base64 image data:&quot;, err)
-        return
-    }
-    err = ioutil.WriteFile(layoutImagePath, layoutImageData, 0644)
-    if err != nil {
-        fmt.Println(&quot;Error writing image to file:&quot;, err)
-        return
-    }
-    fmt.Printf(&quot;Image saved at %s.jpg\n&quot;, layoutImagePath)
-
-    fmt.Println(&quot;\nDetected tables:&quot;)
-    for _, table := range respData.Result.Tables {
-        fmt.Println(table)
-    }
-}
-</code></pre></details>
-
-<details><summary>C#</summary>
-
-<pre><code class="language-csharp">using System;
-using System.IO;
-using System.Net.Http;
-using System.Net.Http.Headers;
-using System.Text;
-using System.Threading.Tasks;
-using Newtonsoft.Json.Linq;
-
-class Program
-{
-    static readonly string API_URL = &quot;http://localhost:8080/table-recognition&quot;;
-    static readonly string imagePath = &quot;./demo.jpg&quot;;
-    static readonly string ocrImagePath = &quot;./ocr.jpg&quot;;
-    static readonly string layoutImagePath = &quot;./layout.jpg&quot;;
-
-    static async Task Main(string[] args)
-    {
-        var httpClient = new HttpClient();
-
-        // 对本地图像进行Base64编码
-        byte[] imageBytes = File.ReadAllBytes(imagePath);
-        string image_data = Convert.ToBase64String(imageBytes);
-
-        var payload = new JObject{ { &quot;image&quot;, image_data } }; // Base64编码的文件内容或者图像URL
-        var content = new StringContent(payload.ToString(), Encoding.UTF8, &quot;application/json&quot;);
-
-        // 调用API
-        HttpResponseMessage response = await httpClient.PostAsync(API_URL, content);
-        response.EnsureSuccessStatusCode();
-
-        // 处理接口返回数据
-        string responseBody = await response.Content.ReadAsStringAsync();
-        JObject jsonResponse = JObject.Parse(responseBody);
-
-        string ocrBase64Image = jsonResponse[&quot;result&quot;][&quot;ocrImage&quot;].ToString();
-        byte[] ocrImageBytes = Convert.FromBase64String(ocrBase64Image);
-        File.WriteAllBytes(ocrImagePath, ocrImageBytes);
-        Console.WriteLine($&quot;Output image saved at {ocrImagePath}&quot;);
-
-        string layoutBase64Image = jsonResponse[&quot;result&quot;][&quot;layoutImage&quot;].ToString();
-        byte[] layoutImageBytes = Convert.FromBase64String(layoutBase64Image);
-        File.WriteAllBytes(layoutImagePath, layoutImageBytes);
-        Console.WriteLine($&quot;Output image saved at {layoutImagePath}&quot;);
-
-        Console.WriteLine(&quot;\nDetected tables:&quot;);
-        Console.WriteLine(jsonResponse[&quot;result&quot;][&quot;tables&quot;].ToString());
-    }
-}
-</code></pre></details>
-
-<details><summary>Node.js</summary>
-
-<pre><code class="language-js">const axios = require('axios');
-const fs = require('fs');
-
-const API_URL = 'http://localhost:8080/table-recognition'
-const imagePath = './demo.jpg'
-const ocrImagePath = &quot;./ocr.jpg&quot;;
-const layoutImagePath = &quot;./layout.jpg&quot;;
-
-let config = {
-   method: 'POST',
-   maxBodyLength: Infinity,
-   url: API_URL,
-   data: JSON.stringify({
-    'image': encodeImageToBase64(imagePath)  // Base64编码的文件内容或者图像URL
-  })
-};
-
-// 对本地图像进行Base64编码
-function encodeImageToBase64(filePath) {
-  const bitmap = fs.readFileSync(filePath);
-  return Buffer.from(bitmap).toString('base64');
-}
-
-// 调用API
-axios.request(config)
-.then((response) =&gt; {
-    // 处理接口返回数据
-    const result = response.data[&quot;result&quot;];
-
-    const imageBuffer = Buffer.from(result[&quot;ocrImage&quot;], 'base64');
-    fs.writeFile(ocrImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${ocrImagePath}`);
-    });
-
-    imageBuffer = Buffer.from(result[&quot;layoutImage&quot;], 'base64');
-    fs.writeFile(layoutImagePath, imageBuffer, (err) =&gt; {
-      if (err) throw err;
-      console.log(`Output image saved at ${layoutImagePath}`);
-    });
-
-    console.log(&quot;\nDetected tables:&quot;);
-    console.log(result[&quot;tables&quot;]);
-})
-.catch((error) =&gt; {
-  console.log(error);
-});
-</code></pre></details>
-
-<details><summary>PHP</summary>
-
-<pre><code class="language-php">&lt;?php
-
-$API_URL = &quot;http://localhost:8080/table-recognition&quot;; // 服务URL
-$image_path = &quot;./demo.jpg&quot;;
-$ocr_image_path = &quot;./ocr.jpg&quot;;
-$layout_image_path = &quot;./layout.jpg&quot;;
-
-// 对本地图像进行Base64编码
-$image_data = base64_encode(file_get_contents($image_path));
-$payload = array(&quot;image&quot; =&gt; $image_data); // Base64编码的文件内容或者图像URL
-
-// 调用API
-$ch = curl_init($API_URL);
-curl_setopt($ch, CURLOPT_POST, true);
-curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($payload));
-curl_setopt($ch, CURLOPT_HTTPHEADER, array('Content-Type: application/json'));
-curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
-$response = curl_exec($ch);
-curl_close($ch);
-
-// 处理接口返回数据
-$result = json_decode($response, true)[&quot;result&quot;];
-file_put_contents($ocr_image_path, base64_decode($result[&quot;ocrImage&quot;]));
-echo &quot;Output image saved at &quot; . $ocr_image_path . &quot;\n&quot;;
-
-file_put_contents($layout_image_path, base64_decode($result[&quot;layoutImage&quot;]));
-echo &quot;Output image saved at &quot; . $layout_image_path . &quot;\n&quot;;
-
-echo &quot;\nDetected tables:\n&quot;;
-print_r($result[&quot;tables&quot;]);
-
-?&gt;
+for i, res in enumerate(result[&quot;tableRecResults&quot;]):
+    print(&quot;Detected tables:&quot;)
+    print(res[&quot;tables&quot;])
+    layout_img_path = f&quot;layout_{i}.jpg&quot;
+    with open(layout_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;layoutImage&quot;]))
+    ocr_img_path = f&quot;ocr_{i}.jpg&quot;
+    with open(ocr_img_path, &quot;wb&quot;) as f:
+        f.write(base64.b64decode(res[&quot;ocrImage&quot;]))
+    print(f&quot;Output images saved at {layout_img_path} and {ocr_img_path}&quot;)
 </code></pre></details>
 </details>
 <br/>

+ 5 - 5
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.en.md

@@ -256,13 +256,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX enables users to achieve low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -328,7 +328,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -350,7 +350,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>The URL of a CSV file accessible by the service or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
+<td>The URL of a CSV file accessible by the server or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
 <td>Yes</td>
 </tr>
 </tbody>
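Unlike the OCR pipelines, the time-series pipelines take a single `csv` field rather than `file`/`fileType`; a minimal sketch (endpoint name assumed, CSV path is a placeholder):

```python
import base64
import requests

# Hypothetical endpoint name; adjust to the deployed service.
API_URL = "http://localhost:8080/time-series-anomaly-detection"

# "csv" carries a URL reachable by the server, or the Base64-encoded bytes
# of a UTF-8 CSV file.
with open("./series.csv", "rb") as f:
    payload = {"csv": base64.b64encode(f.read()).decode("ascii")}

response = requests.post(API_URL, json=payload)
assert response.status_code == 200
print(response.json()["result"])
```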

+ 3 - 3
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.md

@@ -259,9 +259,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -353,7 +353,7 @@ for res in output:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>服务可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
+<td>服务器可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
 <td>是</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.en.md

@@ -221,13 +221,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for deployment performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins that deeply optimize model inference and pre/post-processing to significantly speed up the end-to-end process. Refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md) for detailed high-performance inference procedures.
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX enables users to achieve low-cost service-oriented deployment of pipelines. Refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md) for detailed service-oriented deployment procedures.
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -293,7 +293,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service:</p>
+<p>Primary operations provided by the service:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -315,7 +315,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>The URL of a CSV file accessible by the service or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
+<td>The URL of a CSV file accessible by the server or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
 <td>Yes</td>
 </tr>
 </tbody>

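As a counterpart to the Base64 variant, the `csv` property can also carry a URL that the server fetches itself, per the field description above. A sketch under the same caveats (endpoint path and data URL are assumptions):

```python
import requests

# Hypothetical endpoint and CSV URL; the server must be able to reach the URL.
API_URL = "http://localhost:8080/time-series-classification"
payload = {"csv": "https://example.com/data/series.csv"}

# Passing a URL avoids Base64 encoding; the server downloads the file itself.
result = requests.post(API_URL, json=payload).json()["result"]
print(result)
```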
+ 3 - 3
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_classification.md

@@ -218,9 +218,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -312,7 +312,7 @@ for res in output:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>服务可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
+<td>服务器可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
 <td>是</td>
 </tr>
 </tbody>

+ 5 - 5
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.en.md

@@ -265,13 +265,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -337,7 +337,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -359,7 +359,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>The URL of a CSV file accessible by the service or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
+<td>The URL of a CSV file accessible by the server or the Base64 encoded result of the CSV file content. The CSV file must be encoded in UTF-8.</td>
 <td>Yes</td>
 </tr>
 </tbody>

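The response envelope documented above (`errorCode`, `errorMsg`, `result`) can be inspected directly on the client side. A sketch, again with an assumed host, port, and path:

```python
import base64

import requests

API_URL = "http://localhost:8080/time-series-forecasting"  # assumed endpoint

with open("./history.csv", "rb") as f:  # assumed file name
    payload = {"csv": base64.b64encode(f.read()).decode("ascii")}

body = requests.post(API_URL, json=payload).json()
# On success, errorCode is 0 and errorMsg is "Success"; the operation output
# sits in `result`, whose exact keys are pipeline-specific.
print(body["errorCode"], body["errorMsg"])
print(body["result"])
```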
+ 3 - 3
docs/pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.md

@@ -267,9 +267,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -361,7 +361,7 @@ for res in output:
 <tr>
 <td><code>csv</code></td>
 <td><code>string</code></td>
-<td>服务可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
+<td>服务器可访问的CSV文件的URL或CSV文件内容的Base64编码结果。CSV文件需要使用UTF-8编码。</td>
 <td>是</td>
 </tr>
 </tbody>

+ 6 - 6
docs/pipeline_usage/tutorials/video_pipelines/video_classification.en.md

@@ -225,15 +225,15 @@ If you need to apply the pipeline directly in your Python project, refer to the
 
 Additionally, PaddleX provides three other deployment methods, detailed as follows:
 
-🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
+🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -285,7 +285,7 @@ Below are the API references and multi-language service invocation examples:
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -307,7 +307,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>video</code></td>
 <td><code>string</code></td>
-<td>The URL of an video file accessible by the service or the Base64 encoded result of the video file content.</td>
+<td>The URL of a video file accessible by the server or the Base64 encoded result of the video file content.</td>
 <td>Yes</td>
 </tr>
 <tr>

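For the video pipelines the request shape is the same, with `video` in place of `csv`. A minimal sketch; the endpoint path and file name are assumptions:

```python
import base64

import requests

API_URL = "http://localhost:8080/video-classification"  # assumed endpoint

with open("./demo.mp4", "rb") as f:  # assumed local video file
    video_data = base64.b64encode(f.read()).decode("ascii")

# `video` accepts a server-accessible URL or the Base64-encoded file content.
response = requests.post(API_URL, json={"video": video_data})
print(response.json()["result"])
```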
+ 3 - 3
docs/pipeline_usage/tutorials/video_pipelines/video_classification.md

@@ -227,9 +227,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -307,7 +307,7 @@ for res in output:
 <tr>
 <td><code>video</code></td>
 <td><code>string</code></td>
-<td>服务可访问的视频文件的URL或视频文件内容的Base64编码结果。</td>
+<td>服务器可访问的视频文件的URL或视频文件内容的Base64编码结果。</td>
 <td>是</td>
 </tr>
 <tr>

+ 22 - 8
docs/pipeline_usage/tutorials/video_pipelines/video_detection.en.md

@@ -217,13 +217,13 @@ Additionally, PaddleX provides three other deployment methods, detailed as follo
 
 🚀 <b>High-Performance Inference</b>: In actual production environments, many applications have stringent standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, refer to the [PaddleX High-Performance Inference Guide](../../../pipeline_deploy/high_performance_inference.en.md).
 
-☁️ <b>Service-Oriented Deployment</b>: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, refer to the [PaddleX Service-Oriented Deployment Guide](../../../pipeline_deploy/service_deploy.en.md).
+☁️ <b>Serving</b>: Serving is a common deployment strategy in real-world production environments. By encapsulating inference functions into services, clients can access these services via network requests to obtain inference results. PaddleX supports various solutions for serving pipelines. For detailed pipeline serving procedures, please refer to the [PaddleX Pipeline Serving Guide](../../../pipeline_deploy/serving.en.md).
 
-Below are the API references and multi-language service invocation examples:
+Below are the API reference and multi-language service invocation examples for the basic serving solution:
 
 <details><summary>API Reference</summary>
 
-<p>For main operations provided by the service:</p>
+<p>For primary operations provided by the service:</p>
 <ul>
 <li>The HTTP request method is POST.</li>
 <li>The request body and the response body are both JSON data (JSON objects).</li>
@@ -239,6 +239,11 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
+<td><code>logId</code></td>
+<td><code>string</code></td>
+<td>UUID for the request.</td>
+</tr>
+<tr>
 <td><code>errorCode</code></td>
 <td><code>integer</code></td>
 <td>Error code. Fixed as <code>0</code>.</td>
@@ -246,11 +251,15 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>errorMsg</code></td>
 <td><code>string</code></td>
-<td>Error message. Fixed as <code>"Success"</code>.</td>
+<td>Error description. Fixed as <code>"Success"</code>.</td>
+</tr>
+<tr>
+<td><code>result</code></td>
+<td><code>object</code></td>
+<td>Operation result.</td>
 </tr>
 </tbody>
 </table>
-<p>The response body may also have a <code>result</code> property of type <code>object</code>, which stores the operation result information.</p>
 <ul>
 <li>When the request is not processed successfully, the response body properties are as follows:</li>
 </ul>
@@ -264,6 +273,11 @@ Below are the API references and multi-language service invocation examples:
 </thead>
 <tbody>
 <tr>
+<td><code>logId</code></td>
+<td><code>string</code></td>
+<td>UUID for the request.</td>
+</tr>
+<tr>
 <td><code>errorCode</code></td>
 <td><code>integer</code></td>
 <td>Error code. Same as the response status code.</td>
@@ -271,11 +285,11 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>errorMsg</code></td>
 <td><code>string</code></td>
-<td>Error message.</td>
+<td>Error description.</td>
 </tr>
 </tbody>
 </table>
-<p>Main operations provided by the service are as follows:</p>
+<p>Primary operations provided by the service are as follows:</p>
 <ul>
 <li><b><code>infer</code></b></li>
 </ul>
@@ -297,7 +311,7 @@ Below are the API references and multi-language service invocation examples:
 <tr>
 <td><code>video</code></td>
 <td><code>string</code></td>
-<td>The URL of an video file accessible by the service or the Base64 encoded result of the video file content.</td>
+<td>The URL of a video file accessible by the server or the Base64 encoded result of the video file content.</td>
 <td>Yes</td>
 </tr>
 <tr>

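The `logId` property added in this hunk is useful for correlating failed requests with server logs. A hedged error-handling sketch based on the success and failure schemas above (endpoint and video URL are assumptions):

```python
import requests

API_URL = "http://localhost:8080/video-detection"  # assumed endpoint

resp = requests.post(API_URL, json={"video": "https://example.com/demo.mp4"})
body = resp.json()

if resp.ok and body["errorCode"] == 0:
    print(f"request {body['logId']} succeeded")
    print(body["result"])
else:
    # On failure, errorCode mirrors the HTTP status code and errorMsg
    # describes the error; logId identifies the request in server logs.
    print(f"request {body.get('logId')} failed: {body['errorCode']} {body['errorMsg']}")
```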
+ 18 - 4
docs/pipeline_usage/tutorials/video_pipelines/video_detection.md

@@ -214,9 +214,9 @@ for res in output:
 
 🚀 <b>高性能推理</b>:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考[PaddleX高性能推理指南](../../../pipeline_deploy/high_performance_inference.md)。
 
-☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/service_deploy.md)。
+☁️ <b>服务化部署</b>:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持多种产线服务化部署方案,详细的产线服务化部署流程请参考[PaddleX服务化部署指南](../../../pipeline_deploy/serving.md)。
 
-下面是API参考和多语言服务调用示例:
+以下是基础服务化部署的API参考与多语言服务调用示例:
 
 <details><summary>API参考</summary>
 
@@ -236,6 +236,11 @@ for res in output:
 </thead>
 <tbody>
 <tr>
+<td><code>logId</code></td>
+<td><code>string</code></td>
+<td>请求的UUID。</td>
+</tr>
+<tr>
 <td><code>errorCode</code></td>
 <td><code>integer</code></td>
 <td>错误码。固定为<code>0</code>。</td>
@@ -245,9 +250,13 @@ for res in output:
 <td><code>string</code></td>
 <td>错误说明。固定为<code>"Success"</code>。</td>
 </tr>
+<tr>
+<td><code>result</code></td>
+<td><code>object</code></td>
+<td>操作结果。</td>
+</tr>
 </tbody>
 </table>
-<p>响应体还可能有<code>result</code>属性,类型为<code>object</code>,其中存储操作结果信息。</p>
 <ul>
 <li>当请求处理未成功时,响应体的属性如下:</li>
 </ul>
@@ -261,6 +270,11 @@ for res in output:
 </thead>
 <tbody>
 <tr>
+<td><code>logId</code></td>
+<td><code>string</code></td>
+<td>请求的UUID。</td>
+</tr>
+<tr>
 <td><code>errorCode</code></td>
 <td><code>integer</code></td>
 <td>错误码。与响应状态码相同。</td>
@@ -294,7 +308,7 @@ for res in output:
 <tr>
 <td><code>video</code></td>
 <td><code>string</code></td>
-<td>服务可访问的视频文件的URL或视频文件内容的Base64编码结果。</td>
+<td>服务器可访问的视频文件的URL或视频文件内容的Base64编码结果。</td>
 <td>是</td>
 </tr>
 <tr>

+ 1 - 1
docs/practical_tutorials/anomaly_detection_tutorial.en.md

@@ -264,7 +264,7 @@ For more parameters, please refer to [Anomaly Detection Pipeline Usage Tutorial]
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/anomaly_detection_tutorial.md

@@ -262,7 +262,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 2 - 2
docs/practical_tutorials/deployment_tutorial.en.md

@@ -9,7 +9,7 @@ Before using this tutorial, you need to install PaddleX. For installation instru
 The three deployment methods of PaddleX are detailed below:
 
 * High-Performance Inference: In actual production environments, many applications have stringent performance standards for deployment strategies, especially in terms of response speed, to ensure efficient system operation and smooth user experience. To this end, PaddleX provides a high-performance inference plugin aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end speedups. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment in pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving low-cost service-oriented deployment in pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method where computing and data processing functions are placed on the user's device itself, allowing the device to directly process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 This tutorial will introduce the three deployment methods of PaddleX through three practical application examples.
@@ -228,7 +228,7 @@ The command-line options related to service deployment are as follows:
 
 ### 2.3 Calling the Service
 
-Here, only the Python calling example is shown. For API references and service calling examples in other languages, please refer to the **"Calling the Service"** section in the **"Development Integration/Deployment"** part of each pipeline usage tutorial in the [PaddleX Serving Deployment Guide](../pipeline_deploy/service_deploy.md).
+Here, only the Python calling example is shown. For the API reference and service invocation examples in other languages, see the **"Development Integration/Deployment"** section of each pipeline usage tutorial linked from **"1.3 Calling the Service"** in the [PaddleX Serving Deployment Guide](../pipeline_deploy/serving.en.md).
 
 ```python
 import base64

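The Python example above is cut off by the diff context. For orientation only, such a client generally follows the same pattern as the other examples in these docs; the endpoint and request field below are purely hypothetical, so consult the pipeline's own tutorial for the real ones:

```python
import base64

import requests

# Hypothetical endpoint and request field; the real names are listed in each
# pipeline tutorial's "Development Integration/Deployment" section.
API_URL = "http://localhost:8080/ocr"

with open("./input.jpg", "rb") as f:
    payload = {"image": base64.b64encode(f.read()).decode("ascii")}

body = requests.post(API_URL, json=payload).json()
print(body["result"])
```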
+ 2 - 2
docs/practical_tutorials/deployment_tutorial.md

@@ -9,7 +9,7 @@ comments: true
 PaddleX 的三种部署方式详细说明如下:
 
 * 高性能推理:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能推理流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 本教程将举三个实际应用例子,来依次介绍 PaddleX 的三种部署方式。
@@ -256,7 +256,7 @@ INFO:     Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
 </table>
 ### 2.3 调用服务
 
-此处只展示 Python 调用示例,API参考和其他语言服务调用示例可参考 [PaddleX服务化部署指南](../pipeline_deploy/service_deploy.md) 的 <b>1.3 调用服务</b> 中各产线使用教程的 <b>“开发集成/部署”</b> 部分。
+此处只展示 Python 调用示例,API参考和其他语言服务调用示例可参考 [PaddleX服务化部署指南](../pipeline_deploy/serving.md) 的 <b>1.3 调用服务</b> 中各产线使用教程的 <b>“开发集成/部署”</b> 部分。
 
 ```python
 import base64

+ 1 - 1
docs/practical_tutorials/document_scene_information_extraction(layout_detection)_tutorial.en.md

@@ -542,7 +542,7 @@ For more parameters, please refer to the [Document Scene Information Extraction
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugin aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/document_scene_information_extraction(layout_detection)_tutorial.md

@@ -544,7 +544,7 @@ chat_result.print()
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能部署指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/document_scene_information_extraction(seal_recognition)_tutorial.en.md

@@ -473,7 +473,7 @@ For more parameters, please refer to the [Document Scene Information Extraction
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugin aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/document_scene_information_extraction(seal_recognition)_tutorial.md

@@ -463,7 +463,7 @@ chat_result.print()
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能部署指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/image_classification_garbage_tutorial.en.md

@@ -394,7 +394,7 @@ For more parameters, please refer to the [General Image Classification Pipeline
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugin aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/image_classification_garbage_tutorial.md

@@ -391,7 +391,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/instance_segmentation_remote_sensing_tutorial.en.md

@@ -357,7 +357,7 @@ For more parameters, please refer to the [General Instance Segmentation Pipline
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/instance_segmentation_remote_sensing_tutorial.md

@@ -353,7 +353,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/object_detection_fall_tutorial.en.md

@@ -365,7 +365,7 @@ For more parameters, please refer to [General Object Detection Pipeline Usage Tu
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/object_detection_fall_tutorial.md

@@ -364,7 +364,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/object_detection_fashion_pedia_tutorial.en.md

@@ -374,7 +374,7 @@ For more parameters, please refer to [General Object Detection Pipeline Usage Tu
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/object_detection_fashion_pedia_tutorial.md

@@ -372,7 +372,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/ocr_det_license_tutorial.en.md

@@ -326,7 +326,7 @@ For more parameters, please refer to the [General OCR Pipeline Usage Tutorial](.
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ocr_det_license_tutorial.md

@@ -324,7 +324,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/ocr_rec_chinese_tutorial.en.md

@@ -337,7 +337,7 @@ For more parameters, please refer to the [General OCR Pipeline Usage Tutorial](.
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ocr_rec_chinese_tutorial.md

@@ -334,7 +334,7 @@ for res in output:
 2. 此外,PaddleX 也提供了其他三种部署方式,详细说明如下:
 
 * 高性能部署:在实际生产环境中,许多应用对部署策略的性能指标(尤其是响应速度)有着较严苛的标准,以确保系统的高效运行与用户体验的流畅性。为此,PaddleX 提供高性能推理插件,旨在对模型推理及前后处理进行深度性能优化,实现端到端流程的显著提速,详细的高性能部署流程请参考 [PaddleX 高性能推理指南](../pipeline_deploy/high_performance_inference.md)。
-* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/service_deploy.md)。
+* 服务化部署:服务化部署是实际生产环境中常见的一种部署形式。通过将推理功能封装为服务,客户端可以通过网络请求来访问这些服务,以获取推理结果。PaddleX 支持用户以低成本实现产线的服务化部署,详细的服务化部署流程请参考 [PaddleX 服务化部署指南](../pipeline_deploy/serving.md)。
 * 端侧部署:端侧部署是一种将计算和数据处理功能放在用户设备本身上的方式,设备可以直接处理数据,而不需要依赖远程的服务器。PaddleX 支持将模型部署在 Android 等端侧设备上,详细的端侧部署流程请参考 [PaddleX端侧部署指南](../pipeline_deploy/edge_deploy.md)。
 
 您可以根据需要选择合适的方式部署模型产线,进而进行后续的 AI 应用集成。

+ 1 - 1
docs/practical_tutorials/semantic_segmentation_road_tutorial.en.md

@@ -346,7 +346,7 @@ For more parameters, please refer to [General Semantic Segmentation Pipeline Usa
 2. Additionally, PaddleX offers three other deployment methods, detailed as follows:
 
 * high-performance inference: In actual production environments, many applications have stringent standards for deployment strategy performance metrics (especially response speed) to ensure efficient system operation and smooth user experience. To this end, PaddleX provides high-performance inference plugins aimed at deeply optimizing model inference and pre/post-processing for significant end-to-end process acceleration. For detailed high-performance inference procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
-* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of production lines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+* Service-Oriented Deployment: Service-oriented deployment is a common deployment form in actual production environments. By encapsulating inference functions as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving cost-effective service-oriented deployment of pipelines. For detailed service-oriented deployment procedures, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 * Edge Deployment: Edge deployment is a method that places computing and data processing capabilities directly on user devices, allowing devices to process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can select the appropriate deployment method for your model pipeline according to your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/semantic_segmentation_road_tutorial.md

@@ -344,7 +344,7 @@ for res in output:
 2. In addition, PaddleX also provides three other deployment methods, detailed as follows:

 * High-performance deployment: In real production environments, many applications impose strict requirements on the performance metrics of a deployment strategy (especially response latency) to keep the system running efficiently and the user experience smooth. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing, delivering a significant end-to-end speedup. For the detailed high-performance deployment procedure, see the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.md).
-* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/serving.md).
 * Edge deployment: Edge deployment places computing and data processing on the user's device itself, so the device can process data directly without relying on a remote server. PaddleX supports deploying models on edge devices such as Android; for the detailed edge deployment procedure, see the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.md).

 You can choose an appropriate deployment method for your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 3 - 3
docs/practical_tutorials/small_object_detection_tutorial.en.md

@@ -296,8 +296,8 @@ For more parameters, please refer to [Small Object Detection Pipeline Usage Tuto
 
 2. In addition, PaddleX also provides three other deployment methods, detailed as follows:
 
-* High-performance deployment: In actual production environments, many applications have strict standards for the performance indicators of deployment strategies (especially response speed) to ensure the efficient operation of the system and the smooth user experience. For this reason, PaddleX provides a high-performance inference plugin, aiming to deeply optimize the performance of model inference and pre/post-processing, achieving significant acceleration of the end-to-end process. For detailed high-performance deployment processes, please refer to [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.md).
-* Service deployment: Service deployment is a common deployment form in actual production environments. By encapsulating inference functions into services, clients can access these services through network requests to obtain inference results. PaddleX supports users to achieve service deployment of the pipeline at a low cost. For detailed service deployment processes, please refer to [PaddleX Service Deployment Guide](../pipeline_deploy/service_deploy.md).
-* Edge deployment: Edge deployment is a way of placing computing and data processing functions on user devices themselves, where devices can directly process data without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment processes, please refer to [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.md).
+* High-performance deployment: In actual production environments, many applications have strict standards for the performance metrics of deployment strategies (especially response speed) to ensure efficient system operation and a smooth user experience. For this reason, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing, significantly accelerating the end-to-end process. For detailed high-performance deployment procedures, please refer to the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.en.md).
+* Service deployment: Service deployment is a common deployment form in actual production environments. By encapsulating inference functions into services, clients can access these services through network requests to obtain inference results. PaddleX enables users to deploy pipelines as services at low cost. For detailed service deployment procedures, please refer to the [PaddleX Service Deployment Guide](../pipeline_deploy/serving.en.md).
+* Edge deployment: Edge deployment places computing and data processing on the user devices themselves, so devices can process data directly without relying on remote servers. PaddleX supports deploying models on edge devices such as Android. For detailed edge deployment procedures, please refer to the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.en.md).
 
 You can choose an appropriate method to deploy the model pipeline according to your needs and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/small_object_detection_tutorial.md

@@ -304,7 +304,7 @@ for res in output:
 2. In addition, PaddleX also provides three other deployment methods, detailed as follows:

 * High-performance deployment: In real production environments, many applications impose strict requirements on the performance metrics of a deployment strategy (especially response latency) to keep the system running efficiently and the user experience smooth. To this end, PaddleX provides a high-performance inference plugin that deeply optimizes model inference and pre/post-processing, delivering a significant end-to-end speedup. For the detailed high-performance deployment procedure, see the [PaddleX High-Performance Inference Guide](../pipeline_deploy/high_performance_inference.md).
-* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/serving.md).
 * Edge deployment: Edge deployment places computing and data processing on the user's device itself, so the device can process data directly without relying on a remote server. PaddleX supports deploying models on edge devices such as Android; for the detailed edge deployment procedure, see the [PaddleX Edge Deployment Guide](../pipeline_deploy/edge_deploy.md).

 You can choose an appropriate deployment method for your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_anomaly_detection.en.md

@@ -343,5 +343,5 @@ For more parameters, please refer to the [Time Series Anomaly Detection Pipeline
 
 2. Additionally, PaddleX's time series anomaly detection pipeline also offers a service-oriented deployment method, detailed as follows:
 
-Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 You can choose the appropriate method to deploy your model pipeline based on your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_anomaly_detection.md

@@ -340,5 +340,5 @@ for res in output:
 For more parameters, see the [Time Series Anomaly Detection Pipeline Usage Tutorial](../pipeline_usage/tutorials/time_series_pipelines/time_series_anomaly_detection.md).

 2. In addition, the PaddleX time series anomaly detection pipeline also offers a serving deployment option, detailed as follows:
-* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/serving.md).
 You can choose an appropriate deployment method for your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_classification.en.md

@@ -309,5 +309,5 @@ For more parameters, please refer to the [Time Series Classification Pipeline Us
 
 2. Additionally, PaddleX's time series classification pipeline also offers a service-oriented deployment method, detailed as follows:
 
-Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 You can choose the appropriate method to deploy your model pipeline based on your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_classification.md

@@ -303,5 +303,5 @@ for res in output:
 For more parameters, see the [Time Series Classification Pipeline Usage Tutorial](../pipeline_usage/tutorials/time_series_pipelines/time_series_classification.md).

 2. In addition, the PaddleX time series classification pipeline also offers a serving deployment option, detailed as follows:
-* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/serving.md).
 You can choose an appropriate deployment method for your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_forecast.en.md

@@ -489,5 +489,5 @@ For more parameters, please refer to the [Time Series forecast Pipeline Usage Tu
 
 2. Additionally, PaddleX's time series forecast pipeline also offers a service-oriented deployment method, detailed as follows:
 
-Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PPaddleX Service-Oriented Deployment Guide](../pipeline_deploy/service_deploy.en.md).
+Service-Oriented Deployment: This is a common deployment form in actual production environments. By encapsulating the inference functionality as services, clients can access these services through network requests to obtain inference results. PaddleX supports users in achieving service-oriented deployment of pipelines at low cost. For detailed instructions on service-oriented deployment, please refer to the [PaddleX Service-Oriented Deployment Guide](../pipeline_deploy/serving.en.md).
 You can choose the appropriate method to deploy your model pipeline based on your needs, and proceed with subsequent AI application integration.

+ 1 - 1
docs/practical_tutorials/ts_forecast.md

@@ -463,5 +463,5 @@ for res in output:
 For more parameters, see the [Time Series Forecasting Pipeline Usage Tutorial](../pipeline_usage/tutorials/time_series_pipelines/time_series_forecasting.md).

 2. In addition, the PaddleX time series forecasting pipeline also offers a serving deployment option, detailed as follows:
-* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/service_deploy.md).
+* Serving: Serving is a common deployment form in real production environments. By encapsulating inference as a service, clients can access the service through network requests to obtain inference results. PaddleX enables low-cost serving of pipelines; for the detailed serving procedure, see the [PaddleX Serving Guide](../pipeline_deploy/serving.md).
 You can choose an appropriate deployment method for your model pipeline as needed, and then proceed with subsequent AI application integration.

+ 3 - 3
mkdocs.yml

@@ -161,8 +161,8 @@ plugins:
             PaddleX时序任务模型配置文件参数说明: PaddleX Time Series Module Config Parameters
             模型产线部署: Pipeline Deploy
             高性能推理: High Performance Inference
-            服务化部署: Service Deploy
-            端侧部署: Edge Deploy
+            服务化部署: Serving
+            端侧部署: Edge Deployment
             多硬件使用: Multi-Device Usage
             多硬件使用指南: Multi-Device Usage Guide
             飞桨多硬件安装: PaddlePaddle Installation on Multiple Devices
@@ -367,7 +367,7 @@ nav:
          - PaddleX时序任务模型配置文件参数说明: module_usage/instructions/config_parameters_time_series.md
   - 模型产线部署:
        - 高性能推理: pipeline_deploy/high_performance_inference.md
-       - 服务化部署: pipeline_deploy/service_deploy.md
+       - 服务化部署: pipeline_deploy/serving.md
        - 端侧部署: pipeline_deploy/edge_deploy.md
   - 多硬件使用:
        - 多硬件使用指南: other_devices_support/multi_devices_use_guide.md