Before using Python scripts for single model quick inference, please ensure you have completed the installation of PaddleX following the PaddleX Local Installation Tutorial.
Taking the image classification model as an example, the usage is as follows:
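```python
from paddlex import create_model

# Instantiate the official image classification model by name.
model = create_model(model_name="PP-LCNet_x1_0")

# predict() returns a generator that yields one result per input sample.
output = model.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_image_classification_001.jpg", batch_size=1)

for res in output:
    res.print(json_format=False)           # print the result
    res.save_to_img("./output/")           # save the visualization image
    res.save_to_json("./output/res.json")  # save the result as JSON
```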
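The same calls can also be wrapped in a small reusable function. The sketch below is one possible arrangement and is not part of PaddleX: run_quick_inference and its output_dir parameter are hypothetical names, and it assumes that one JSON file per result is wanted so that later results do not overwrite earlier ones. PaddleX is imported inside the function so that the module can be loaded on machines where it is not installed.

```python
import os


def run_quick_inference(inputs, model_name="PP-LCNet_x1_0", output_dir="./output/"):
    """Run single-model inference and save one JSON file per result.

    Hypothetical helper for illustration; only create_model(), predict(),
    print(), save_to_img() and save_to_json() come from the example above.
    """
    # Imported lazily so this module loads even without PaddleX installed.
    from paddlex import create_model

    os.makedirs(output_dir, exist_ok=True)
    model = create_model(model_name=model_name)

    saved_json_paths = []
    # predict() yields the prediction result of one sample per iteration.
    for index, res in enumerate(model.predict(inputs, batch_size=1)):
        res.print(json_format=False)
        res.save_to_img(output_dir)
        # Assumption: a distinct file name per sample avoids overwriting.
        json_path = os.path.join(output_dir, f"res_{index}.json")
        res.save_to_json(json_path)
        saved_json_paths.append(json_path)
    return saved_json_paths
```

Calling run_quick_inference with a list of image paths or URLs would then return the list of JSON files written, in input order.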
In short, just three steps:

- Call the create_model() method to instantiate the prediction model object;
- Call the predict() method of the prediction model object to perform inference;
- Call print(), save_to_xxx(), and other related methods to print or save the prediction results.

create_model() Method

- create_model: Instantiate the prediction model object;
  - Parameters:
    - model_name: str type, model name, such as "PP-LCNet_x1_0", "/path/to/PP-LCNet_x1_0_infer/";
    - model_dir: str type, local path to the directory of inference model files, such as "/path/to/PP-LCNet_x1_0_infer/". Defaults to None, which means the official model specified by model_name is used;
    - batch_size: int type, defaults to 1;
    - device: str type, used to set the inference device, such as "cpu", or "gpu:2" to select a specific GPU. By default, GPU 0 is used if available; otherwise the CPU is used;
    - pp_option: PaddlePredictorOption type, used to set the inference engine. Refer to 4-Inference Backend Configuration for more details;
    - inference hyperparameters: used to set common inference hyperparameters. Refer to the specific model description document for details.
  - Return value: BasePredictor type.

predict() Method of the Prediction Model Object

- predict: Use the defined prediction model to predict the input data;
  - Parameters:
    - input: Any type. Supports a str giving the path of the file to be predicted, a directory containing files to be predicted, or a network URL; for CV models, a numpy.ndarray representing image data; for TS models, a pandas.DataFrame; and a list composed of the above types;
  - Return value: a generator. Iterate over it with for-in or next(); each call returns the prediction result of one sample.

The prediction results can be accessed, visualized, and saved through the corresponding attributes and methods, as follows:
Attributes:

- str: The prediction result represented as a str;
  - Return value: str type, the string representation of the prediction result.
- json: The prediction result in JSON format;
  - Return value: dict type.
- img: The visualization image of the prediction result. Available only when the results support visual representation;
  - Return value: PIL.Image type.
- html: The HTML representation of the prediction result. Available only when the results support representation in HTML format;
  - Return value: str type.
- more attrs: The prediction results of different models support different representations. Refer to the specific model tutorial documentation for details.

Methods:

- print(): Outputs the prediction result. Note that when the prediction result is not convenient to output directly, the relevant content is omitted;
  - Parameters:
    - json_format: bool type, default is False, indicating that JSON formatting is not used;
    - indent: int type, default is 4, valid when json_format is True, indicating the indentation level for JSON formatting;
    - ensure_ascii: bool type, default is False, valid when json_format is True;
- save_to_json(): Saves the prediction result as a JSON file. Note that when the prediction result contains data that cannot be serialized to JSON, the data is converted automatically so that it can be serialized and saved;
  - Parameters:
    - save_path: str type, the path to save the result;
    - indent: int type, default is 4, valid when json_format is True, indicating the indentation level for JSON formatting;
    - ensure_ascii: bool type, default is False, valid when json_format is True;
- save_to_img(): Visualizes the prediction result and saves it as an image. Available only when the results support representation as images;
  - Parameters:
    - save_path: str type, the path to save the result.
- save_to_csv(): Saves the prediction result as a CSV file. Available only when the results support representation in CSV format;
  - Parameters:
    - save_path: str type, the path to save the result.
- save_to_html(): Saves the prediction result as an HTML file. Available only when the results support representation in HTML format;
  - Parameters:
    - save_path: str type, the path to save the result.
- save_to_xlsx(): Saves the prediction result as an XLSX file. Available only when the results support representation in XLSX format;
  - Parameters:
    - save_path: str type, the path to save the result.

PaddleX supports configuring the inference backend through PaddlePredictorOption. The relevant APIs are as follows:
Attributes:

- device: Inference device;
  - Set as a str. Device types include 'gpu', 'cpu', 'npu', 'xpu', 'mlu', 'dcu'. When using an accelerator card, you can specify the card number, e.g., 'gpu:0' for GPU 0. By default, GPU 0 is used if available; otherwise the CPU is used;
  - Return value: str type, the currently set inference device.
- run_mode: Inference backend;
  - Set as a str. Options include 'paddle', 'trt_fp32', 'trt_fp16', 'trt_int8', 'mkldnn', 'mkldnn_bf16'. 'mkldnn' can be selected only when the inference device is 'cpu'. The default is 'paddle';
  - Return value: str type, the currently set inference backend.
- cpu_threads: Number of CPU threads for the acceleration library, only valid when the inference device is 'cpu';
  - Set as an int: the number of CPU threads the acceleration library uses during CPU inference;
  - Return value: int type, the currently set number of threads for the acceleration library.

Methods:

- get_support_run_mode: Get the supported inference backend configurations;
- get_support_device: Get the supported device types;
- get_device: Get the currently set device;
  - Return value: str type.
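The constraints listed above (device format, allowed run_mode values, and the CPU-only settings) can be checked before they are assigned. The following is a minimal sketch under stated assumptions: check_backend_settings and apply_backend_settings are hypothetical helpers, not PaddleX APIs, and the only thing assumed about the option object is that it exposes the settable device, run_mode, and cpu_threads attributes described above. Rejecting cpu_threads on a non-CPU device is a design choice of this sketch; the library itself may simply ignore the value.

```python
SUPPORTED_DEVICE_TYPES = ("gpu", "cpu", "npu", "xpu", "mlu", "dcu")
SUPPORTED_RUN_MODES = (
    "paddle", "trt_fp32", "trt_fp16", "trt_int8", "mkldnn", "mkldnn_bf16",
)


def check_backend_settings(device, run_mode="paddle", cpu_threads=None):
    """Raise ValueError if the settings violate the documented rules."""
    # A device string is either "type" or "type:card_number".
    device_type, sep, card = device.partition(":")
    if device_type not in SUPPORTED_DEVICE_TYPES:
        raise ValueError(f"unsupported device type: {device_type!r}")
    if sep and not card.isdigit():
        raise ValueError(f"invalid card number in device: {device!r}")
    if run_mode not in SUPPORTED_RUN_MODES:
        raise ValueError(f"unsupported run_mode: {run_mode!r}")
    # Only the 'mkldnn' restriction is stated in the text above.
    if run_mode == "mkldnn" and device_type != "cpu":
        raise ValueError("'mkldnn' can be selected only when the device is 'cpu'")
    # Sketch-level choice: treat a CPU-only setting on another device as an error.
    if cpu_threads is not None and device_type != "cpu":
        raise ValueError("cpu_threads is only valid when the device is 'cpu'")


def apply_backend_settings(option, device, run_mode="paddle", cpu_threads=None):
    """Validate the settings, then assign them to an option-like object."""
    check_backend_settings(device, run_mode, cpu_threads)
    option.device = device
    option.run_mode = run_mode
    if cpu_threads is not None:
        option.cpu_threads = cpu_threads
    return option
```

With a PaddlePredictorOption instance as the option argument, a call such as apply_backend_settings(option, "cpu", run_mode="mkldnn", cpu_threads=4) would set all three attributes, while combining "gpu:0" with "mkldnn" would raise ValueError before anything is assigned.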
```