Paddle Inference部署推理（十五）

-Max-静-

已于 2024-11-25 21:47:23 修改

阅读量353

点赞数 1

CC 4.0 BY-SA版权

分类专栏： PaddleInference推理部署文章标签： paddle 推理部署人工智能深度学习

于 2024-11-25 21:42:38 首次发布

本文链接：https://blog.csdn.net/weixin_46319994/article/details/144041714

PaddleInference推理部署专栏收录该内容

19 篇文章

订阅专栏

十五：Paddle Inference推理（python）API详解

枚举类型

DataType

DataType 定义了 Tensor 的数据类型，由传入 Tensor 的 numpy 数组类型确定。

# DataType 枚举定义
class paddle.inference.DataType:

# 获取各个 DataType 对应的字节数
# 参数：dtype - DataType 枚举
# 输出：dtype 对应的字节数
paddle.inference.get_num_bytes_of_data_type(dtype: DataType)

DataType 中包括以下成员:

INT64: 64位整型
INT32: 32位整型
FLOAT64: 64位浮点型
FLOAT32: 32位浮点型
FLOAT16: 16位浮点型
UINT8: 无符号8位整型
INT8: 8位整型
BOOL: 布尔型
代码示例：

# 引用 paddle inference 预测库
import paddle.inference as paddle_infer

# 创建 FLOAT32 类型 DataType
data_type = paddle_infer.DataType.FLOAT32

# 输出 data_type 的字节数 - 4
paddle_infer.get_num_bytes_of_data_type(data_type)

PrecisionType

PrecisionType设置模型的运行精度。枚举变量定义如下：

# PrecisionType 枚举定义
class paddle.inference.PrecisionType

PrecisionType 中包括以下成员:

Float32: FP32 模式运行
Half: FP16 模式运行
Bf16: BF16 模式运行
Int8: INT8 模式运行

代码示例：

# 引用 paddle inference 预测库
import paddle.inference as paddle_infer

# 创建 config
config = paddle_infer.Config("./mobilenet_v1.pdmodel", "./mobilenet_v1.pdiparams")

# 启用 GPU FP16 推理, 初始化 100 MB 显存，使用 gpu id 为 0
config.enable_use_gpu(100, 0, precision_mode = paddle_infer.PrecisionType.Half)

# 开启 TensorRT 预测，精度为 FP32，开启 INT8 离线量化校准
config.enable_tensorrt_engine(precision_mode = paddle_infer.PrecisionType.Float32,
                              use_calib_mode = True)

PlaceType

PlaceType 为目标设备硬件类型，用户可以根据应用场景选择硬件平台类型。枚举变量定义如下：

# PrecisionType 枚举定义
class paddle.inference.PlaceType

PlaceType 中包括以下成员:

UNK: unknown
CPU
GPU
XPU
NPU
IPU
CUSTOM