基于 Triton Inference Server 的算法服务

TM 2.0

已于 2024-07-01 09:44:56 修改

阅读量601

点赞数 5

CC 4.0 BY-SA版权

文章标签：人工智能 linux ubuntu

于 2024-06-28 18:00:58 首次发布

本文链接：https://blog.csdn.net/qq_42693842/article/details/140048507

如何将算法部署在 Triton Inference Server

基于Python后端的基础模型 (基础示例)

编写配置 config.pbtxt

以目标检测为例
定义输入输出: 参数名, 参数类型, 参数维度

name: "object_detect" # 模型名称, 与当前目录文件名一致
backend: "python" # 推理后端类型
max_batch_size: 1 # 最大批次
input [
  {
   
   
    name: "image" 
    data_type: TYPE_UINT8
    dims: [-1,-1,3 ] # -1代表动态大小
  },
  {
   
   
    name: "score" 
    data_type: TYPE_FP32
    dims: [1]
    optional: true # optional 为 true 时, 该参数为可选参数, 默认为 false
  }
]
output [
  {
   
   
    name: "labels"
    data_type: TYPE_STRING
    dims: [-1,-1]
  },
  {
   
   
    name: "classes"
    data_type: TYPE_UINT16
    dims: [-1]
  },
  {
   
   
    name: "scores"
    data_type: TYPE_FP32
    dims: [ -1 ]
  },
  {
   
   
    name: "bboxes"
    data_type: TYPE_UINT32
    dims: [-1, 4 ]
  }
]