pytorch量化库使用（2）

Arthur.AI

于 2023-06-29 17:56:13 发布

阅读量1.5k

点赞数 1

CC 4.0 BY-SA版权

分类专栏：高性能计算与嵌入式AI 文章标签： pytorch 人工智能 python

本文链接：https://blog.csdn.net/qq_34106574/article/details/131460576

高性能计算与嵌入式AI 专栏收录该内容

40 篇文章 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

FX Graph Mode量化模式

训练后量化有多种量化类型（仅权重、动态和静态），配置通过qconfig_mapping （ prepare_fx函数的参数）完成。

FXPTQ API 示例：

import torch
from torch.ao.quantization import (
  get_default_qconfig_mapping,
  get_default_qat_qconfig_mapping,
  QConfigMapping,
)
import torch.ao.quantization.quantize_fx as quantize_fx
import copy

model_fp = UserModel()

#
# post training dynamic/weight_only quantization
#

# we need to deepcopy if we still want to keep model_fp unchanged after quantization since quantization apis change the input model
model_