TVM Model Quantization
[RFC] Search-based Automated Quantization
- I proposed a new quantization framework that brings hardware and learning methods into the loop.
- Borrowing ideas from existing quantization frameworks, I chose to adopt the annotation-calibration-realization three-phase design:
- Annotation: The annotation pass rewrites the graph and inserts simulated quantize operations according to the rewrite function of each operator. A simulated quantize operation simulates the rounding error and saturation error of quantizing from float to integer.
- Calibration: The calibration pass adjusts the thresholds of the simulated quantize operations to reduce the accuracy drop.
- Realization: The realization pass transforms the simulation graph, which still actually computes in float32, into a real low-precision integer graph.
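The simulated quantize operation described in the annotation phase can be sketched as follows. This is a minimal numpy illustration of the idea (fake quantization: round and saturate as int8 would, but keep the result in float32), not TVM's actual implementation; the function name, the symmetric threshold scheme, and the 8-bit default are assumptions for the sake of the example:

```python
import numpy as np

def simulated_quantize(x, threshold, nbit=8):
    """Sketch of a simulated quantize op: applies the rounding and
    saturation errors of nbit integer quantization while keeping the
    data in float32 (illustrative only, not TVM's implementation)."""
    qmax = 2 ** (nbit - 1) - 1            # 127 for int8
    scale = threshold / qmax              # map [-threshold, threshold] onto the int range
    q = np.round(x / scale)               # rounding error introduced here
    q = np.clip(q, -qmax - 1, qmax)       # saturation error introduced here
    return (q * scale).astype(np.float32) # dequantize back to float32

x = np.array([0.1, -0.5, 2.0, -3.0], dtype=np.float32)
y = simulated_quantize(x, threshold=1.0)
# values beyond the threshold (2.0, -3.0) are saturated near +/-1.0
```

The calibration phase then searches over `threshold` per layer: a smaller threshold gives finer resolution inside the range but more saturation outside it, and the trade-off is what calibration tunes to minimize the accuracy drop.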
References:
https://www.twblogs.net/a/5eedc7fee3ae0757d21ab20e/?lang=zh-cn
https://discuss.tvm.apache.org/t/int8-quantization-quantizing-models/517/4
https://discuss.tvm.apache.org/t/int8-quantization-proposal/516
https://discuss.tvm.apache.org/t/rfc-improvements-to-automatic-quantization-for-bare-metal/7108
https://discuss.tvm.apache.org/t/quantization-pytorch-dynamic-quantization/10294
https://discuss.tvm.apache.org/t/rfc-quantization-quantization-in-tvm/9161