Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection

Yin, Penghang; Zhang, Shuai; Qi, Yingyong; Xin, Jack

Computer Science > Machine Learning

arXiv:1612.06052 (cs)

[Submitted on 19 Dec 2016 (v1), last revised 17 Aug 2017 (this version, v2)]

Title:Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection

Authors:Penghang Yin, Shuai Zhang, Yingyong Qi, Jack Xin

View PDF

Abstract:We present LBW-Net, an efficient optimization based method for quantization and training of the low bit-width convolutional neural networks (CNNs). Specifically, we quantize the weights to zero or powers of two by minimizing the Euclidean distance between full-precision weights and quantized weights during backpropagation. We characterize the combinatorial nature of the low bit-width quantization problem. For 2-bit (ternary) CNNs, the quantization of $N$ weights can be done by an exact formula in $O(N\log N)$ complexity. When the bit-width is three and above, we further propose a semi-analytical thresholding scheme with a single free parameter for quantization that is computationally inexpensive. The free parameter is further determined by network retraining and object detection tests. LBW-Net has several desirable advantages over full-precision CNNs, including considerable memory savings, energy efficiency, and faster deployment. Our experiments on PASCAL VOC dataset show that compared with its 32-bit floating-point counterpart, the performance of the 6-bit LBW-Net is nearly lossless in the object detection tasks, and can even do better in some real world visual scenes, while empirically enjoying more than 4$\times$ faster deployment.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1612.06052 [cs.LG]
	(or arXiv:1612.06052v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1612.06052

Submission history

From: Penghang Yin [view email]
[v1] Mon, 19 Dec 2016 05:54:18 UTC (22 KB)
[v2] Thu, 17 Aug 2017 06:56:17 UTC (5,373 KB)

Computer Science > Machine Learning

Title:Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators