Optimize deep convolutional neural network with ternarized weights and high accuracy

Zhezhi He; Boqing Gong; Deliang Fan

doi:10.1109/WACV.2019.00102

Optimize deep convolutional neural network with ternarized weights and high accuracy

Zhezhi He, Boqing Gong, Deliang Fan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

13 Scopus citations

Abstract

Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16× model compression rate, our ternarized ResNet-32/44/56 could outperforms full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR-10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which both achieve the best top-1 accuracy compared to recent similar works, with the same 16× compression rate. If further incorporating our residual expansion method, compared to the full-precision counterpart, our ternarized ResNet-18 even improves the top-5 accuracy by 0.61% and merely degrades the top-1 accuracy only by 0.42% for ImageNet dataset, with 8× model compression rate. It outperforms the recent ABC-Net by 1.03% in top-1 accuracy and 1.78% in top-5 accuracy, with around 1.25× higher compression rate and more than 6× computation reduction due to the weight sparsity.

Original language	English (US)
Title of host publication	Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	913-921
Number of pages	9
ISBN (Electronic)	9781728119755
DOIs	https://doi.org/10.1109/WACV.2019.00102
State	Published - Mar 4 2019
Externally published	Yes
Event	19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019 - Waikoloa Village, United States Duration: Jan 7 2019 → Jan 11 2019

Publication series

Name	Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

Conference

Conference	19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019
Country/Territory	United States
City	Waikoloa Village
Period	1/7/19 → 1/11/19

ASJC Scopus subject areas

Computer Vision and Pattern Recognition
Computer Science Applications

Access to Document

10.1109/WACV.2019.00102

Cite this

He, Z., Gong, B., & Fan, D. (2019). Optimize deep convolutional neural network with ternarized weights and high accuracy. In Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019 (pp. 913-921). Article 8658565 (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WACV.2019.00102

Optimize deep convolutional neural network with ternarized weights and high accuracy. / He, Zhezhi; Gong, Boqing; Fan, Deliang.
Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 913-921 8658565 (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

He, Z, Gong, B & Fan, D 2019, Optimize deep convolutional neural network with ternarized weights and high accuracy. in Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019., 8658565, Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Institute of Electrical and Electronics Engineers Inc., pp. 913-921, 19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Waikoloa Village, United States, 1/7/19. https://doi.org/10.1109/WACV.2019.00102

He Z, Gong B, Fan D. Optimize deep convolutional neural network with ternarized weights and high accuracy. In Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 913-921. 8658565. (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019). doi: 10.1109/WACV.2019.00102

He, Zhezhi ; Gong, Boqing ; Fan, Deliang. / Optimize deep convolutional neural network with ternarized weights and high accuracy. Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 913-921 (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019).

@inproceedings{479097fb5ed5478f93c9dbb939f6a97b,

title = "Optimize deep convolutional neural network with ternarized weights and high accuracy",

abstract = "Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16× model compression rate, our ternarized ResNet-32/44/56 could outperforms full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR-10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which both achieve the best top-1 accuracy compared to recent similar works, with the same 16× compression rate. If further incorporating our residual expansion method, compared to the full-precision counterpart, our ternarized ResNet-18 even improves the top-5 accuracy by 0.61% and merely degrades the top-1 accuracy only by 0.42% for ImageNet dataset, with 8× model compression rate. It outperforms the recent ABC-Net by 1.03% in top-1 accuracy and 1.78% in top-5 accuracy, with around 1.25× higher compression rate and more than 6× computation reduction due to the weight sparsity.",

author = "Zhezhi He and Boqing Gong and Deliang Fan",

note = "Funding Information: Acknowledgement This work is supported in part by the National Science Foundation under Grant No. 1740126 and Semiconductor Research Corporation nCORE. Publisher Copyright: {\textcopyright} 2019 IEEE; 19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019 ; Conference date: 07-01-2019 Through 11-01-2019",

year = "2019",

month = mar,

day = "4",

doi = "10.1109/WACV.2019.00102",

language = "English (US)",

series = "Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "913--921",

booktitle = "Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019",

}

TY - GEN

T1 - Optimize deep convolutional neural network with ternarized weights and high accuracy

AU - He, Zhezhi

AU - Gong, Boqing

AU - Fan, Deliang

N1 - Funding Information: Acknowledgement This work is supported in part by the National Science Foundation under Grant No. 1740126 and Semiconductor Research Corporation nCORE. Publisher Copyright: © 2019 IEEE

PY - 2019/3/4

Y1 - 2019/3/4

N2 - Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16× model compression rate, our ternarized ResNet-32/44/56 could outperforms full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR-10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which both achieve the best top-1 accuracy compared to recent similar works, with the same 16× compression rate. If further incorporating our residual expansion method, compared to the full-precision counterpart, our ternarized ResNet-18 even improves the top-5 accuracy by 0.61% and merely degrades the top-1 accuracy only by 0.42% for ImageNet dataset, with 8× model compression rate. It outperforms the recent ABC-Net by 1.03% in top-1 accuracy and 1.78% in top-5 accuracy, with around 1.25× higher compression rate and more than 6× computation reduction due to the weight sparsity.

AB - Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm in the low power and resource limited embedded systems. As the countermeasure to this problem, in this work, we propose statistical weight scaling and residual expansion methods to reduce the bit-width of the whole network weight parameters to ternary values (i.e. -1, 0, +1), with the objectives to greatly reduce model size, computation cost and accuracy degradation caused by the model compression. With about 16× model compression rate, our ternarized ResNet-32/44/56 could outperforms full-precision counterparts by 0.12%, 0.24% and 0.18% on CIFAR-10 dataset. We also test our ternarization method with AlexNet and ResNet-18 on ImageNet dataset, which both achieve the best top-1 accuracy compared to recent similar works, with the same 16× compression rate. If further incorporating our residual expansion method, compared to the full-precision counterpart, our ternarized ResNet-18 even improves the top-5 accuracy by 0.61% and merely degrades the top-1 accuracy only by 0.42% for ImageNet dataset, with 8× model compression rate. It outperforms the recent ABC-Net by 1.03% in top-1 accuracy and 1.78% in top-5 accuracy, with around 1.25× higher compression rate and more than 6× computation reduction due to the weight sparsity.

UR - http://www.scopus.com/inward/record.url?scp=85063567668&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063567668&partnerID=8YFLogxK

U2 - 10.1109/WACV.2019.00102

DO - 10.1109/WACV.2019.00102

M3 - Conference contribution

AN - SCOPUS:85063567668

T3 - Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

SP - 913

EP - 921

BT - Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019

Y2 - 7 January 2019 through 11 January 2019

ER -

Optimize deep convolutional neural network with ternarized weights and high accuracy

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this