WSNet: Compact and efficient networks through weight sampling

Xiaojie Jin; Yingzhen Yang; Ning Xu; Jianchao Yang; Nebojsa Jojic; Jiashi Feng; Shuicheng Yan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We present a new approach and a novel architecture, termed WSNet, for learning compact and efficient deep neural networks. Existing approaches conventionally learn full model parameters independently and then compress them via ad hoc processing such as model pruning or filter factorization. Alternatively, WSNet proposes learning model parameters by sampling from a compact set of learnable parameters, which naturally enforces parameter sharing throughout the learning process. We demonstrate that such a novel weight sampling approach (and the induced WSNet) favorably promotes both weight and computation sharing. By employing this method, we can more efficiently learn much smaller networks with competitive performance compared to baseline networks with equal numbers of convolution filters. Specifically, we consider learning compact and efficient 1D convolutional neural networks for audio classification. Extensive experiments on multiple audio classification datasets verify the effectiveness of WSNet. Combined with weight quantization, the resulting models are up to 180× smaller and theoretically up to 16× faster than the well-established baselines, without a noticeable performance drop.
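The idea of sampling filters from a compact set of shared parameters can be illustrated with a small sketch. The abstract does not spell out the sampling procedure, so the overlapping-window scheme below, along with the names `sample_filters`, `conv1d`, `filter_len`, and `stride`, is an assumption made for illustration and should not be read as the authors' implementation.

```python
# Illustrative sketch only. Overlapping-window sampling is an ASSUMPTION;
# the abstract does not specify the exact scheme. All names are hypothetical.

def sample_filters(compact, filter_len, stride):
    """Build filters as overlapping windows over one shared parameter list."""
    n_filters = (len(compact) - filter_len) // stride + 1
    return [compact[i * stride : i * stride + filter_len]
            for i in range(n_filters)]

def conv1d(signal, kernel):
    """Valid 1D cross-correlation of a signal with a single kernel."""
    k = len(kernel)
    return [sum(signal[t + j] * kernel[j] for j in range(k))
            for t in range(len(signal) - k + 1)]

# Six shared learnable parameters stand in for the compact set.
compact = [0.5, -1.0, 2.0, 0.0, 1.5, -0.5]

# Three filters of length 4, each a window into the same six parameters.
filters = sample_filters(compact, filter_len=4, stride=1)

signal = [1.0, 2.0, 3.0, 4.0, 5.0]
outputs = [conv1d(signal, f) for f in filters]

dense_params = len(filters) * 4   # weights if each filter were stored separately
shared_params = len(compact)      # weights actually stored
```

In this toy setting the three filters would need 12 weights if stored independently but only 6 when sampled from the shared list, which hints at how parameter sharing could shrink a model. Because neighboring filters overlap, some of their products with the input coincide, which is one plausible route to the computation sharing the abstract mentions.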

Original language	English (US)
Title of host publication	35th International Conference on Machine Learning, ICML 2018
Editors	Jennifer Dy, Andreas Krause
Publisher	International Machine Learning Society (IMLS)
Pages	3683-3696
Number of pages	14
ISBN (Electronic)	9781510867963
State	Published - 2018
Externally published	Yes
Event	35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sweden
Duration	Jul 10 2018 → Jul 15 2018

Publication series

Name	35th International Conference on Machine Learning, ICML 2018
Volume	5

Conference

Conference	35th International Conference on Machine Learning, ICML 2018
Country/Territory	Sweden
City	Stockholm
Period	7/10/18 → 7/15/18

ASJC Scopus subject areas

Computational Theory and Mathematics
Human-Computer Interaction
Software

Cite this

WSNet: Compact and efficient networks through weight sampling. / Jin, Xiaojie; Yang, Yingzhen; Xu, Ning et al.
35th International Conference on Machine Learning, ICML 2018. ed. / Jennifer Dy; Andreas Krause. International Machine Learning Society (IMLS), 2018. p. 3683-3696 (35th International Conference on Machine Learning, ICML 2018; Vol. 5).

Jin, X, Yang, Y, Xu, N, Yang, J, Jojic, N, Feng, J & Yan, S 2018, WSNet: Compact and efficient networks through weight sampling. in J Dy & A Krause (eds), 35th International Conference on Machine Learning, ICML 2018. 35th International Conference on Machine Learning, ICML 2018, vol. 5, International Machine Learning Society (IMLS), pp. 3683-3696, 35th International Conference on Machine Learning, ICML 2018, Stockholm, Sweden, 7/10/18.

@inproceedings{2eb5ab14d320402f981dec0c30c1436a,

title = "WSNet: Compact and efficient networks through weight sampling",

abstract = "We present a new approach and a novel architecture, termed WSNet, for learning compact and efficient deep neural networks. Existing approaches conventionally learn full model parameters independently and then compress them via ad hoc processing such as model pruning or filter factorization. Alternatively, WSNet proposes learning model parameters by sampling from a compact set of learnable parameters, which naturally enforces parameter sharing throughout the learning process. We demonstrate that such a novel weight sampling approach (and induced WSNet) promotes both weights and computation sharing favorably. By employing this method, we can more efficiently learn much smaller networks with competitive performance compared to baseline networks with equal numbers of convolution filters. Specifically, we consider learning compact and efficient 1D convolutional neural networks for audio classification. Extensive experiments on multiple audio classification datasets verify the effectiveness of WSNet. Combined with weight quantization, the resulted models are up to 180× smaller and theoretically up to 16× faster than the well-established baselines, without noticeable performance drop.",

author = "Xiaojie Jin and Yingzhen Yang and Ning Xu and Jianchao Yang and Nebojsa Jojic and Jiashi Feng and Shuicheng Yan",

note = "Funding Information: Jiashi Feng was partially supported by NUS startup R-263-000-C08-133, MOE Tier-I R-263-000-C21-112, NUS IDS R-263-000-C67-646, ECRA R-263-000-C87-133 and MOE Tier-II R-263-000-D17-112. Publisher Copyright: {\textcopyright} 2018 by authors. All rights reserved.; 35th International Conference on Machine Learning, ICML 2018; Conference date: 10-07-2018 through 15-07-2018",

year = "2018",

language = "English (US)",

series = "35th International Conference on Machine Learning, ICML 2018",

publisher = "International Machine Learning Society (IMLS)",

pages = "3683--3696",

editor = "Jennifer Dy and Andreas Krause",

booktitle = "35th International Conference on Machine Learning, ICML 2018",

}

TY - GEN

T1 - WSNet: Compact and efficient networks through weight sampling

T2 - 35th International Conference on Machine Learning, ICML 2018

AU - Jin, Xiaojie

AU - Yang, Yingzhen

AU - Xu, Ning

AU - Yang, Jianchao

AU - Jojic, Nebojsa

AU - Feng, Jiashi

AU - Yan, Shuicheng

N1 - Funding Information: Jiashi Feng was partially supported by NUS startup R-263-000-C08-133, MOE Tier-I R-263-000-C21-112, NUS IDS R-263-000-C67-646, ECRA R-263-000-C87-133 and MOE Tier-II R-263-000-D17-112. Publisher Copyright: © 2018 by authors. All rights reserved.

PY - 2018

Y1 - 2018

AB - We present a new approach and a novel architecture, termed WSNet, for learning compact and efficient deep neural networks. Existing approaches conventionally learn full model parameters independently and then compress them via ad hoc processing such as model pruning or filter factorization. Alternatively, WSNet proposes learning model parameters by sampling from a compact set of learnable parameters, which naturally enforces parameter sharing throughout the learning process. We demonstrate that such a novel weight sampling approach (and induced WSNet) promotes both weights and computation sharing favorably. By employing this method, we can more efficiently learn much smaller networks with competitive performance compared to baseline networks with equal numbers of convolution filters. Specifically, we consider learning compact and efficient 1D convolutional neural networks for audio classification. Extensive experiments on multiple audio classification datasets verify the effectiveness of WSNet. Combined with weight quantization, the resulted models are up to 180× smaller and theoretically up to 16× faster than the well-established baselines, without noticeable performance drop.

UR - http://www.scopus.com/inward/record.url?scp=85057242887&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057242887&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85057242887

T3 - 35th International Conference on Machine Learning, ICML 2018

SP - 3683

EP - 3696

BT - 35th International Conference on Machine Learning, ICML 2018

A2 - Dy, Jennifer

A2 - Krause, Andreas

PB - International Machine Learning Society (IMLS)

Y2 - 10 July 2018 through 15 July 2018

ER -
