Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices

Jie Zhang; Xiaolong Wang; Dawei Li; Yalin Wang

doi:10.24963/ijcai.2018/429

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Scopus citations

Abstract

Recurrent neural networks (RNNs) achieve cutting-edge performance on a variety of problems. However, due to their high computational and memory demands, deploying RNNs on resource-constrained mobile devices is a challenging task. To guarantee minimum accuracy loss at a higher compression rate, and driven by mobile resource requirements, we introduce a novel model compression approach, DirNet, based on an optimized fast dictionary learning algorithm, which 1) dynamically mines the dictionary atoms of the projection dictionary matrix within each layer to adjust the compression rate and 2) adaptively changes the sparsity of the sparse codes across the hierarchical layers. Experimental results on a language model and an ASR model trained with a 1000h speech dataset demonstrate that our method significantly outperforms prior approaches. In evaluations on off-the-shelf mobile devices, we reduce the size of the original model by a factor of eight while retaining real-time model inference and negligible accuracy loss.
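
As a rough illustration of why the number of dictionary atoms and the sparsity of the codes both act as compression knobs, the sketch below counts stored values when a weight matrix W is approximated as a dense dictionary D times a sparse code matrix C. This is generic bookkeeping for a sparse dictionary factorization, not the DirNet algorithm itself, which this record does not describe. The function names, the storage model (one value plus one index per nonzero), and the layer shape are all assumptions made for the example and are not taken from the paper.

```python
def dense_params(rows: int, cols: int) -> int:
    """Number of stored values in an uncompressed rows x cols weight matrix."""
    return rows * cols


def factored_params(rows: int, cols: int, atoms: int, nonzeros_per_col: int) -> int:
    """Stored values when W (rows x cols) is approximated as D @ C.

    D is a dense rows x atoms dictionary. C is an atoms x cols sparse code
    matrix with a fixed number of nonzeros per column. Each nonzero is
    counted twice: once for its value and once for its row index
    (an assumed storage model, chosen for simplicity).
    """
    if not 0 < nonzeros_per_col <= atoms:
        raise ValueError("nonzeros_per_col must be between 1 and atoms")
    dictionary = rows * atoms
    codes = 2 * nonzeros_per_col * cols
    return dictionary + codes


def compression_ratio(rows: int, cols: int, atoms: int, nonzeros_per_col: int) -> float:
    """Ratio of dense storage to factored storage (larger means smaller model)."""
    return dense_params(rows, cols) / factored_params(rows, cols, atoms, nonzeros_per_col)


if __name__ == "__main__":
    # Hypothetical layer shape and settings, for illustration only.
    for atoms, nnz in [(256, 32), (128, 16), (64, 8)]:
        ratio = compression_ratio(1024, 4096, atoms, nnz)
        print(f"atoms={atoms:4d}  nonzeros/col={nnz:3d}  ratio={ratio:.2f}x")
```

Under this storage model, using fewer atoms shrinks the dictionary and using fewer nonzeros per column shrinks the codes, so a per-layer choice of both values sets that layer's compression rate. The accuracy cost of each choice is not modeled here.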

Original language	English (US)
Title of host publication	Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018
Editors	Jerome Lang
Publisher	International Joint Conferences on Artificial Intelligence
Pages	3089-3096
Number of pages	8
ISBN (Electronic)	9780999241127
DOIs	https://doi.org/10.24963/ijcai.2018/429
State	Published - 2018
Event	27th International Joint Conference on Artificial Intelligence, IJCAI 2018 - Stockholm, Sweden Duration: Jul 13 2018 → Jul 19 2018

Publication series

Name	IJCAI International Joint Conference on Artificial Intelligence
Volume	2018-July
ISSN (Print)	1045-0823

Other

Other	27th International Joint Conference on Artificial Intelligence, IJCAI 2018
Country/Territory	Sweden
City	Stockholm
Period	7/13/18 → 7/19/18

ASJC Scopus subject areas

Artificial Intelligence

Access to Document

10.24963/ijcai.2018/429

Cite this

Zhang, J., Wang, X., Li, D., & Wang, Y. (2018). Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices. In J. Lang (Ed.), Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018 (pp. 3089-3096). (IJCAI International Joint Conference on Artificial Intelligence; Vol. 2018-July). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/429

Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices. / Zhang, Jie; Wang, Xiaolong; Li, Dawei et al.
Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. ed. / Jerome Lang. International Joint Conferences on Artificial Intelligence, 2018. p. 3089-3096 (IJCAI International Joint Conference on Artificial Intelligence; Vol. 2018-July).

Zhang, J, Wang, X, Li, D & Wang, Y 2018, Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices. in J Lang (ed.), Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. IJCAI International Joint Conference on Artificial Intelligence, vol. 2018-July, International Joint Conferences on Artificial Intelligence, pp. 3089-3096, 27th International Joint Conference on Artificial Intelligence, IJCAI 2018, Stockholm, Sweden, 7/13/18. https://doi.org/10.24963/ijcai.2018/429

Zhang J, Wang X, Li D, Wang Y. Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices. In Lang J, editor, Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. International Joint Conferences on Artificial Intelligence. 2018. p. 3089-3096. (IJCAI International Joint Conference on Artificial Intelligence). doi: 10.24963/ijcai.2018/429

Zhang, Jie ; Wang, Xiaolong ; Li, Dawei et al. / Dynamically hierarchy revolution : DirNet for compressing recurrent neural network on mobile devices. Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018. editor / Jerome Lang. International Joint Conferences on Artificial Intelligence, 2018. pp. 3089-3096 (IJCAI International Joint Conference on Artificial Intelligence).

@inproceedings{1ac8174f0b7b474a9bdee0df87059afd,

title = "Dynamically hierarchy revolution: DirNet for compressing recurrent neural network on mobile devices",

abstract = "Recurrent neural networks (RNNs) achieve cutting-edge performance on a variety of problems. However, due to their high computational and memory demands, deploying RNNs on resource-constrained mobile devices is a challenging task. To guarantee minimum accuracy loss at a higher compression rate, and driven by mobile resource requirements, we introduce a novel model compression approach, DirNet, based on an optimized fast dictionary learning algorithm, which 1) dynamically mines the dictionary atoms of the projection dictionary matrix within each layer to adjust the compression rate and 2) adaptively changes the sparsity of the sparse codes across the hierarchical layers. Experimental results on a language model and an ASR model trained with a 1000h speech dataset demonstrate that our method significantly outperforms prior approaches. In evaluations on off-the-shelf mobile devices, we reduce the size of the original model by a factor of eight while retaining real-time model inference and negligible accuracy loss.",

author = "Jie Zhang and Xiaolong Wang and Dawei Li and Yalin Wang",

note = "Publisher Copyright: {\textcopyright} 2018 International Joint Conferences on Artificial Intelligence. All rights reserved.; 27th International Joint Conference on Artificial Intelligence, IJCAI 2018 ; Conference date: 13-07-2018 Through 19-07-2018",

year = "2018",

doi = "10.24963/ijcai.2018/429",

language = "English (US)",

series = "IJCAI International Joint Conference on Artificial Intelligence",

publisher = "International Joint Conferences on Artificial Intelligence",

pages = "3089--3096",

editor = "Jerome Lang",

booktitle = "Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018",

}

TY - GEN

T1 - Dynamically hierarchy revolution

T2 - 27th International Joint Conference on Artificial Intelligence, IJCAI 2018

AU - Zhang, Jie

AU - Wang, Xiaolong

AU - Li, Dawei

AU - Wang, Yalin

PY - 2018

Y1 - 2018

AB - Recurrent neural networks (RNNs) achieve cutting-edge performance on a variety of problems. However, due to their high computational and memory demands, deploying RNNs on resource-constrained mobile devices is a challenging task. To guarantee minimum accuracy loss at a higher compression rate, and driven by mobile resource requirements, we introduce a novel model compression approach, DirNet, based on an optimized fast dictionary learning algorithm, which 1) dynamically mines the dictionary atoms of the projection dictionary matrix within each layer to adjust the compression rate and 2) adaptively changes the sparsity of the sparse codes across the hierarchical layers. Experimental results on a language model and an ASR model trained with a 1000h speech dataset demonstrate that our method significantly outperforms prior approaches. In evaluations on off-the-shelf mobile devices, we reduce the size of the original model by a factor of eight while retaining real-time model inference and negligible accuracy loss.

UR - http://www.scopus.com/inward/record.url?scp=85055705509&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055705509&partnerID=8YFLogxK

U2 - 10.24963/ijcai.2018/429

DO - 10.24963/ijcai.2018/429

M3 - Conference contribution

AN - SCOPUS:85055705509

T3 - IJCAI International Joint Conference on Artificial Intelligence

SP - 3089

EP - 3096

BT - Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018

A2 - Lang, Jerome

PB - International Joint Conferences on Artificial Intelligence

Y2 - 13 July 2018 through 19 July 2018

ER -
