Are existing knowledge transfer techniques effective for deep learning with edge devices?

Ragini Sharma; Saman Biookaghazadeh; Baoxin Li; Ming Zhao

doi:10.1109/EDGE.2018.00013

Are existing knowledge transfer techniques effective for deep learning with edge devices?

Ragini Sharma, Saman Biookaghazadeh, Baoxin Li, Ming Zhao

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

38 Scopus citations

Abstract

With the emergence of edge computing paradigm, many applications such as image recognition and augmented reality require to perform machine learning (ML) and artificial intelligence (AI) tasks on edge devices. Most AI and ML models are large and computational-heavy, whereas edge devices are usually equipped with limited computational and storage resources. Such models can be compressed and reduced for deployment on edge devices, but they may lose their capability and not perform well. Recent works used knowledge transfer techniques to transfer information from a large network (termed teacher) to a small one (termed student) in order to improve the performance of the latter. This approach seems to be promising for learning on edge devices, but a thorough investigation on its effectiveness is lacking. This paper provides an extensive study on the performance (in both accuracy and convergence speed) of knowledge transfer, considering different student architectures and different techniques for transferring knowledge from teacher to student. The results show that the performance of KT does vary by architectures and transfer techniques. A good performance improvement is obtained by transferring knowledge from both the intermediate layers and last layer of the teacher to a shallower student. But other architectures and transfer techniques do not fare so well and some of them even lead to negative performance impact.

Original language	English (US)
Title of host publication	Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	42-49
Number of pages	8
ISBN (Electronic)	9781538672389
DOIs	https://doi.org/10.1109/EDGE.2018.00013
State	Published - Sep 26 2018
Event	2018 IEEE International Conference on Edge Computing, EDGE 2018 - San Francisco, United States Duration: Jul 2 2018 → Jul 7 2018

Publication series

Name	Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services

Other

Other	2018 IEEE International Conference on Edge Computing, EDGE 2018
Country/Territory	United States
City	San Francisco
Period	7/2/18 → 7/7/18

Keywords

Cloud computing
Deep neural networks
Edge computing
Knowledge transfer

ASJC Scopus subject areas

Computer Networks and Communications
Computer Science Applications
Hardware and Architecture
Control and Optimization

Access to Document

10.1109/EDGE.2018.00013

Cite this

Sharma, R., Biookaghazadeh, S., Li, B., & Zhao, M. (2018). Are existing knowledge transfer techniques effective for deep learning with edge devices? In Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services (pp. 42-49). Article 8473375 (Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/EDGE.2018.00013

Are existing knowledge transfer techniques effective for deep learning with edge devices? / Sharma, Ragini; Biookaghazadeh, Saman; Li, Baoxin et al.
Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services. Institute of Electrical and Electronics Engineers Inc., 2018. p. 42-49 8473375 (Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Sharma, R, Biookaghazadeh, S, Li, B & Zhao, M 2018, Are existing knowledge transfer techniques effective for deep learning with edge devices? in Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services., 8473375, Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services, Institute of Electrical and Electronics Engineers Inc., pp. 42-49, 2018 IEEE International Conference on Edge Computing, EDGE 2018, San Francisco, United States, 7/2/18. https://doi.org/10.1109/EDGE.2018.00013

Sharma R, Biookaghazadeh S, Li B , Zhao M. Are existing knowledge transfer techniques effective for deep learning with edge devices? In Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services. Institute of Electrical and Electronics Engineers Inc. 2018. p. 42-49. 8473375. (Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services). doi: 10.1109/EDGE.2018.00013

Sharma, Ragini ; Biookaghazadeh, Saman ; Li, Baoxin et al. / Are existing knowledge transfer techniques effective for deep learning with edge devices?. Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 42-49 (Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services).

@inproceedings{a658085d32184764b0ff46f130262d69,

title = "Are existing knowledge transfer techniques effective for deep learning with edge devices?",

abstract = "With the emergence of edge computing paradigm, many applications such as image recognition and augmented reality require to perform machine learning (ML) and artificial intelligence (AI) tasks on edge devices. Most AI and ML models are large and computational-heavy, whereas edge devices are usually equipped with limited computational and storage resources. Such models can be compressed and reduced for deployment on edge devices, but they may lose their capability and not perform well. Recent works used knowledge transfer techniques to transfer information from a large network (termed teacher) to a small one (termed student) in order to improve the performance of the latter. This approach seems to be promising for learning on edge devices, but a thorough investigation on its effectiveness is lacking. This paper provides an extensive study on the performance (in both accuracy and convergence speed) of knowledge transfer, considering different student architectures and different techniques for transferring knowledge from teacher to student. The results show that the performance of KT does vary by architectures and transfer techniques. A good performance improvement is obtained by transferring knowledge from both the intermediate layers and last layer of the teacher to a shallower student. But other architectures and transfer techniques do not fare so well and some of them even lead to negative performance impact.",

keywords = "Cloud computing, Deep neural networks, Edge computing, Knowledge transfer",

author = "Ragini Sharma and Saman Biookaghazadeh and Baoxin Li and Ming Zhao",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 2018 IEEE International Conference on Edge Computing, EDGE 2018 ; Conference date: 02-07-2018 Through 07-07-2018",

year = "2018",

month = sep,

day = "26",

doi = "10.1109/EDGE.2018.00013",

language = "English (US)",

series = "Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "42--49",

booktitle = "Proceedings - 2018 IEEE International Conference on Edge Computing, EDGE 2018 - Part of the 2018 IEEE World Congress on Services",