Distributionally robust edge learning with Dirichlet process prior

Zhaofeng Zhang; Yue Chen; Junshan Zhang

doi:10.1109/ICDCS47774.2020.00016

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

In order to meet the real-time performance requirements, intelligent decisions in many IoT applications must take place right here right now at the network edge. The conventional cloud-based learning approach would not be able to keep up with the demands in achieving edge intelligence in these applications. Nevertheless, pushing the artificial intelligence (AI) frontier to achieve edge intelligence is highly nontrivial due to the constrained computing resources and limited training data at the network edge. To tackle these challenges, we develop a distributionally robust optimization (DRO)-based edge learning algorithm, where the uncertainty model is constructed to foster the synergy of cloud knowledge transfer and local training. Specifically, the knowledge transferred from the cloud is in the form of a Dirichlet process prior distribution for the edge model parameters, and the edge device further constructs an uncertainty set centered around the empirical distribution of its local samples to capture the information of local data processing. The edge learning DRO problem, subject to the above two distributional uncertainty constraints, is then recast as an equivalent single-layer optimization problem using a duality approach. We then use an Expectation-Maximization (EM) algorithm-inspired method to derive a convex relaxation, based on which we devise algorithms to learn the edge model parameters. Finally, extensive experiments are implemented to showcase the performance gain over standard learning approaches using local edge data only.
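As an illustrative sketch only, and not the authors' algorithm, the two ingredients named in the abstract can be shown in miniature. The first function draws truncated stick-breaking weights, a standard construction of a Dirichlet process prior. The second computes an upper bound on the worst-case expected loss over a 1-Wasserstein ball centered on the empirical distribution of local samples, which is one well-known way a distributionally robust problem collapses to a single-level objective. The linear model, the absolute loss, the Euclidean transport cost, and every function and parameter name here are assumptions chosen for the example; none of them comes from the paper.

```python
import math
import random


def stick_breaking_weights(alpha, num_sticks, rng):
    """Truncated stick-breaking weights of a Dirichlet process DP(alpha, H)."""
    weights = []
    remaining = 1.0  # length of the stick not yet assigned
    for _ in range(num_sticks):
        frac = rng.betavariate(1.0, alpha)  # fraction broken off the remainder
        weights.append(remaining * frac)
        remaining *= 1.0 - frac
    return weights


def empirical_abs_loss(w, b, samples):
    """Mean absolute error of the linear model y ~ w.x + b on local samples."""
    total = 0.0
    for x, y in samples:
        pred = sum(wi * xi for wi, xi in zip(w, x)) + b
        total += abs(y - pred)
    return total / len(samples)


def robust_loss_bound(w, b, samples, radius):
    """Upper bound on the worst-case expected loss over a 1-Wasserstein ball.

    The absolute loss is Lipschitz in (x, y) with constant ||(w, -1)||_2
    under the Euclidean metric, so Kantorovich-Rubinstein duality gives
    worst case <= empirical loss + radius * Lipschitz constant.
    """
    lipschitz = math.sqrt(sum(wi * wi for wi in w) + 1.0)
    return empirical_abs_loss(w, b, samples) + radius * lipschitz


# Hypothetical toy data: three local samples with two features each.
rng = random.Random(0)
weights = stick_breaking_weights(alpha=2.0, num_sticks=50, rng=rng)
samples = [((1.0, 0.0), 1.0), ((0.0, 1.0), 2.0), ((1.0, 1.0), 3.5)]
bound = robust_loss_bound((1.0, 2.0), 0.0, samples, radius=0.1)
print(sum(weights), bound)
```

A larger radius makes the bound more conservative, while a radius of zero recovers the plain empirical loss. In a setting like the one the abstract describes, the prior would presumably carry the cloud knowledge and the ball would reflect how far the true distribution may lie from the few local samples.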

Original language	English (US)
Title of host publication	Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	798-808
Number of pages	11
ISBN (Electronic)	9781728170022
DOIs	https://doi.org/10.1109/ICDCS47774.2020.00016
State	Published - Nov 2020
Event	40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020 - Singapore, Singapore; Duration: Nov 29 2020 → Dec 1 2020

Publication series

Name	Proceedings - International Conference on Distributed Computing Systems
Volume	2020-November

Conference

Conference	40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020
Country/Territory	Singapore
City	Singapore
Period	11/29/20 → 12/1/20

Keywords

Dirichlet process
Distributionally robust optimization
Edge learning
Wasserstein distance

ASJC Scopus subject areas

Software
Hardware and Architecture
Computer Networks and Communications

Access to Document

10.1109/ICDCS47774.2020.00016

Cite this

Zhang, Z., Chen, Y., & Zhang, J. (2020). Distributionally robust edge learning with Dirichlet process prior. In Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020 (pp. 798-808). Article 9355582 (Proceedings - International Conference on Distributed Computing Systems; Vol. 2020-November). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICDCS47774.2020.00016

Distributionally robust edge learning with Dirichlet process prior. / Zhang, Zhaofeng; Chen, Yue; Zhang, Junshan.
Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020. Institute of Electrical and Electronics Engineers Inc., 2020. p. 798-808 9355582 (Proceedings - International Conference on Distributed Computing Systems; Vol. 2020-November).

Zhang, Z, Chen, Y & Zhang, J 2020, Distributionally robust edge learning with Dirichlet process prior. in Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020., 9355582, Proceedings - International Conference on Distributed Computing Systems, vol. 2020-November, Institute of Electrical and Electronics Engineers Inc., pp. 798-808, 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020, Singapore, Singapore, 11/29/20. https://doi.org/10.1109/ICDCS47774.2020.00016

Zhang Z, Chen Y, Zhang J. Distributionally robust edge learning with Dirichlet process prior. In Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020. Institute of Electrical and Electronics Engineers Inc. 2020. p. 798-808. 9355582. (Proceedings - International Conference on Distributed Computing Systems). doi: 10.1109/ICDCS47774.2020.00016

@inproceedings{ce9fab7afdda4e83a4e7561309881b69,

title = "Distributionally robust edge learning with Dirichlet process prior",

keywords = "Dirichlet process, Distributionally robust optimization, Edge learning, Wasserstein distance",

author = "Zhaofeng Zhang and Yue Chen and Junshan Zhang",

note = "Funding Information: This work is supported in part by NSF under Grant CPS-1739344, ARO under grant W911NF-16-1-0448, and the DTRA under Grant HDTRA1-13-1-0029. Publisher Copyright: {\textcopyright} 2020 IEEE; 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020 ; Conference date: 29-11-2020 Through 01-12-2020",

year = "2020",

month = nov,

doi = "10.1109/ICDCS47774.2020.00016",

language = "English (US)",

series = "Proceedings - International Conference on Distributed Computing Systems",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "798--808",

booktitle = "Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020",

}

TY - GEN

T1 - Distributionally robust edge learning with Dirichlet process prior

AU - Zhang, Zhaofeng

AU - Chen, Yue

AU - Zhang, Junshan

PY - 2020/11

Y1 - 2020/11

KW - Dirichlet process

KW - Distributionally robust optimization

KW - Edge learning

KW - Wasserstein distance

UR - http://www.scopus.com/inward/record.url?scp=85101986492&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85101986492&partnerID=8YFLogxK

U2 - 10.1109/ICDCS47774.2020.00016

DO - 10.1109/ICDCS47774.2020.00016

M3 - Conference contribution

AN - SCOPUS:85101986492

T3 - Proceedings - International Conference on Distributed Computing Systems

SP - 798

EP - 808

BT - Proceedings - 2020 IEEE 40th International Conference on Distributed Computing Systems, ICDCS 2020

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 40th IEEE International Conference on Distributed Computing Systems, ICDCS 2020

Y2 - 29 November 2020 through 1 December 2020

ER -
