NoSync: Particle swarm inspired distributed DNN training

Mihailo Isakov; Michel A. Kinsy

doi:10.1007/978-3-030-01421-6_58

NoSync: Particle swarm inspired distributed DNN training

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Training deep neural networks on big datasets remains a computational challenge. It can take hundreds of hours to perform and requires distributed computing systems to accelerate. Common distributed data-parallel approaches share a single model across multiple workers, train on different batches, aggregate gradients, and redistribute the new model. In this work, we propose NoSync, a particle swarm optimization inspired alternative where each worker trains a separate model, and applies pressure forcing models to converge. NoSync explores a greater portion of the parameter space and provides resilience to overfitting. It consistently offers higher accuracy compared to single workers, offers a linear speedup for smaller clusters, and is orthogonal to existing data-parallel approaches.

Original language	English (US)
Title of host publication	Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings
Editors	Yannis Manolopoulos, Barbara Hammer, Ilias Maglogiannis, Vera Kurkova, Lazaros Iliadis
Publisher	Springer Verlag
Pages	607-619
Number of pages	13
ISBN (Print)	9783030014209
DOIs	https://doi.org/10.1007/978-3-030-01421-6_58
State	Published - 2018
Externally published	Yes
Event	27th International Conference on Artificial Neural Networks, ICANN 2018 - Rhodes, Greece Duration: Oct 4 2018 → Oct 7 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11140 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	27th International Conference on Artificial Neural Networks, ICANN 2018
Country/Territory	Greece
City	Rhodes
Period	10/4/18 → 10/7/18

Keywords

Artificial neural network
Deep learning
Distributed systems
Evolutionary algorithm
Particle swarm optimization

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-030-01421-6_58

Cite this

Isakov, M., & Kinsy, M. A. (2018). NoSync: Particle swarm inspired distributed DNN training. In Y. Manolopoulos, B. Hammer, I. Maglogiannis, V. Kurkova, & L. Iliadis (Eds.), Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings (pp. 607-619). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11140 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-01421-6_58

NoSync: Particle swarm inspired distributed DNN training. / Isakov, Mihailo; Kinsy, Michel A.
Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings. ed. / Yannis Manolopoulos; Barbara Hammer; Ilias Maglogiannis; Vera Kurkova; Lazaros Iliadis. Springer Verlag, 2018. p. 607-619 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11140 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Isakov, M & Kinsy, MA 2018, NoSync: Particle swarm inspired distributed DNN training. in Y Manolopoulos, B Hammer, I Maglogiannis, V Kurkova & L Iliadis (eds), Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11140 LNCS, Springer Verlag, pp. 607-619, 27th International Conference on Artificial Neural Networks, ICANN 2018, Rhodes, Greece, 10/4/18. https://doi.org/10.1007/978-3-030-01421-6_58

Isakov M, Kinsy MA. NoSync: Particle swarm inspired distributed DNN training. In Manolopoulos Y, Hammer B, Maglogiannis I, Kurkova V, Iliadis L, editors, Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings. Springer Verlag. 2018. p. 607-619. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-01421-6_58

Isakov, Mihailo ; Kinsy, Michel A. / NoSync : Particle swarm inspired distributed DNN training. Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings. editor / Yannis Manolopoulos ; Barbara Hammer ; Ilias Maglogiannis ; Vera Kurkova ; Lazaros Iliadis. Springer Verlag, 2018. pp. 607-619 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{3f895ed02ef04d9ab2a8da1e21e01110,

title = "NoSync: Particle swarm inspired distributed DNN training",

abstract = "Training deep neural networks on big datasets remains a computational challenge. It can take hundreds of hours to perform and requires distributed computing systems to accelerate. Common distributed data-parallel approaches share a single model across multiple workers, train on different batches, aggregate gradients, and redistribute the new model. In this work, we propose NoSync, a particle swarm optimization inspired alternative where each worker trains a separate model, and applies pressure forcing models to converge. NoSync explores a greater portion of the parameter space and provides resilience to overfitting. It consistently offers higher accuracy compared to single workers, offers a linear speedup for smaller clusters, and is orthogonal to existing data-parallel approaches.",

keywords = "Artificial neural network, Deep learning, Distributed systems, Evolutionary algorithm, Particle swarm optimization",

author = "Mihailo Isakov and Kinsy, {Michel A.}",

note = "Publisher Copyright: {\textcopyright} Springer Nature Switzerland AG 2018.; 27th International Conference on Artificial Neural Networks, ICANN 2018 ; Conference date: 04-10-2018 Through 07-10-2018",

year = "2018",

doi = "10.1007/978-3-030-01421-6_58",

language = "English (US)",

isbn = "9783030014209",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "607--619",

editor = "Yannis Manolopoulos and Barbara Hammer and Ilias Maglogiannis and Vera Kurkova and Lazaros Iliadis",

booktitle = "Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings",

}

TY - GEN

T1 - NoSync

T2 - 27th International Conference on Artificial Neural Networks, ICANN 2018

AU - Isakov, Mihailo

AU - Kinsy, Michel A.

N1 - Publisher Copyright: © Springer Nature Switzerland AG 2018.

PY - 2018

Y1 - 2018

N2 - Training deep neural networks on big datasets remains a computational challenge. It can take hundreds of hours to perform and requires distributed computing systems to accelerate. Common distributed data-parallel approaches share a single model across multiple workers, train on different batches, aggregate gradients, and redistribute the new model. In this work, we propose NoSync, a particle swarm optimization inspired alternative where each worker trains a separate model, and applies pressure forcing models to converge. NoSync explores a greater portion of the parameter space and provides resilience to overfitting. It consistently offers higher accuracy compared to single workers, offers a linear speedup for smaller clusters, and is orthogonal to existing data-parallel approaches.

AB - Training deep neural networks on big datasets remains a computational challenge. It can take hundreds of hours to perform and requires distributed computing systems to accelerate. Common distributed data-parallel approaches share a single model across multiple workers, train on different batches, aggregate gradients, and redistribute the new model. In this work, we propose NoSync, a particle swarm optimization inspired alternative where each worker trains a separate model, and applies pressure forcing models to converge. NoSync explores a greater portion of the parameter space and provides resilience to overfitting. It consistently offers higher accuracy compared to single workers, offers a linear speedup for smaller clusters, and is orthogonal to existing data-parallel approaches.

KW - Artificial neural network

KW - Deep learning

KW - Distributed systems

KW - Evolutionary algorithm

KW - Particle swarm optimization

UR - http://www.scopus.com/inward/record.url?scp=85054881570&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85054881570&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-01421-6_58

DO - 10.1007/978-3-030-01421-6_58

M3 - Conference contribution

AN - SCOPUS:85054881570

SN - 9783030014209

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 607

EP - 619

BT - Artificial Neural Networks and Machine Learning – ICANN 2018 - 27th International Conference on Artificial Neural Networks, 2018, Proceedings

A2 - Manolopoulos, Yannis

A2 - Hammer, Barbara

A2 - Maglogiannis, Ilias

A2 - Kurkova, Vera

A2 - Iliadis, Lazaros

PB - Springer Verlag

Y2 - 4 October 2018 through 7 October 2018

ER -

NoSync: Particle swarm inspired distributed DNN training

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this