TY - JOUR
T1 - Push-Pull Gradient Methods for Distributed Optimization in Networks
AU - Pu, Shi
AU - Shi, Wei
AU - Xu, Jinming
AU - Nedic, Angelia
N1 - Funding Information:
Manuscript received August 20, 2019; accepted January 31, 2020. Date of publication February 10, 2020; date of current version December 24, 2020. This work was supported in part by the NSF under Grant CCF-1717391, in part by the ONR under Grant N000141612245, and in part by the SRIBD Research Startup Fund under Grant J00120190011. This paper was presented in part at the Proceedings of the 57th IEEE Conference on Decision and Control, Miami Beach, FL, USA, December 2018 [1]. Recommended by Associate Editor S. Grammatico. (Shi Pu and Wei Shi contributed equally to this work.) (Corresponding authors: Shi Pu; Jinming Xu.) Shi Pu is with the School of Data Science, Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen 518172, China (e-mail: pushi@cuhk.edu.cn).
Publisher Copyright:
© 2020 IEEE.
PY - 2021/1
Y1 - 2021/1
N2 - In this article, we focus on solving a distributed convex optimization problem in a network, where each agent has its own convex cost function and the goal is to minimize the sum of the agents' cost functions while obeying the network connectivity structure. In order to minimize the sum of the cost functions, we consider new distributed gradient-based methods where each node maintains two estimates, namely an estimate of the optimal decision variable and an estimate of the gradient for the average of the agents' objective functions. From the viewpoint of an agent, the information about the gradients is pushed to the neighbors, whereas the information about the decision variable is pulled from the neighbors, hence giving the name 'push-pull gradient methods.' The methods utilize two different graphs for the information exchange among agents and, as such, unify the algorithms with different types of distributed architecture, including decentralized (peer-to-peer), centralized (master-slave), and semicentralized (leader-follower) architectures. We show that the proposed algorithms and their many variants converge linearly for strongly convex and smooth objective functions over a network (possibly with unidirectional data links) in both synchronous and asynchronous random-gossip settings. In particular, under the random-gossip setting, 'push-pull' is the first class of algorithms for distributed optimization over directed graphs. Moreover, we numerically evaluate our proposed algorithms in both scenarios, and show that they outperform other existing linearly convergent schemes, especially for ill-conditioned problems and networks that are not well balanced.
AB - In this article, we focus on solving a distributed convex optimization problem in a network, where each agent has its own convex cost function and the goal is to minimize the sum of the agents' cost functions while obeying the network connectivity structure. In order to minimize the sum of the cost functions, we consider new distributed gradient-based methods where each node maintains two estimates, namely an estimate of the optimal decision variable and an estimate of the gradient for the average of the agents' objective functions. From the viewpoint of an agent, the information about the gradients is pushed to the neighbors, whereas the information about the decision variable is pulled from the neighbors, hence giving the name 'push-pull gradient methods.' The methods utilize two different graphs for the information exchange among agents and, as such, unify the algorithms with different types of distributed architecture, including decentralized (peer-to-peer), centralized (master-slave), and semicentralized (leader-follower) architectures. We show that the proposed algorithms and their many variants converge linearly for strongly convex and smooth objective functions over a network (possibly with unidirectional data links) in both synchronous and asynchronous random-gossip settings. In particular, under the random-gossip setting, 'push-pull' is the first class of algorithms for distributed optimization over directed graphs. Moreover, we numerically evaluate our proposed algorithms in both scenarios, and show that they outperform other existing linearly convergent schemes, especially for ill-conditioned problems and networks that are not well balanced.
KW - Convex optimization
KW - directed graph
KW - distributed optimization
KW - linear convergence
KW - network structure
KW - random-gossip algorithm
KW - spanning tree
UR - http://www.scopus.com/inward/record.url?scp=85098326167&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85098326167&partnerID=8YFLogxK
U2 - 10.1109/TAC.2020.2972824
DO - 10.1109/TAC.2020.2972824
M3 - Article
AN - SCOPUS:85098326167
SN - 0018-9286
VL - 66
SP - 1
EP - 16
JO - IEEE Transactions on Automatic Control
JF - IEEE Transactions on Automatic Control
IS - 1
M1 - 8988200
ER -
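Note: the abstract above outlines a two-graph gradient-tracking scheme in which decision estimates are pulled over one graph and gradient estimates are pushed over another. The following is a minimal NumPy sketch of one synchronous push-pull-style iteration, included only to illustrate that idea; the names R (a row-stochastic mixing matrix used to pull decision estimates), C (a column-stochastic mixing matrix used to push gradient estimates), the step size alpha, and the helper grad(i, x) are assumptions made for this sketch, not the authors' implementation.

import numpy as np

def push_pull_step(X, Y, R, C, grad, alpha):
    # X: (n, d) array; row i is agent i's estimate of the decision variable.
    # Y: (n, d) array; row i is agent i's estimate of the average gradient.
    n = X.shape[0]
    X_new = R @ X - alpha * Y  # pull neighbors' decision estimates, then take a gradient-tracking step
    G_old = np.vstack([grad(i, X[i]) for i in range(n)])
    G_new = np.vstack([grad(i, X_new[i]) for i in range(n)])
    Y_new = C @ Y + G_new - G_old  # push gradient information and track the average gradient
    return X_new, Y_new

Repeating this step is intended to drive every row of X toward the minimizer of the sum of the agents' cost functions, assuming a sufficiently small step size and graphs satisfying suitable connectivity conditions.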