An accelerated gradient method for trace norm minimization

Shuiwang Ji; Jieping Ye

An accelerated gradient method for trace norm minimization

Shuiwang Ji, Jieping Ye

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

400 Scopus citations

Abstract

We consider the minimization of a smooth loss function regularized by the trace norm of the matrix variable. Such formulation finds applications in many machine learning tasks including multi-task learning, matrix classification, and matrix completion. The standard semidefinite programming formulation for this problem is computationally expensive. In addition, due to the non-smooth nature of the trace norm, the optimal first-order black-box method for solving such class of problems converges as O(1/√k), where k is the iteration counter. In this paper, we exploit the special structure of the trace norm, based on which we propose an extended gradient algorithm that converges as O(1/k). We further propose an accelerated gradient algorithm, which achieves the optimal convergence rate of O(1/k²) for smooth problems. Experiments on multi-task learning problems demonstrate the efficiency of the proposed algorithms.

Original language	English (US)
Title of host publication	Proceedings of the 26th International Conference On Machine Learning, ICML 2009
Pages	457-464
Number of pages	8
State	Published - 2009
Event	26th International Conference On Machine Learning, ICML 2009 - Montreal, QC, Canada Duration: Jun 14 2009 → Jun 18 2009

Publication series

Name	Proceedings of the 26th International Conference On Machine Learning, ICML 2009

Other

Other	26th International Conference On Machine Learning, ICML 2009
Country/Territory	Canada
City	Montreal, QC
Period	6/14/09 → 6/18/09

ASJC Scopus subject areas

Artificial Intelligence
Computer Networks and Communications
Software

Cite this

@inproceedings{5a03efc14f3d438dbe4d7b2055d91b7a,

title = "An accelerated gradient method for trace norm minimization",

abstract = "We consider the minimization of a smooth loss function regularized by the trace norm of the matrix variable. Such formulation finds applications in many machine learning tasks including multi-task learning, matrix classification, and matrix completion. The standard semidefinite programming formulation for this problem is computationally expensive. In addition, due to the non-smooth nature of the trace norm, the optimal first-order black-box method for solving such class of problems converges as O(1/√k), where k is the iteration counter. In this paper, we exploit the special structure of the trace norm, based on which we propose an extended gradient algorithm that converges as O(1/k). We further propose an accelerated gradient algorithm, which achieves the optimal convergence rate of O(1/k2) for smooth problems. Experiments on multi-task learning problems demonstrate the efficiency of the proposed algorithms.",

author = "Shuiwang Ji and Jieping Ye",

year = "2009",

language = "English (US)",

isbn = "9781605585161",

series = "Proceedings of the 26th International Conference On Machine Learning, ICML 2009",

pages = "457--464",

booktitle = "Proceedings of the 26th International Conference On Machine Learning, ICML 2009",

note = "26th International Conference On Machine Learning, ICML 2009 ; Conference date: 14-06-2009 Through 18-06-2009",

}

TY - GEN

T1 - An accelerated gradient method for trace norm minimization

AU - Ji, Shuiwang

AU - Ye, Jieping

PY - 2009

Y1 - 2009

N2 - We consider the minimization of a smooth loss function regularized by the trace norm of the matrix variable. Such formulation finds applications in many machine learning tasks including multi-task learning, matrix classification, and matrix completion. The standard semidefinite programming formulation for this problem is computationally expensive. In addition, due to the non-smooth nature of the trace norm, the optimal first-order black-box method for solving such class of problems converges as O(1/√k), where k is the iteration counter. In this paper, we exploit the special structure of the trace norm, based on which we propose an extended gradient algorithm that converges as O(1/k). We further propose an accelerated gradient algorithm, which achieves the optimal convergence rate of O(1/k2) for smooth problems. Experiments on multi-task learning problems demonstrate the efficiency of the proposed algorithms.

AB - We consider the minimization of a smooth loss function regularized by the trace norm of the matrix variable. Such formulation finds applications in many machine learning tasks including multi-task learning, matrix classification, and matrix completion. The standard semidefinite programming formulation for this problem is computationally expensive. In addition, due to the non-smooth nature of the trace norm, the optimal first-order black-box method for solving such class of problems converges as O(1/√k), where k is the iteration counter. In this paper, we exploit the special structure of the trace norm, based on which we propose an extended gradient algorithm that converges as O(1/k). We further propose an accelerated gradient algorithm, which achieves the optimal convergence rate of O(1/k2) for smooth problems. Experiments on multi-task learning problems demonstrate the efficiency of the proposed algorithms.

UR - http://www.scopus.com/inward/record.url?scp=71149103464&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=71149103464&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:71149103464

SN - 9781605585161

T3 - Proceedings of the 26th International Conference On Machine Learning, ICML 2009

SP - 457

EP - 464

BT - Proceedings of the 26th International Conference On Machine Learning, ICML 2009

T2 - 26th International Conference On Machine Learning, ICML 2009

Y2 - 14 June 2009 through 18 June 2009

ER -

An accelerated gradient method for trace norm minimization

Abstract

Publication series

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this