Efficient methods for overlapping group Lasso

Lei Yuan; Jun Liu; Jieping Ye

Efficient methods for overlapping group Lasso

Lei Yuan, Jun Liu, Jieping Ye

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The group Lasso is an extension of the Lasso for feature selection on (predefined) non-overlapping groups of features. The non-overlapping group structure limits its applicability in practice. There have been several recent attempts to study a more general formulation, where groups of features are given, potentially with overlaps between the groups. The resulting optimization is, however, much more challenging to solve due to the group overlaps. In this paper, we consider the efficient optimization of the overlapping group Lasso penalized problem. We reveal several key properties of the proximal operator associated with the overlapping group Lasso, and compute the proximal operator by solving the smooth and convex dual problem, which allows the use of the gradient descent type of algorithms for the optimization. We have performed empirical evaluations using both synthetic and the breast cancer gene expression data set, which consists of 8,141 genes organized into (overlapping) gene sets. Experimental results show that the proposed algorithm is more efficient than existing state-of-the-art algorithms.

Original language	English (US)
Title of host publication	Advances in Neural Information Processing Systems 24
Subtitle of host publication	25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011
State	Published - 2011
Event	25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011 - Granada, Spain Duration: Dec 12 2011 → Dec 14 2011

Publication series

Name	Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Other

Other	25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011
Country/Territory	Spain
City	Granada
Period	12/12/11 → 12/14/11

ASJC Scopus subject areas

Information Systems

Cite this

Efficient methods for overlapping group Lasso. / Yuan, Lei; Liu, Jun; Ye, Jieping.
Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011. 2011. (Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Yuan, L, Liu, J & Ye, J 2011, Efficient methods for overlapping group Lasso. in Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011. Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011, 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011, Granada, Spain, 12/12/11.

@inproceedings{3dfd8b9a54c74d8b95f651e615f142d3,

title = "Efficient methods for overlapping group Lasso",

abstract = "The group Lasso is an extension of the Lasso for feature selection on (predefined) non-overlapping groups of features. The non-overlapping group structure limits its applicability in practice. There have been several recent attempts to study a more general formulation, where groups of features are given, potentially with overlaps between the groups. The resulting optimization is, however, much more challenging to solve due to the group overlaps. In this paper, we consider the efficient optimization of the overlapping group Lasso penalized problem. We reveal several key properties of the proximal operator associated with the overlapping group Lasso, and compute the proximal operator by solving the smooth and convex dual problem, which allows the use of the gradient descent type of algorithms for the optimization. We have performed empirical evaluations using both synthetic and the breast cancer gene expression data set, which consists of 8,141 genes organized into (overlapping) gene sets. Experimental results show that the proposed algorithm is more efficient than existing state-of-the-art algorithms.",

author = "Lei Yuan and Jun Liu and Jieping Ye",

year = "2011",

language = "English (US)",

isbn = "9781618395993",

series = "Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011",

booktitle = "Advances in Neural Information Processing Systems 24",

note = "25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011 ; Conference date: 12-12-2011 Through 14-12-2011",

}

TY - GEN

T1 - Efficient methods for overlapping group Lasso

AU - Yuan, Lei

AU - Liu, Jun

AU - Ye, Jieping

PY - 2011

Y1 - 2011

N2 - The group Lasso is an extension of the Lasso for feature selection on (predefined) non-overlapping groups of features. The non-overlapping group structure limits its applicability in practice. There have been several recent attempts to study a more general formulation, where groups of features are given, potentially with overlaps between the groups. The resulting optimization is, however, much more challenging to solve due to the group overlaps. In this paper, we consider the efficient optimization of the overlapping group Lasso penalized problem. We reveal several key properties of the proximal operator associated with the overlapping group Lasso, and compute the proximal operator by solving the smooth and convex dual problem, which allows the use of the gradient descent type of algorithms for the optimization. We have performed empirical evaluations using both synthetic and the breast cancer gene expression data set, which consists of 8,141 genes organized into (overlapping) gene sets. Experimental results show that the proposed algorithm is more efficient than existing state-of-the-art algorithms.

AB - The group Lasso is an extension of the Lasso for feature selection on (predefined) non-overlapping groups of features. The non-overlapping group structure limits its applicability in practice. There have been several recent attempts to study a more general formulation, where groups of features are given, potentially with overlaps between the groups. The resulting optimization is, however, much more challenging to solve due to the group overlaps. In this paper, we consider the efficient optimization of the overlapping group Lasso penalized problem. We reveal several key properties of the proximal operator associated with the overlapping group Lasso, and compute the proximal operator by solving the smooth and convex dual problem, which allows the use of the gradient descent type of algorithms for the optimization. We have performed empirical evaluations using both synthetic and the breast cancer gene expression data set, which consists of 8,141 genes organized into (overlapping) gene sets. Experimental results show that the proposed algorithm is more efficient than existing state-of-the-art algorithms.

UR - http://www.scopus.com/inward/record.url?scp=84860648874&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84860648874&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84860648874

SN - 9781618395993

T3 - Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

BT - Advances in Neural Information Processing Systems 24

T2 - 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Y2 - 12 December 2011 through 14 December 2011

ER -

Efficient methods for overlapping group Lasso

Abstract

Publication series

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this