Abstract
In this paper, we consider the problem of learning from multiple related tasks for improved generalization performance by extracting their shared structures. The alternating structure optimization (ASO) algorithm, which couples all tasks using a shared feature representation, has been successfully applied in various multitask learning problems. However, ASO is nonconvex and the alternating algorithm only finds a local solution. We first present an improved ASO formulation (iASO) for multitask learning based on a new regularizer. We then convert (iASO), a nonconvex formulation, into a relaxed convex one (rASO). Interestingly, our theoretical analysis reveals that (rASO) finds a globally optimal solution to its nonconvex counterpart (iASO) under certain conditions. (rASO) can be equivalently reformulated as a semidefinite program (SDP), which is, however, not scalable to large datasets. We propose to employ the block coordinate descent (BCD) method and the accelerated projected gradient (APG) algorithm separately to find the globally optimal solution to (rASO); we also develop efficient algorithms for solving the key subproblems involved in BCD and APG. The experiments on the Yahoo webpages datasets and the Drosophila gene expression pattern image datasets demonstrate the effectiveness and efficiency of the proposed algorithms and confirm our theoretical analysis.
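For readers unfamiliar with ASO, the sketch below shows the general shape of an ASO-style objective: each task t has a predictor w_t, and all tasks share a low-dimensional structure Θ with orthonormal rows. The notation (w_t, v_t, Θ, loss L, regularization parameters α and β) is our own illustrative reconstruction of this family of formulations, not an equation quoted from the paper.

```latex
\min_{\{w_t, v_t\},\ \Theta\Theta^{\mathsf T} = I}\ \sum_{t=1}^{m}
\left(
  \frac{1}{n_t}\sum_{i=1}^{n_t} L\!\left(w_t^{\mathsf T} x_{ti},\, y_{ti}\right)
  + \alpha\,\lVert w_t - \Theta^{\mathsf T} v_t \rVert^2
  + \beta\,\lVert w_t \rVert^2
\right)
```

The abstract also names the accelerated projected gradient (APG) method as one of the two scalable solvers for (rASO). Below is a minimal, generic APG (FISTA-style) loop in Python for intuition only: it assumes a smooth objective with gradient `grad_f`, Lipschitz constant `lipschitz`, and Euclidean projection `project` onto the feasible set, all placeholder names we introduce here. It is a sketch of the general technique, not the paper's specialized algorithm or its subproblem solvers.

```python
import numpy as np

def apg(grad_f, project, x0, lipschitz, num_iters=200):
    """Generic accelerated projected gradient (FISTA-style) sketch:
    minimize a smooth function over a convex set, given its gradient
    and the Euclidean projection onto the set."""
    x_prev = x = np.asarray(x0, dtype=float)
    t_prev = t = 1.0
    for _ in range(num_iters):
        # Extrapolation (momentum) step.
        y = x + ((t_prev - 1.0) / t) * (x - x_prev)
        # Gradient step with fixed step size 1/L, projected back onto the set.
        x_prev, x = x, project(y - grad_f(y) / lipschitz)
        # Nesterov momentum weight update.
        t_prev, t = t, (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
    return x

# Toy usage (not from the paper): minimize ||Ax - b||^2 over x >= 0.
rng = np.random.default_rng(0)
A, b = rng.standard_normal((20, 5)), rng.standard_normal(20)
L = 2.0 * np.linalg.norm(A, 2) ** 2  # Lipschitz constant of the gradient
x_star = apg(lambda x: 2.0 * A.T @ (A @ x - b),
             lambda x: np.maximum(x, 0.0),  # projection onto the orthant
             np.zeros(5), L)
```

In the paper's setting the projection would be onto the convex feasible set of (rASO); the nonnegativity projection above merely stands in for illustration.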
| Original language | English (US) |
|---|---|
| Article number | 6296661 |
| Pages (from-to) | 1025-1035 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 35 |
| Issue number | 5 |
| DOIs | |
| State | Published - 2013 |
Keywords
- Multitask learning
- accelerated projected gradient
- alternating structure optimization
- shared predictive structure
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition
- Computational Theory and Mathematics
- Artificial Intelligence
- Applied Mathematics