Jointly modeling label and feature heterogeneity in medical informatics

Pei Yang; Hongxia Yang; Haoda Fu; Dawei Zhou; Jieping Ye; Theodoros Lappas; Jingrui He

doi:10.1145/2768831

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

Multiple types of heterogeneity including label heterogeneity and feature heterogeneity often co-exist in many real-world data mining applications, such as diabetes treatment classification, gene functionality prediction, and brain image analysis. To effectively leverage such heterogeneity, in this article, we propose a novel graph-based model for Learning with both Label and Feature heterogeneity, namely L²F. It models the label correlation by requiring that any two label-specific classifiers behave similarly on the same views if the associated labels are similar, and imposes the view consistency by requiring that view-based classifiers generate similar predictions on the same examples. The objective function for L²F is jointly convex. To solve the optimization problem, we propose an iterative algorithm, which is guaranteed to converge to the global optimum. One appealing feature of L²F is that it is capable of handling data with missing views and labels. Furthermore, we analyze its generalization performance based on Rademacher complexity, which sheds light on the benefits of jointly modeling the label and feature heterogeneity. Experimental results on various biomedical datasets show the effectiveness of the proposed approach.

Original language	English (US)
Article number	39
Journal	ACM Transactions on Knowledge Discovery from Data
Volume	10
Issue number	4
DOIs	https://doi.org/10.1145/2768831
State	Published - May 2016

Keywords

Heterogeneous learning
Medical informatics
Multi-label learning
Multi-view learning

ASJC Scopus subject areas

General Computer Science

Cite this

@article{0da817de47e34205852e6039539c0b94,

title = "Jointly modeling label and feature heterogeneity in medical informatics",

abstract = "Multiple types of heterogeneity including label heterogeneity and feature heterogeneity often co-exist in many real-world data mining applications, such as diabetes treatment classification, gene functionality prediction, and brain image analysis. To effectively leverage such heterogeneity, in this article, we propose a novel graph-based model for Learning with both Label and Feature heterogeneity, namely L$^2$F. It models the label correlation by requiring that any two label-specific classifiers behave similarly on the same views if the associated labels are similar, and imposes the view consistency by requiring that view-based classifiers generate similar predictions on the same examples. The objective function for L$^2$F is jointly convex. To solve the optimization problem, we propose an iterative algorithm, which is guaranteed to converge to the global optimum. One appealing feature of L$^2$F is that it is capable of handling data with missing views and labels. Furthermore, we analyze its generalization performance based on Rademacher complexity, which sheds light on the benefits of jointly modeling the label and feature heterogeneity. Experimental results on various biomedical datasets show the effectiveness of the proposed approach.",

keywords = "Heterogeneous learning, Medical informatics, Multi-label learning, Multi-view learning",

author = "Pei Yang and Hongxia Yang and Haoda Fu and Dawei Zhou and Jieping Ye and Theodoros Lappas and Jingrui He",

note = "Funding Information: This work is partially supported by the NSF (No. IIS1017415), the Army Research Laboratory (No. W911NF-09-2-0053), Region II University Transportation Center (No. 49997-33 25), DARPA (No. W911NF-11-C-0200 and W911NF-12-C-0028), and NSFC (No. 61473123). Publisher Copyright: {\textcopyright} 2016 ACM.",

year = "2016",

month = may,

doi = "10.1145/2768831",

language = "English (US)",

volume = "10",

journal = "ACM Transactions on Knowledge Discovery from Data",

issn = "1556-4681",

publisher = "Association for Computing Machinery (ACM)",

number = "4",

}

TY - JOUR

T1 - Jointly modeling label and feature heterogeneity in medical informatics

AU - Yang, Pei

AU - Yang, Hongxia

AU - Fu, Haoda

AU - Zhou, Dawei

AU - Ye, Jieping

AU - Lappas, Theodoros

AU - He, Jingrui

N1 - Funding Information: This work is partially supported by the NSF (No. IIS1017415), the Army Research Laboratory (No. W911NF-09-2-0053), Region II University Transportation Center (No. 49997-33 25), DARPA (No. W911NF-11-C-0200 and W911NF-12-C-0028), and NSFC (No. 61473123). Publisher Copyright: © 2016 ACM.

PY - 2016/5

Y1 - 2016/5

N2 - Multiple types of heterogeneity including label heterogeneity and feature heterogeneity often co-exist in many real-world data mining applications, such as diabetes treatment classification, gene functionality prediction, and brain image analysis. To effectively leverage such heterogeneity, in this article, we propose a novel graph-based model for Learning with both Label and Feature heterogeneity, namely L²F. It models the label correlation by requiring that any two label-specific classifiers behave similarly on the same views if the associated labels are similar, and imposes the view consistency by requiring that view-based classifiers generate similar predictions on the same examples. The objective function for L²F is jointly convex. To solve the optimization problem, we propose an iterative algorithm, which is guaranteed to converge to the global optimum. One appealing feature of L²F is that it is capable of handling data with missing views and labels. Furthermore, we analyze its generalization performance based on Rademacher complexity, which sheds light on the benefits of jointly modeling the label and feature heterogeneity. Experimental results on various biomedical datasets show the effectiveness of the proposed approach.

AB - Multiple types of heterogeneity including label heterogeneity and feature heterogeneity often co-exist in many real-world data mining applications, such as diabetes treatment classification, gene functionality prediction, and brain image analysis. To effectively leverage such heterogeneity, in this article, we propose a novel graph-based model for Learning with both Label and Feature heterogeneity, namely L²F. It models the label correlation by requiring that any two label-specific classifiers behave similarly on the same views if the associated labels are similar, and imposes the view consistency by requiring that view-based classifiers generate similar predictions on the same examples. The objective function for L²F is jointly convex. To solve the optimization problem, we propose an iterative algorithm, which is guaranteed to converge to the global optimum. One appealing feature of L²F is that it is capable of handling data with missing views and labels. Furthermore, we analyze its generalization performance based on Rademacher complexity, which sheds light on the benefits of jointly modeling the label and feature heterogeneity. Experimental results on various biomedical datasets show the effectiveness of the proposed approach.

KW - Heterogeneous learning

KW - Medical informatics

KW - Multi-label learning

KW - Multi-view learning

UR - http://www.scopus.com/inward/record.url?scp=84973444975&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84973444975&partnerID=8YFLogxK

U2 - 10.1145/2768831

DO - 10.1145/2768831

M3 - Article

AN - SCOPUS:84973444975

SN - 1556-4681

VL - 10

JO - ACM Transactions on Knowledge Discovery from Data

JF - ACM Transactions on Knowledge Discovery from Data

IS - 4

M1 - 39

ER -
