Generalized linear discriminant analysis: A unified framework and efficient model selection

Shuiwang Ji; Jieping Ye

doi:10.1109/TNN.2008.2002078

Generalized linear discriminant analysis: A unified framework and efficient model selection

Shuiwang Ji, Jieping Ye

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Contribution to journal › Article › peer-review

168 Scopus citations

Abstract

High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.

Original language	English (US)
Pages (from-to)	1768-1782
Number of pages	15
Journal	IEEE Transactions on Neural Networks
Volume	19
Issue number	10
DOIs	https://doi.org/10.1109/TNN.2008.2002078
State	Published - 2008

Keywords

Dimensionality reduction
Linear discriminant analysis (LDA)
Model selection
Principal component analysis (PCA)
Regularization
Visualization

ASJC Scopus subject areas

Software
Computer Science Applications
Computer Networks and Communications
Artificial Intelligence

Access to Document

10.1109/TNN.2008.2002078

Cite this

@article{d429578dbd4b46b3bb5196d4523b1694,

title = "Generalized linear discriminant analysis: A unified framework and efficient model selection",

abstract = "High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.",

keywords = "Dimensionality reduction, Linear discriminant analysis (LDA), Model selection, Principal component analysis (PCA), Regularization, Visualization",

author = "Shuiwang Ji and Jieping Ye",

note = "Funding Information: Manuscript received November 7, 2007; revised March 31, 2008; accepted June 3, 2008. First published September 26, 2008; current version published October 8, 2008. This work was supported in part by the Arizona State University and by the National Science Foundation under Grant IIS-0612069.",

year = "2008",

doi = "10.1109/TNN.2008.2002078",

language = "English (US)",

volume = "19",

pages = "1768--1782",

journal = "IEEE Transactions on Neural Networks",

issn = "1045-9227",

publisher = "IEEE Computational Intelligence Society",

number = "10",

}

TY - JOUR

T1 - Generalized linear discriminant analysis

T2 - A unified framework and efficient model selection

AU - Ji, Shuiwang

AU - Ye, Jieping

N1 - Funding Information: Manuscript received November 7, 2007; revised March 31, 2008; accepted June 3, 2008. First published September 26, 2008; current version published October 8, 2008. This work was supported in part by the Arizona State University and by the National Science Foundation under Grant IIS-0612069.

PY - 2008

Y1 - 2008

N2 - High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.

AB - High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.

KW - Dimensionality reduction

KW - Linear discriminant analysis (LDA)

KW - Model selection

KW - Principal component analysis (PCA)

KW - Regularization

KW - Visualization

UR - http://www.scopus.com/inward/record.url?scp=54349111811&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=54349111811&partnerID=8YFLogxK

U2 - 10.1109/TNN.2008.2002078

DO - 10.1109/TNN.2008.2002078

M3 - Article

C2 - 18842480

AN - SCOPUS:54349111811

SN - 1045-9227

VL - 19

SP - 1768

EP - 1782

JO - IEEE Transactions on Neural Networks

JF - IEEE Transactions on Neural Networks

IS - 10

ER -

Generalized linear discriminant analysis: A unified framework and efficient model selection

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this