TY - JOUR
T1 - Generalized linear discriminant analysis
T2 - A unified framework and efficient model selection
AU - Ji, Shuiwang
AU - Ye, Jieping
N1 - Funding Information:
Manuscript received November 7, 2007; revised March 31, 2008; accepted June 3, 2008. First published September 26, 2008; current version published October 8, 2008. This work was supported in part by the Arizona State University and by the National Science Foundation under Grant IIS-0612069.
PY - 2008
Y1 - 2008
N2 - High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.
AB - High-dimensional data are common in many domains, and dimensionality reduction is the key to cope with the curse-of-dimensionality. Linear discriminant analysis (LDA) is a well-known method for supervised dimensionality reduction. When dealing with high-dimensional and low sample size data, classical LDA suffers from the singularity problem. Over the years, many algorithms have been developed to overcome this problem, and they have been applied successfully in various applications. However, there is a lack of a systematic study of the commonalities and differences of these algorithms, as well as their intrinsic relationships. In this paper, a unified framework for generalized LDA is proposed, which elucidates the properties of various algorithms and their relationships. Based on the proposed framework, we show that the matrix computations involved in LDA-based algorithms can be simplified so that the cross-validation procedure for model selection can be performed efficiently. We conduct extensive experiments using a collection of high-dimensional data sets, including text documents, face images, gene expression data, and gene expression pattern images, to evaluate the proposed theories and algorithms.
KW - Dimensionality reduction
KW - Linear discriminant analysis (LDA)
KW - Model selection
KW - Principal component analysis (PCA)
KW - Regularization
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=54349111811&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=54349111811&partnerID=8YFLogxK
U2 - 10.1109/TNN.2008.2002078
DO - 10.1109/TNN.2008.2002078
M3 - Article
C2 - 18842480
AN - SCOPUS:54349111811
SN - 2162-237X
VL - 19
SP - 1768
EP - 1782
JO - IEEE Transactions on Neural Networks
JF - IEEE Transactions on Neural Networks
IS - 10
ER -