Interactive exploration of microarray gene expression patterns in a reduced dimensional space

Jatin Misra, William Schmitt, Daehee Hwang, Li Li Hsiao, Steve Gullans, George Stephanopoulos, Gregory Stephanopoulos

Research output: Contribution to journalArticle

117 Citations (Scopus)

Abstract

The very high dimensional space of gene expression measurements obtained by DNA microarrays impedes the detection of underlying patterns in gene expression data and the identification of discriminatory genes. In this paper we show the use of projection methods such as principal components analysis (PCA) to obtain a direct link between patterns in the genes and patterns in samples. This feature is useful in the initial interactive pattern exploration of gene expression data and data-driven learning of the nature and types of samples. Using oligonucleotide microarray measurements of 40 samples from different normal human tissues, we show that distinct patterns are obtained when the genes are projected on a two-dimensional plane spanned by the loadings of the two major principal components. These patterns define the particular genes associated with a sample class (i.e., tissue). When used separately from the other genes, these class-specific (i.e., tissue-specific) genes in turn define distinct tissue patterns in the projection space spanned by the scores of the two major principal components. In this study, PCA projection facilitated discriminatory gene selection for different tissues and identified tissue-specific gene expression signatures for liver, skeletal muscle, and brain samples. Furthermore, it allowed the classification of nine new samples belonging to these three types using the linear combination of the expression levels of the tissue-specific genes determined from the first set of samples. The application of the technique to other published data sets is also discussed.

Original languageEnglish (US)
Pages (from-to)1112-1120
Number of pages9
JournalGenome Research
Volume12
Issue number7
DOIs
StatePublished - Jul 30 2002
Externally publishedYes

Fingerprint

Gene Expression
Genes
Principal Component Analysis
Oligonucleotide Array Sequence Analysis
Transcriptome
Skeletal Muscle
Learning
Liver
Brain

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Misra, J., Schmitt, W., Hwang, D., Hsiao, L. L., Gullans, S., Stephanopoulos, G., & Stephanopoulos, G. (2002). Interactive exploration of microarray gene expression patterns in a reduced dimensional space. Genome Research, 12(7), 1112-1120. https://doi.org/10.1101/gr.225302

Interactive exploration of microarray gene expression patterns in a reduced dimensional space. / Misra, Jatin; Schmitt, William; Hwang, Daehee; Hsiao, Li Li; Gullans, Steve; Stephanopoulos, George; Stephanopoulos, Gregory.

In: Genome Research, Vol. 12, No. 7, 30.07.2002, p. 1112-1120.

Research output: Contribution to journalArticle

Misra, J, Schmitt, W, Hwang, D, Hsiao, LL, Gullans, S, Stephanopoulos, G & Stephanopoulos, G 2002, 'Interactive exploration of microarray gene expression patterns in a reduced dimensional space', Genome Research, vol. 12, no. 7, pp. 1112-1120. https://doi.org/10.1101/gr.225302
Misra, Jatin ; Schmitt, William ; Hwang, Daehee ; Hsiao, Li Li ; Gullans, Steve ; Stephanopoulos, George ; Stephanopoulos, Gregory. / Interactive exploration of microarray gene expression patterns in a reduced dimensional space. In: Genome Research. 2002 ; Vol. 12, No. 7. pp. 1112-1120.
@article{73cf63970d914961a5a37e77f6961660,
title = "Interactive exploration of microarray gene expression patterns in a reduced dimensional space",
abstract = "The very high dimensional space of gene expression measurements obtained by DNA microarrays impedes the detection of underlying patterns in gene expression data and the identification of discriminatory genes. In this paper we show the use of projection methods such as principal components analysis (PCA) to obtain a direct link between patterns in the genes and patterns in samples. This feature is useful in the initial interactive pattern exploration of gene expression data and data-driven learning of the nature and types of samples. Using oligonucleotide microarray measurements of 40 samples from different normal human tissues, we show that distinct patterns are obtained when the genes are projected on a two-dimensional plane spanned by the loadings of the two major principal components. These patterns define the particular genes associated with a sample class (i.e., tissue). When used separately from the other genes, these class-specific (i.e., tissue-specific) genes in turn define distinct tissue patterns in the projection space spanned by the scores of the two major principal components. In this study, PCA projection facilitated discriminatory gene selection for different tissues and identified tissue-specific gene expression signatures for liver, skeletal muscle, and brain samples. Furthermore, it allowed the classification of nine new samples belonging to these three types using the linear combination of the expression levels of the tissue-specific genes determined from the first set of samples. The application of the technique to other published data sets is also discussed.",
author = "Jatin Misra and William Schmitt and Daehee Hwang and Hsiao, {Li Li} and Steve Gullans and George Stephanopoulos and Gregory Stephanopoulos",
year = "2002",
month = "7",
day = "30",
doi = "10.1101/gr.225302",
language = "English (US)",
volume = "12",
pages = "1112--1120",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "7",

}

TY - JOUR

T1 - Interactive exploration of microarray gene expression patterns in a reduced dimensional space

AU - Misra, Jatin

AU - Schmitt, William

AU - Hwang, Daehee

AU - Hsiao, Li Li

AU - Gullans, Steve

AU - Stephanopoulos, George

AU - Stephanopoulos, Gregory

PY - 2002/7/30

Y1 - 2002/7/30

N2 - The very high dimensional space of gene expression measurements obtained by DNA microarrays impedes the detection of underlying patterns in gene expression data and the identification of discriminatory genes. In this paper we show the use of projection methods such as principal components analysis (PCA) to obtain a direct link between patterns in the genes and patterns in samples. This feature is useful in the initial interactive pattern exploration of gene expression data and data-driven learning of the nature and types of samples. Using oligonucleotide microarray measurements of 40 samples from different normal human tissues, we show that distinct patterns are obtained when the genes are projected on a two-dimensional plane spanned by the loadings of the two major principal components. These patterns define the particular genes associated with a sample class (i.e., tissue). When used separately from the other genes, these class-specific (i.e., tissue-specific) genes in turn define distinct tissue patterns in the projection space spanned by the scores of the two major principal components. In this study, PCA projection facilitated discriminatory gene selection for different tissues and identified tissue-specific gene expression signatures for liver, skeletal muscle, and brain samples. Furthermore, it allowed the classification of nine new samples belonging to these three types using the linear combination of the expression levels of the tissue-specific genes determined from the first set of samples. The application of the technique to other published data sets is also discussed.

AB - The very high dimensional space of gene expression measurements obtained by DNA microarrays impedes the detection of underlying patterns in gene expression data and the identification of discriminatory genes. In this paper we show the use of projection methods such as principal components analysis (PCA) to obtain a direct link between patterns in the genes and patterns in samples. This feature is useful in the initial interactive pattern exploration of gene expression data and data-driven learning of the nature and types of samples. Using oligonucleotide microarray measurements of 40 samples from different normal human tissues, we show that distinct patterns are obtained when the genes are projected on a two-dimensional plane spanned by the loadings of the two major principal components. These patterns define the particular genes associated with a sample class (i.e., tissue). When used separately from the other genes, these class-specific (i.e., tissue-specific) genes in turn define distinct tissue patterns in the projection space spanned by the scores of the two major principal components. In this study, PCA projection facilitated discriminatory gene selection for different tissues and identified tissue-specific gene expression signatures for liver, skeletal muscle, and brain samples. Furthermore, it allowed the classification of nine new samples belonging to these three types using the linear combination of the expression levels of the tissue-specific genes determined from the first set of samples. The application of the technique to other published data sets is also discussed.

UR - http://www.scopus.com/inward/record.url?scp=0036069214&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036069214&partnerID=8YFLogxK

U2 - 10.1101/gr.225302

DO - 10.1101/gr.225302

M3 - Article

C2 - 12097349

AN - SCOPUS:0036069214

VL - 12

SP - 1112

EP - 1120

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 7

ER -