Determination of minimum sample size and discriminatory expression patterns in microarray data

Daehee Hwang; William A. Schmitt; George Stephanopoulos; Gregory Stephanopoulos

doi:10.1093/bioinformatics/18.9.1184

Determination of minimum sample size and discriminatory expression patterns in microarray data

Daehee Hwang, William A. Schmitt, George Stephanopoulos, Gregory Stephanopoulos

Research output: Contribution to journal › Article › peer-review

100 Scopus citations

Abstract

Motivation: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences. Results: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group mean in the FDA discrimination space are statistically different.

Original language	English (US)
Pages (from-to)	1184-1193
Number of pages	10
Journal	Bioinformatics
Volume	18
Issue number	9
DOIs	https://doi.org/10.1093/bioinformatics/18.9.1184
State	Published - Sep 2002
Externally published	Yes

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Access to Document

10.1093/bioinformatics/18.9.1184

Cite this

@article{7ee5b9ef0de647a3a06378c12d65c924,

title = "Determination of minimum sample size and discriminatory expression patterns in microarray data",

abstract = "Motivation: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences. Results: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group mean in the FDA discrimination space are statistically different.",

author = "Daehee Hwang and Schmitt, {William A.} and George Stephanopoulos and Gregory Stephanopoulos",

note = "Funding Information: This work was supported by the Engineering Research Program of the Office of Basic Energy Science at the Dept. of Energy, Grant No. DE-FG02-94ER-14487 and DE-FG02-99ER-15015 and NIH Grant number 1-R01-DK58533-01. We also acknowledge support by the Merck Genome Research Institute (MGRI).",

year = "2002",

month = sep,

doi = "10.1093/bioinformatics/18.9.1184",

language = "English (US)",

volume = "18",

pages = "1184--1193",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "9",

}

TY - JOUR

T1 - Determination of minimum sample size and discriminatory expression patterns in microarray data

AU - Hwang, Daehee

AU - Schmitt, William A.

AU - Stephanopoulos, George

AU - Stephanopoulos, Gregory

N1 - Funding Information: This work was supported by the Engineering Research Program of the Office of Basic Energy Science at the Dept. of Energy, Grant No. DE-FG02-94ER-14487 and DE-FG02-99ER-15015 and NIH Grant number 1-R01-DK58533-01. We also acknowledge support by the Merck Genome Research Institute (MGRI).

PY - 2002/9

Y1 - 2002/9

N2 - Motivation: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences. Results: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group mean in the FDA discrimination space are statistically different.

AB - Motivation: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences. Results: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group mean in the FDA discrimination space are statistically different.

UR - http://www.scopus.com/inward/record.url?scp=0036738906&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036738906&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/18.9.1184

DO - 10.1093/bioinformatics/18.9.1184

M3 - Article

C2 - 12217910

AN - SCOPUS:0036738906

SN - 1367-4803

VL - 18

SP - 1184

EP - 1193

JO - Bioinformatics

JF - Bioinformatics

IS - 9

ER -

Determination of minimum sample size and discriminatory expression patterns in microarray data

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this