Linking physiology and transcriptional profiles by quantitative predictive models

Jatin Misra, Ilias Alevizos, Daehee Hwang, George Stephanopoulos, Gregory Stephanopoulos

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

A methodology for the construction of quantitative, predictive models of physiology from transcriptional profiles is presented. The method utilizes partial least squares (PLS) regression properly modified to allow gene pre-selection based on their signal-to-noise ratio (SNR). The final set of genes is obtained from a consensus ranking of genes across several thousand trials, each carried out with a different set of training samples. The method was tested with transcriptional data from a large-scale microarray study profiling the effects of high-fat diet on the diet-induced obese mouse model C57BL/6J, and the obese-resistant A/J mouse model. Quantitative predictive models were constructed for the age of the C57BL/6J mice and the A/J mice, and for the insulin and leptin levels of the C57Bl/6J mice based on transcriptional data of liver obtained over a 12-week period. Similarly, models for the growth rate of yeast mutants, and the age of Drosophila samples were developed from literature data. Specifically, it is demonstrated that highly predictive models can be constructed with current levels of precision in DNA microarray measurements provided the variation in the physiological measurements is controlled. Genes identified by this method are important for their ability to collectively predict phenotype. The method can be expanded to include various types of physiological or cellular data, thus providing an integrative framework for the construction of predictive models.

Original languageEnglish (US)
Pages (from-to)252-260
Number of pages9
JournalBiotechnology and Bioengineering
Volume98
Issue number1
DOIs
StatePublished - Sep 1 2007
Externally publishedYes

Fingerprint

Physiology
Genes
Obese Mice
Microarrays
Nutrition
Signal-To-Noise Ratio
High Fat Diet
Leptin
Oligonucleotide Array Sequence Analysis
Least-Squares Analysis
Inbred C57BL Mouse
Drosophila
Yeasts
Insulin
Diet
Phenotype
Oils and fats
Liver
Yeast
Growth

Keywords

  • Insulin resistance
  • Partial least squares

ASJC Scopus subject areas

  • Biotechnology
  • Bioengineering
  • Applied Microbiology and Biotechnology

Cite this

Linking physiology and transcriptional profiles by quantitative predictive models. / Misra, Jatin; Alevizos, Ilias; Hwang, Daehee; Stephanopoulos, George; Stephanopoulos, Gregory.

In: Biotechnology and Bioengineering, Vol. 98, No. 1, 01.09.2007, p. 252-260.

Research output: Contribution to journalArticle

Misra, Jatin ; Alevizos, Ilias ; Hwang, Daehee ; Stephanopoulos, George ; Stephanopoulos, Gregory. / Linking physiology and transcriptional profiles by quantitative predictive models. In: Biotechnology and Bioengineering. 2007 ; Vol. 98, No. 1. pp. 252-260.
@article{d2481ff264224359b5bbbd9674f20581,
title = "Linking physiology and transcriptional profiles by quantitative predictive models",
abstract = "A methodology for the construction of quantitative, predictive models of physiology from transcriptional profiles is presented. The method utilizes partial least squares (PLS) regression properly modified to allow gene pre-selection based on their signal-to-noise ratio (SNR). The final set of genes is obtained from a consensus ranking of genes across several thousand trials, each carried out with a different set of training samples. The method was tested with transcriptional data from a large-scale microarray study profiling the effects of high-fat diet on the diet-induced obese mouse model C57BL/6J, and the obese-resistant A/J mouse model. Quantitative predictive models were constructed for the age of the C57BL/6J mice and the A/J mice, and for the insulin and leptin levels of the C57Bl/6J mice based on transcriptional data of liver obtained over a 12-week period. Similarly, models for the growth rate of yeast mutants, and the age of Drosophila samples were developed from literature data. Specifically, it is demonstrated that highly predictive models can be constructed with current levels of precision in DNA microarray measurements provided the variation in the physiological measurements is controlled. Genes identified by this method are important for their ability to collectively predict phenotype. The method can be expanded to include various types of physiological or cellular data, thus providing an integrative framework for the construction of predictive models.",
keywords = "Insulin resistance, Partial least squares",
author = "Jatin Misra and Ilias Alevizos and Daehee Hwang and George Stephanopoulos and Gregory Stephanopoulos",
year = "2007",
month = "9",
day = "1",
doi = "10.1002/bit.21540",
language = "English (US)",
volume = "98",
pages = "252--260",
journal = "Biotechnology and Bioengineering",
issn = "0006-3592",
publisher = "Wiley-VCH Verlag",
number = "1",

}

TY - JOUR

T1 - Linking physiology and transcriptional profiles by quantitative predictive models

AU - Misra, Jatin

AU - Alevizos, Ilias

AU - Hwang, Daehee

AU - Stephanopoulos, George

AU - Stephanopoulos, Gregory

PY - 2007/9/1

Y1 - 2007/9/1

N2 - A methodology for the construction of quantitative, predictive models of physiology from transcriptional profiles is presented. The method utilizes partial least squares (PLS) regression properly modified to allow gene pre-selection based on their signal-to-noise ratio (SNR). The final set of genes is obtained from a consensus ranking of genes across several thousand trials, each carried out with a different set of training samples. The method was tested with transcriptional data from a large-scale microarray study profiling the effects of high-fat diet on the diet-induced obese mouse model C57BL/6J, and the obese-resistant A/J mouse model. Quantitative predictive models were constructed for the age of the C57BL/6J mice and the A/J mice, and for the insulin and leptin levels of the C57Bl/6J mice based on transcriptional data of liver obtained over a 12-week period. Similarly, models for the growth rate of yeast mutants, and the age of Drosophila samples were developed from literature data. Specifically, it is demonstrated that highly predictive models can be constructed with current levels of precision in DNA microarray measurements provided the variation in the physiological measurements is controlled. Genes identified by this method are important for their ability to collectively predict phenotype. The method can be expanded to include various types of physiological or cellular data, thus providing an integrative framework for the construction of predictive models.

AB - A methodology for the construction of quantitative, predictive models of physiology from transcriptional profiles is presented. The method utilizes partial least squares (PLS) regression properly modified to allow gene pre-selection based on their signal-to-noise ratio (SNR). The final set of genes is obtained from a consensus ranking of genes across several thousand trials, each carried out with a different set of training samples. The method was tested with transcriptional data from a large-scale microarray study profiling the effects of high-fat diet on the diet-induced obese mouse model C57BL/6J, and the obese-resistant A/J mouse model. Quantitative predictive models were constructed for the age of the C57BL/6J mice and the A/J mice, and for the insulin and leptin levels of the C57Bl/6J mice based on transcriptional data of liver obtained over a 12-week period. Similarly, models for the growth rate of yeast mutants, and the age of Drosophila samples were developed from literature data. Specifically, it is demonstrated that highly predictive models can be constructed with current levels of precision in DNA microarray measurements provided the variation in the physiological measurements is controlled. Genes identified by this method are important for their ability to collectively predict phenotype. The method can be expanded to include various types of physiological or cellular data, thus providing an integrative framework for the construction of predictive models.

KW - Insulin resistance

KW - Partial least squares

UR - http://www.scopus.com/inward/record.url?scp=34548236521&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548236521&partnerID=8YFLogxK

U2 - 10.1002/bit.21540

DO - 10.1002/bit.21540

M3 - Article

C2 - 17551988

AN - SCOPUS:34548236521

VL - 98

SP - 252

EP - 260

JO - Biotechnology and Bioengineering

JF - Biotechnology and Bioengineering

SN - 0006-3592

IS - 1

ER -