Quantifying the intragenic distribution of human disease mutations

M. P. Miller, J. D. Parker, S. W. Rissing, Sudhir Kumar

Research output: Contribution to journalArticle

22 Citations (Scopus)

Abstract

A wide variety of functional domains exist within human genes. Since different domains vary in their roles regarding overall gene function, the ability for a mutation in a gene region to produce disease varies among domains. We tested two hypotheses regarding distributions of mutations among functional domains by using (1) sets of single nucleotide disease mutations for six genes (CFTR, TSC2, G6PD, PAX6, RS1, and PAH) and (2) sets of polymorphic replacement and silent mutations found in two genes (CFTR and TSC2). First, we tested the null hypothesis that sets of mutations are uniformly distributed among functional domains within genes. Second, we tested the null hypothesis that disease mutations are distributed among gene regions according to expectations derived from the distribution of evolutionary conserved and variable amino acid sites throughout each gene. In contrast to the mainly uniform distribution of sets of silent and polymorphic mutations, sets of disease mutations generally rejected the null hypotheses of both uniform and evolutionary-influenced distributions. Although the disease mutation data showed a better agreement with the evolutionary-derived expectations, disease mutations were found to be statistically overabundant in conserved domains, and under-represented in variable regions, even after accounting for amino acid site variability of domains over long-term evolutionary history. This finding suggests that there is a non-additive influence of amino acid site conservation on the observed intragenic distribution of disease mutations, and underscores the importance of understanding the patterns of neutral amino acid substitutions permitted in a gene over long-term evolutionary history.

Original languageEnglish (US)
Pages (from-to)567-579
Number of pages13
JournalAnnals of Human Genetics
Volume67
Issue number6
DOIs
StatePublished - Nov 2003

Fingerprint

Mutation
Genes
Amino Acids
History
Neutral Amino Acids
Amino Acid Substitution
Nucleotides
Silent Mutation

ASJC Scopus subject areas

  • Genetics(clinical)
  • Genetics

Cite this

Quantifying the intragenic distribution of human disease mutations. / Miller, M. P.; Parker, J. D.; Rissing, S. W.; Kumar, Sudhir.

In: Annals of Human Genetics, Vol. 67, No. 6, 11.2003, p. 567-579.

Research output: Contribution to journalArticle

Miller, M. P. ; Parker, J. D. ; Rissing, S. W. ; Kumar, Sudhir. / Quantifying the intragenic distribution of human disease mutations. In: Annals of Human Genetics. 2003 ; Vol. 67, No. 6. pp. 567-579.
@article{b2466cf07d144c79942697b81f24b258,
title = "Quantifying the intragenic distribution of human disease mutations",
abstract = "A wide variety of functional domains exist within human genes. Since different domains vary in their roles regarding overall gene function, the ability for a mutation in a gene region to produce disease varies among domains. We tested two hypotheses regarding distributions of mutations among functional domains by using (1) sets of single nucleotide disease mutations for six genes (CFTR, TSC2, G6PD, PAX6, RS1, and PAH) and (2) sets of polymorphic replacement and silent mutations found in two genes (CFTR and TSC2). First, we tested the null hypothesis that sets of mutations are uniformly distributed among functional domains within genes. Second, we tested the null hypothesis that disease mutations are distributed among gene regions according to expectations derived from the distribution of evolutionary conserved and variable amino acid sites throughout each gene. In contrast to the mainly uniform distribution of sets of silent and polymorphic mutations, sets of disease mutations generally rejected the null hypotheses of both uniform and evolutionary-influenced distributions. Although the disease mutation data showed a better agreement with the evolutionary-derived expectations, disease mutations were found to be statistically overabundant in conserved domains, and under-represented in variable regions, even after accounting for amino acid site variability of domains over long-term evolutionary history. This finding suggests that there is a non-additive influence of amino acid site conservation on the observed intragenic distribution of disease mutations, and underscores the importance of understanding the patterns of neutral amino acid substitutions permitted in a gene over long-term evolutionary history.",
author = "Miller, {M. P.} and Parker, {J. D.} and Rissing, {S. W.} and Sudhir Kumar",
year = "2003",
month = "11",
doi = "10.1046/j.1529-8817.2003.00072.x",
language = "English (US)",
volume = "67",
pages = "567--579",
journal = "Annals of Human Genetics",
issn = "0003-4800",
publisher = "Wiley-Blackwell",
number = "6",

}

TY - JOUR

T1 - Quantifying the intragenic distribution of human disease mutations

AU - Miller, M. P.

AU - Parker, J. D.

AU - Rissing, S. W.

AU - Kumar, Sudhir

PY - 2003/11

Y1 - 2003/11

N2 - A wide variety of functional domains exist within human genes. Since different domains vary in their roles regarding overall gene function, the ability for a mutation in a gene region to produce disease varies among domains. We tested two hypotheses regarding distributions of mutations among functional domains by using (1) sets of single nucleotide disease mutations for six genes (CFTR, TSC2, G6PD, PAX6, RS1, and PAH) and (2) sets of polymorphic replacement and silent mutations found in two genes (CFTR and TSC2). First, we tested the null hypothesis that sets of mutations are uniformly distributed among functional domains within genes. Second, we tested the null hypothesis that disease mutations are distributed among gene regions according to expectations derived from the distribution of evolutionary conserved and variable amino acid sites throughout each gene. In contrast to the mainly uniform distribution of sets of silent and polymorphic mutations, sets of disease mutations generally rejected the null hypotheses of both uniform and evolutionary-influenced distributions. Although the disease mutation data showed a better agreement with the evolutionary-derived expectations, disease mutations were found to be statistically overabundant in conserved domains, and under-represented in variable regions, even after accounting for amino acid site variability of domains over long-term evolutionary history. This finding suggests that there is a non-additive influence of amino acid site conservation on the observed intragenic distribution of disease mutations, and underscores the importance of understanding the patterns of neutral amino acid substitutions permitted in a gene over long-term evolutionary history.

AB - A wide variety of functional domains exist within human genes. Since different domains vary in their roles regarding overall gene function, the ability for a mutation in a gene region to produce disease varies among domains. We tested two hypotheses regarding distributions of mutations among functional domains by using (1) sets of single nucleotide disease mutations for six genes (CFTR, TSC2, G6PD, PAX6, RS1, and PAH) and (2) sets of polymorphic replacement and silent mutations found in two genes (CFTR and TSC2). First, we tested the null hypothesis that sets of mutations are uniformly distributed among functional domains within genes. Second, we tested the null hypothesis that disease mutations are distributed among gene regions according to expectations derived from the distribution of evolutionary conserved and variable amino acid sites throughout each gene. In contrast to the mainly uniform distribution of sets of silent and polymorphic mutations, sets of disease mutations generally rejected the null hypotheses of both uniform and evolutionary-influenced distributions. Although the disease mutation data showed a better agreement with the evolutionary-derived expectations, disease mutations were found to be statistically overabundant in conserved domains, and under-represented in variable regions, even after accounting for amino acid site variability of domains over long-term evolutionary history. This finding suggests that there is a non-additive influence of amino acid site conservation on the observed intragenic distribution of disease mutations, and underscores the importance of understanding the patterns of neutral amino acid substitutions permitted in a gene over long-term evolutionary history.

UR - http://www.scopus.com/inward/record.url?scp=0346882600&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0346882600&partnerID=8YFLogxK

U2 - 10.1046/j.1529-8817.2003.00072.x

DO - 10.1046/j.1529-8817.2003.00072.x

M3 - Article

C2 - 14641244

AN - SCOPUS:0346882600

VL - 67

SP - 567

EP - 579

JO - Annals of Human Genetics

JF - Annals of Human Genetics

SN - 0003-4800

IS - 6

ER -