Estimating divergence times in large molecular phylogenies

Koichiro Tamura, Fabia Ursula Battistuzzi, Paul Billing-Ross, Oscar Murillo, Alan Filipski, Sudhir Kumar

Research output: Contribution to journalArticle

207 Citations (Scopus)

Abstract

Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly larger datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times required by current methods for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergences for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer simulated datasets where evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with even greater speed difference for larger number of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.

Original languageEnglish (US)
Pages (from-to)19333-19338
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume109
Issue number47
DOIs
StatePublished - Nov 20 2012

Fingerprint

Phylogeny
Calibration
Bayes Theorem
Uncertainty
Datasets

Keywords

  • Bioinformatics
  • Relaxed clocks
  • Timescales

ASJC Scopus subject areas

  • General

Cite this

Tamura, K., Battistuzzi, F. U., Billing-Ross, P., Murillo, O., Filipski, A., & Kumar, S. (2012). Estimating divergence times in large molecular phylogenies. Proceedings of the National Academy of Sciences of the United States of America, 109(47), 19333-19338. https://doi.org/10.1073/pnas.1213199109

Estimating divergence times in large molecular phylogenies. / Tamura, Koichiro; Battistuzzi, Fabia Ursula; Billing-Ross, Paul; Murillo, Oscar; Filipski, Alan; Kumar, Sudhir.

In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 109, No. 47, 20.11.2012, p. 19333-19338.

Research output: Contribution to journalArticle

Tamura, K, Battistuzzi, FU, Billing-Ross, P, Murillo, O, Filipski, A & Kumar, S 2012, 'Estimating divergence times in large molecular phylogenies', Proceedings of the National Academy of Sciences of the United States of America, vol. 109, no. 47, pp. 19333-19338. https://doi.org/10.1073/pnas.1213199109
Tamura, Koichiro ; Battistuzzi, Fabia Ursula ; Billing-Ross, Paul ; Murillo, Oscar ; Filipski, Alan ; Kumar, Sudhir. / Estimating divergence times in large molecular phylogenies. In: Proceedings of the National Academy of Sciences of the United States of America. 2012 ; Vol. 109, No. 47. pp. 19333-19338.
@article{bdb13c329ba74785943ebab5810256b6,
title = "Estimating divergence times in large molecular phylogenies",
abstract = "Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly larger datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times required by current methods for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergences for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer simulated datasets where evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with even greater speed difference for larger number of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.",
keywords = "Bioinformatics, Relaxed clocks, Timescales",
author = "Koichiro Tamura and Battistuzzi, {Fabia Ursula} and Paul Billing-Ross and Oscar Murillo and Alan Filipski and Sudhir Kumar",
year = "2012",
month = "11",
day = "20",
doi = "10.1073/pnas.1213199109",
language = "English (US)",
volume = "109",
pages = "19333--19338",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
number = "47",

}

TY - JOUR

T1 - Estimating divergence times in large molecular phylogenies

AU - Tamura, Koichiro

AU - Battistuzzi, Fabia Ursula

AU - Billing-Ross, Paul

AU - Murillo, Oscar

AU - Filipski, Alan

AU - Kumar, Sudhir

PY - 2012/11/20

Y1 - 2012/11/20

N2 - Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly larger datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times required by current methods for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergences for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer simulated datasets where evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with even greater speed difference for larger number of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.

AB - Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly larger datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times required by current methods for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergences for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer simulated datasets where evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with even greater speed difference for larger number of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.

KW - Bioinformatics

KW - Relaxed clocks

KW - Timescales

UR - http://www.scopus.com/inward/record.url?scp=84869752137&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84869752137&partnerID=8YFLogxK

U2 - 10.1073/pnas.1213199109

DO - 10.1073/pnas.1213199109

M3 - Article

C2 - 23129628

AN - SCOPUS:84869752137

VL - 109

SP - 19333

EP - 19338

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - 47

ER -