Efficiency of the neighbor-joining method in reconstructing deep and shallow evolutionary relationships in large phylogenies

Sudhir Kumar, Sudhindra R. Gadagkar

Research output: Contribution to journalArticle

69 Citations (Scopus)

Abstract

The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.

Original languageEnglish (US)
Pages (from-to)544-553
Number of pages10
JournalJournal of Molecular Evolution
Volume51
Issue number6
StatePublished - 2000

Fingerprint

Phylogeny
Joining
phylogeny
computer simulation
Computer Simulation
methodology
Computer simulation
Genes
method
phylogenetics
gene
simulation
genes

Keywords

  • Accuracy
  • Deep versus shallow branches
  • Large phylogenies
  • Neighbor-joining method
  • Phylogenetic inference
  • Zero-length branches

ASJC Scopus subject areas

  • Genetics
  • Biochemistry
  • Biochemistry, Genetics and Molecular Biology(all)
  • Genetics(clinical)
  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Agricultural and Biological Sciences(all)
  • Agricultural and Biological Sciences (miscellaneous)

Cite this

Efficiency of the neighbor-joining method in reconstructing deep and shallow evolutionary relationships in large phylogenies. / Kumar, Sudhir; Gadagkar, Sudhindra R.

In: Journal of Molecular Evolution, Vol. 51, No. 6, 2000, p. 544-553.

Research output: Contribution to journalArticle

@article{fc475ba24e0d4748b4c338526d8131e7,
title = "Efficiency of the neighbor-joining method in reconstructing deep and shallow evolutionary relationships in large phylogenies",
abstract = "The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.",
keywords = "Accuracy, Deep versus shallow branches, Large phylogenies, Neighbor-joining method, Phylogenetic inference, Zero-length branches",
author = "Sudhir Kumar and Gadagkar, {Sudhindra R.}",
year = "2000",
language = "English (US)",
volume = "51",
pages = "544--553",
journal = "Journal of Molecular Evolution",
issn = "0022-2844",
publisher = "Springer New York",
number = "6",

}

TY - JOUR

T1 - Efficiency of the neighbor-joining method in reconstructing deep and shallow evolutionary relationships in large phylogenies

AU - Kumar, Sudhir

AU - Gadagkar, Sudhindra R.

PY - 2000

Y1 - 2000

N2 - The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.

AB - The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.

KW - Accuracy

KW - Deep versus shallow branches

KW - Large phylogenies

KW - Neighbor-joining method

KW - Phylogenetic inference

KW - Zero-length branches

UR - http://www.scopus.com/inward/record.url?scp=0034548435&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034548435&partnerID=8YFLogxK

M3 - Article

C2 - 11116328

AN - SCOPUS:0034548435

VL - 51

SP - 544

EP - 553

JO - Journal of Molecular Evolution

JF - Journal of Molecular Evolution

SN - 0022-2844

IS - 6

ER -