Codon-based detection of positive selection can be biased by heterogeneous distribution of polar amino acids along protein sequences.

Xuhua Xia, Sudhir Kumar

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

The ratio of the number of nonsynonymous substitutions per site (Ka) over the number of synonymous substitutions per site (Ks) has often been used to detect positive selection. Investigators now commonly generate Ka/Ks ratio profiles in a sliding window to look for peaks and valleys in order to identify regions under positive selection. Here we show that the interpretation of peaks in the Ka/Ks profile as evidence for positive selection can be misleading. Genic regions with Ka/Ks > 1 in the MRG gene family, previously claimed to be under positive selection, are associated with a high frequency of polar amino acids with a high mutability. This association between an increased Ka and a high proportion of polar amino acids appears general and not limited to the MRG gene family or the sliding-window approach. For example, the sites detected to be under positive selection in the HIV1 protein-coding genes with a high posterior probability turn out to be mostly occupied by polar amino acids. These findings caution against invoking positive selection from Ka/Ks ratios and highlight the need for considering biochemical properties of the protein domains showing high Ka/Ks ratios. In short, a high Ka/Ks ratio may arise from the intrinsic properties of amino acids instead of from extrinsic positive selection.

Original languageEnglish (US)
Pages (from-to)335-340
Number of pages6
JournalComputational systems bioinformatics / Life Sciences Society. Computational Systems Bioinformatics Conference
DOIs
StatePublished - 2006
Externally publishedYes

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'Codon-based detection of positive selection can be biased by heterogeneous distribution of polar amino acids along protein sequences.'. Together they form a unique fingerprint.

Cite this