The effect of SNP discovery method and sample size on estimation of population genetic data for Chinese and Indian rhesus macaques (Macaca mulatta)

Jessica A. Satkoski Trask, Ripan S. Malhi, Sree Kanthaswamy, Jesse Johnson, Wendy T. Garnica, Venkat S. Malladi, David Glenn Smith

Research output: Contribution to journal › Article › peer-review

35 Scopus citations

Abstract

This study was designed to address issues regarding sample size and marker location that have arisen from the discovery of SNPs in the genomes of poorly characterized primate species and the application of these markers to the study of primate population genetics. First, we predict the effect of discovery sample size on the probability of discovering both rare and common SNPs and then compare this prediction with the proportion of common and rare SNPs discovered when different numbers of individuals are sequenced. Second, we examine the effect of genomic region on estimates of common population genetic data, comparing markers from both coding and non-coding regions of the rhesus macaque genome and the population genetic data calculated from these markers, to measure the degree and direction of bias introduced by SNPs located in coding versus non-coding regions of the genome. We found that both discovery sample size and genomic region surveyed affect SNP marker attributes and population genetic estimates, even when these are calculated from an expanded data set containing more individuals than the original discovery data set. Although none of the SNP detection methods or genomic regions tested in this study was completely uninformative, these results show that each has a different kind of genetic variation that is suitable for different purposes, and each introduces specific types of bias. Given that each SNP marker has an individual evolutionary history, we calculated that the most complete and unbiased representation of the genetic diversity present in the individual can be obtained by incorporating at least 10 individuals into the discovery sample set, to ensure the discovery of both common and rare polymorphisms.
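
The abstract's prediction about discovery sample size can be illustrated with a textbook binomial calculation. The sketch below is not taken from the article and is not the authors' method. It assumes a biallelic SNP, independent sampling of 2n chromosomes from n diploid individuals, and no sequencing error. The function name and the example allele frequencies (0.25 for "common", 0.05 for "rare") are hypothetical choices made only for illustration.

```python
def detection_probability(maf: float, n_individuals: int) -> float:
    """Probability that a biallelic SNP is seen as polymorphic in the sample.

    Assumption (not from the article): the 2 * n_individuals chromosomes are
    drawn independently, so the SNP is missed only when every sampled
    chromosome carries the same allele.
    """
    if not 0.0 <= maf <= 0.5:
        raise ValueError("minor allele frequency must lie in [0, 0.5]")
    if n_individuals < 1:
        raise ValueError("at least one individual is required")
    chromosomes = 2 * n_individuals
    all_major = (1.0 - maf) ** chromosomes  # only the major allele sampled
    all_minor = maf ** chromosomes          # only the minor allele sampled
    return 1.0 - all_major - all_minor


if __name__ == "__main__":
    # Hypothetical frequencies standing in for "common" and "rare" SNPs.
    for n in (1, 2, 5, 10, 20):
        common = detection_probability(0.25, n)
        rare = detection_probability(0.05, n)
        print(f"n={n:2d}  common={common:.3f}  rare={rare:.3f}")
```

Under these assumptions the detection probability rises with the number of individuals sequenced, and more slowly for rare alleles than for common ones, which is the qualitative pattern the abstract describes.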

Original language: English (US)
Pages (from-to): 129-138
Number of pages: 10
Journal: Primates
Volume: 52
Issue number: 2
State: Published - Apr 2011
Externally published: Yes

Keywords

  • Macaca mulatta
  • Population genetics
  • SNP discovery

ASJC Scopus subject areas

  • Animal Science and Zoology
