Searching for footprints of positive selection in whole-genome SNP data from nonequilibrium populations

Pavlos Pavlidis, Jeffrey Jensen, Wolfgang Stephan

Research output: Contribution to journalArticle

89 Citations (Scopus)

Abstract

A major goal of population genomics is to reconstruct the history of natural populations and to infer the neutral and selective scenarios that can explain the present-day polymorphism patterns. However, the separation between neutral and selective hypotheses has proven hard, mainly because both may predict similar patterns in the genome. This study focuses on the development of methods that can be used to distinguish neutral from selective hypotheses in equilibrium and nonequilibrium populations. These methods utilize a combination of statistics on the basis of the site frequency spectrum (SFS) and linkage disequilibrium (LD). We investigate the patterns of genetic variation along recombining chromosomes using a multitude of comparisons between neutral and selective hypotheses, such as selection or neutrality in equilibrium and nonequilibrium populations and recurrent selection models. We perform hypothesis testing using the classical P-value approach, but we also introduce methods from the machine-learning field. We demonstrate that the combination of SFS- and LD-based statistics increases the power to detect recent positive selection in populations that have experienced past demographic changes.

Original languageEnglish (US)
Pages (from-to)907-922
Number of pages16
JournalGenetics
Volume185
Issue number3
DOIs
StatePublished - Jul 2010
Externally publishedYes

Fingerprint

Single Nucleotide Polymorphism
Genome
Linkage Disequilibrium
Population
Metagenomics
Natural History
Chromosomes
Demography

ASJC Scopus subject areas

  • Genetics

Cite this

Searching for footprints of positive selection in whole-genome SNP data from nonequilibrium populations. / Pavlidis, Pavlos; Jensen, Jeffrey; Stephan, Wolfgang.

In: Genetics, Vol. 185, No. 3, 07.2010, p. 907-922.

Research output: Contribution to journalArticle

@article{75682856281f4493b69d3de3c4bfd687,
title = "Searching for footprints of positive selection in whole-genome SNP data from nonequilibrium populations",
abstract = "A major goal of population genomics is to reconstruct the history of natural populations and to infer the neutral and selective scenarios that can explain the present-day polymorphism patterns. However, the separation between neutral and selective hypotheses has proven hard, mainly because both may predict similar patterns in the genome. This study focuses on the development of methods that can be used to distinguish neutral from selective hypotheses in equilibrium and nonequilibrium populations. These methods utilize a combination of statistics on the basis of the site frequency spectrum (SFS) and linkage disequilibrium (LD). We investigate the patterns of genetic variation along recombining chromosomes using a multitude of comparisons between neutral and selective hypotheses, such as selection or neutrality in equilibrium and nonequilibrium populations and recurrent selection models. We perform hypothesis testing using the classical P-value approach, but we also introduce methods from the machine-learning field. We demonstrate that the combination of SFS- and LD-based statistics increases the power to detect recent positive selection in populations that have experienced past demographic changes.",
author = "Pavlos Pavlidis and Jeffrey Jensen and Wolfgang Stephan",
year = "2010",
month = "7",
doi = "10.1534/genetics.110.116459",
language = "English (US)",
volume = "185",
pages = "907--922",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "3",

}

TY - JOUR

T1 - Searching for footprints of positive selection in whole-genome SNP data from nonequilibrium populations

AU - Pavlidis, Pavlos

AU - Jensen, Jeffrey

AU - Stephan, Wolfgang

PY - 2010/7

Y1 - 2010/7

N2 - A major goal of population genomics is to reconstruct the history of natural populations and to infer the neutral and selective scenarios that can explain the present-day polymorphism patterns. However, the separation between neutral and selective hypotheses has proven hard, mainly because both may predict similar patterns in the genome. This study focuses on the development of methods that can be used to distinguish neutral from selective hypotheses in equilibrium and nonequilibrium populations. These methods utilize a combination of statistics on the basis of the site frequency spectrum (SFS) and linkage disequilibrium (LD). We investigate the patterns of genetic variation along recombining chromosomes using a multitude of comparisons between neutral and selective hypotheses, such as selection or neutrality in equilibrium and nonequilibrium populations and recurrent selection models. We perform hypothesis testing using the classical P-value approach, but we also introduce methods from the machine-learning field. We demonstrate that the combination of SFS- and LD-based statistics increases the power to detect recent positive selection in populations that have experienced past demographic changes.

AB - A major goal of population genomics is to reconstruct the history of natural populations and to infer the neutral and selective scenarios that can explain the present-day polymorphism patterns. However, the separation between neutral and selective hypotheses has proven hard, mainly because both may predict similar patterns in the genome. This study focuses on the development of methods that can be used to distinguish neutral from selective hypotheses in equilibrium and nonequilibrium populations. These methods utilize a combination of statistics on the basis of the site frequency spectrum (SFS) and linkage disequilibrium (LD). We investigate the patterns of genetic variation along recombining chromosomes using a multitude of comparisons between neutral and selective hypotheses, such as selection or neutrality in equilibrium and nonequilibrium populations and recurrent selection models. We perform hypothesis testing using the classical P-value approach, but we also introduce methods from the machine-learning field. We demonstrate that the combination of SFS- and LD-based statistics increases the power to detect recent positive selection in populations that have experienced past demographic changes.

UR - http://www.scopus.com/inward/record.url?scp=77955759338&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77955759338&partnerID=8YFLogxK

U2 - 10.1534/genetics.110.116459

DO - 10.1534/genetics.110.116459

M3 - Article

C2 - 20407129

AN - SCOPUS:77955759338

VL - 185

SP - 907

EP - 922

JO - Genetics

JF - Genetics

SN - 0016-6731

IS - 3

ER -