Biological relevance of computationally predicted pathogenicity of noncoding variants

Li Liu, Maxwell D. Sanderford, Ravi Patel, Pramod Chandrashekar, Greg Gibson, Sudhir Kumar

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

Original languageEnglish (US)
Article number330
JournalNature communications
Volume10
Issue number1
DOIs
StatePublished - Dec 1 2019

Fingerprint

Virulence
Nucleotides
Alleles
predictions
Research
annotations
risk assessment
nucleotides
lists
Risk assessment
proportion

ASJC Scopus subject areas

  • Chemistry(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Physics and Astronomy(all)

Cite this

Biological relevance of computationally predicted pathogenicity of noncoding variants. / Liu, Li; Sanderford, Maxwell D.; Patel, Ravi; Chandrashekar, Pramod; Gibson, Greg; Kumar, Sudhir.

In: Nature communications, Vol. 10, No. 1, 330, 01.12.2019.

Research output: Contribution to journalArticle

Liu, Li ; Sanderford, Maxwell D. ; Patel, Ravi ; Chandrashekar, Pramod ; Gibson, Greg ; Kumar, Sudhir. / Biological relevance of computationally predicted pathogenicity of noncoding variants. In: Nature communications. 2019 ; Vol. 10, No. 1.
@article{9941691b2e9a43d6b976b5f743463fb5,
title = "Biological relevance of computationally predicted pathogenicity of noncoding variants",
abstract = "Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.",
author = "Li Liu and Sanderford, {Maxwell D.} and Ravi Patel and Pramod Chandrashekar and Greg Gibson and Sudhir Kumar",
year = "2019",
month = "12",
day = "1",
doi = "10.1038/s41467-018-08270-y",
language = "English (US)",
volume = "10",
journal = "Nature Communications",
issn = "2041-1723",
publisher = "Nature Publishing Group",
number = "1",

}

TY - JOUR

T1 - Biological relevance of computationally predicted pathogenicity of noncoding variants

AU - Liu, Li

AU - Sanderford, Maxwell D.

AU - Patel, Ravi

AU - Chandrashekar, Pramod

AU - Gibson, Greg

AU - Kumar, Sudhir

PY - 2019/12/1

Y1 - 2019/12/1

N2 - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

AB - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

UR - http://www.scopus.com/inward/record.url?scp=85060173750&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85060173750&partnerID=8YFLogxK

U2 - 10.1038/s41467-018-08270-y

DO - 10.1038/s41467-018-08270-y

M3 - Article

VL - 10

JO - Nature Communications

JF - Nature Communications

SN - 2041-1723

IS - 1

M1 - 330

ER -