Biological relevance of computationally predicted pathogenicity of noncoding variants

Li Liu; Maxwell D. Sanderford; Ravi Patel; Pramod Chandrashekar; Greg Gibson; Sudhir Kumar

doi:10.1038/s41467-018-08270-y

Biological relevance of computationally predicted pathogenicity of noncoding variants

Li Liu, Maxwell D. Sanderford, Ravi Patel, Pramod Chandrashekar, Greg Gibson, Sudhir Kumar

Health Solutions, College of (CHS)

Research output: Contribution to journal › Article › peer-review

32 Scopus citations

Abstract

Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

Original language	English (US)
Article number	330
Journal	Nature communications
Volume	10
Issue number	1
DOIs	https://doi.org/10.1038/s41467-018-08270-y
State	Published - Dec 1 2019

ASJC Scopus subject areas

General Chemistry
General Biochemistry, Genetics and Molecular Biology
General Physics and Astronomy

Access to Document

10.1038/s41467-018-08270-y

Cite this

@article{9941691b2e9a43d6b976b5f743463fb5,

title = "Biological relevance of computationally predicted pathogenicity of noncoding variants",

abstract = "Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.",

author = "Li Liu and Sanderford, {Maxwell D.} and Ravi Patel and Pramod Chandrashekar and Greg Gibson and Sudhir Kumar",

note = "Publisher Copyright: {\textcopyright} 2019, The Author(s).",

year = "2019",

month = dec,

day = "1",

doi = "10.1038/s41467-018-08270-y",

language = "English (US)",

volume = "10",

journal = "Nature communications",

issn = "2041-1723",

publisher = "Nature Publishing Group",

number = "1",

}

TY - JOUR

T1 - Biological relevance of computationally predicted pathogenicity of noncoding variants

AU - Liu, Li

AU - Sanderford, Maxwell D.

AU - Patel, Ravi

AU - Chandrashekar, Pramod

AU - Gibson, Greg

AU - Kumar, Sudhir

PY - 2019/12/1

Y1 - 2019/12/1

N2 - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

AB - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

UR - http://www.scopus.com/inward/record.url?scp=85060173750&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85060173750&partnerID=8YFLogxK

U2 - 10.1038/s41467-018-08270-y

DO - 10.1038/s41467-018-08270-y

M3 - Article

C2 - 30659175

AN - SCOPUS:85060173750

SN - 2041-1723

VL - 10

JO - Nature communications

JF - Nature communications

IS - 1

M1 - 330

ER -

Biological relevance of computationally predicted pathogenicity of noncoding variants

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this