Biological relevance of computationally predicted pathogenicity of noncoding variants

Li Liu, Maxwell D. Sanderford, Ravi Patel, Pramod Chandrashekar, Greg Gibson, Sudhir Kumar

    Research output: Contribution to journalArticle

    3 Citations (Scopus)

    Abstract

    Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

    Original languageEnglish (US)
    Article number330
    JournalNature communications
    Volume10
    Issue number1
    DOIs
    StatePublished - Dec 1 2019

    Fingerprint

    Virulence
    Nucleotides
    Alleles
    predictions
    Research
    annotations
    risk assessment
    nucleotides
    lists
    Risk assessment
    proportion

    ASJC Scopus subject areas

    • Chemistry(all)
    • Biochemistry, Genetics and Molecular Biology(all)
    • Physics and Astronomy(all)

    Cite this

    Biological relevance of computationally predicted pathogenicity of noncoding variants. / Liu, Li; Sanderford, Maxwell D.; Patel, Ravi; Chandrashekar, Pramod; Gibson, Greg; Kumar, Sudhir.

    In: Nature communications, Vol. 10, No. 1, 330, 01.12.2019.

    Research output: Contribution to journalArticle

    Liu, Li ; Sanderford, Maxwell D. ; Patel, Ravi ; Chandrashekar, Pramod ; Gibson, Greg ; Kumar, Sudhir. / Biological relevance of computationally predicted pathogenicity of noncoding variants. In: Nature communications. 2019 ; Vol. 10, No. 1.
    @article{9941691b2e9a43d6b976b5f743463fb5,
    title = "Biological relevance of computationally predicted pathogenicity of noncoding variants",
    abstract = "Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.",
    author = "Li Liu and Sanderford, {Maxwell D.} and Ravi Patel and Pramod Chandrashekar and Greg Gibson and Sudhir Kumar",
    year = "2019",
    month = "12",
    day = "1",
    doi = "10.1038/s41467-018-08270-y",
    language = "English (US)",
    volume = "10",
    journal = "Nature Communications",
    issn = "2041-1723",
    publisher = "Nature Publishing Group",
    number = "1",

    }

    TY - JOUR

    T1 - Biological relevance of computationally predicted pathogenicity of noncoding variants

    AU - Liu, Li

    AU - Sanderford, Maxwell D.

    AU - Patel, Ravi

    AU - Chandrashekar, Pramod

    AU - Gibson, Greg

    AU - Kumar, Sudhir

    PY - 2019/12/1

    Y1 - 2019/12/1

    N2 - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

    AB - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.

    UR - http://www.scopus.com/inward/record.url?scp=85060173750&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85060173750&partnerID=8YFLogxK

    U2 - 10.1038/s41467-018-08270-y

    DO - 10.1038/s41467-018-08270-y

    M3 - Article

    C2 - 30659175

    AN - SCOPUS:85060173750

    VL - 10

    JO - Nature Communications

    JF - Nature Communications

    SN - 2041-1723

    IS - 1

    M1 - 330

    ER -