Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma

Leland S. Hu; Lujia Wang; Andrea Hawkins-Daarud; Jennifer M. Eschbacher; Kyle W. Singleton; Pamela R. Jackson; Kamala Clark-Swanson; Christopher P. Sereduk; Sen Peng; Panwen Wang; Junwen Wang; Leslie C. Baxter; Kris A. Smith; Gina L. Mazza; Ashley M. Stokes; Bernard R. Bendok; Richard S. Zimmerman; Chandan Krishna; Alyx B. Porter; Maciej M. Mrugala; Joseph M. Hoxworth; Teresa Wu; Nhan L. Tran; Kristin R. Swanson; Jing Li

doi:10.1038/s41598-021-83141-z

Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma

Leland S. Hu, Lujia Wang, Andrea Hawkins-Daarud, Jennifer M. Eschbacher, Kyle W. Singleton, Pamela R. Jackson, Kamala Clark-Swanson, Christopher P. Sereduk, Sen Peng, Panwen Wang, Junwen Wang, Leslie C. Baxter, Kris A. Smith, Gina L. Mazza, Ashley M. Stokes, Bernard R. Bendok, Richard S. Zimmerman, Chandan Krishna, Alyx B. Porter, Maciej M. MrugalaJoseph M. Hoxworth, Teresa Wu, Nhan L. Tran, Kristin R. Swanson, Jing Li

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Contribution to journal › Article › peer-review

13 Scopus citations

Abstract

Radiogenomics uses machine-learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in the area of radiogenomics across many cancers, and its potential role in advancing clinical decision making, no published studies have directly addressed uncertainty in these model predictions. We developed a radiogenomics ML model to quantify uncertainty using transductive Gaussian Processes (GP) and a unique dataset of 95 image-localized biopsies with spatially matched MRI from 25 untreated Glioblastoma (GBM) patients. The model generated predictions for regional EGFR amplification status (a common and important target in GBM) to resolve the intratumoral genetic heterogeneity across each individual tumor—a key factor for future personalized therapeutic paradigms. The model used probability distributions for each sample prediction to quantify uncertainty, and used transductive learning to reduce the overall uncertainty. We compared predictive accuracy and uncertainty of the transductive learning GP model against a standard GP model using leave-one-patient-out cross validation. Additionally, we used a separate dataset containing 24 image-localized biopsies from 7 high-grade glioma patients to validate the model. Predictive uncertainty informed the likelihood of achieving an accurate sample prediction. When stratifying predictions based on uncertainty, we observed substantially higher performance in the group cohort (75% accuracy, n = 95) and amongst sample predictions with the lowest uncertainty (83% accuracy, n = 72) compared to predictions with higher uncertainty (48% accuracy, n = 23), due largely to data interpolation (rather than extrapolation). On the separate validation set, our model achieved 78% accuracy amongst the sample predictions with lowest uncertainty. We present a novel approach to quantify radiogenomics uncertainty to enhance model performance and clinical interpretability. This should help integrate more reliable radiogenomics models for improved medical decision-making.

Original language	English (US)
Article number	3932
Journal	Scientific reports
Volume	11
Issue number	1
DOIs	https://doi.org/10.1038/s41598-021-83141-z
State	Published - Dec 2021

ASJC Scopus subject areas

General

Access to Document

10.1038/s41598-021-83141-z

Cite this

Hu, L. S., Wang, L., Hawkins-Daarud, A., Eschbacher, J. M., Singleton, K. W., Jackson, P. R., Clark-Swanson, K., Sereduk, C. P., Peng, S., Wang, P., Wang, J., Baxter, L. C., Smith, K. A., Mazza, G. L., Stokes, A. M., Bendok, B. R., Zimmerman, R. S., Krishna, C., Porter, A. B., ... Li, J. (2021). Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma. Scientific reports, 11(1), Article 3932. https://doi.org/10.1038/s41598-021-83141-z

Hu, LS, Wang, L, Hawkins-Daarud, A, Eschbacher, JM, Singleton, KW, Jackson, PR, Clark-Swanson, K, Sereduk, CP, Peng, S, Wang, P, Wang, J, Baxter, LC, Smith, KA, Mazza, GL, Stokes, AM, Bendok, BR, Zimmerman, RS, Krishna, C, Porter, AB, Mrugala, MM, Hoxworth, JM, Wu, T, Tran, NL, Swanson, KR & Li, J 2021, 'Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma', Scientific reports, vol. 11, no. 1, 3932. https://doi.org/10.1038/s41598-021-83141-z

@article{48a6ae3cf4f14c9db5a5ce980f3d87fb,

title = "Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma",

abstract = "Radiogenomics uses machine-learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in the area of radiogenomics across many cancers, and its potential role in advancing clinical decision making, no published studies have directly addressed uncertainty in these model predictions. We developed a radiogenomics ML model to quantify uncertainty using transductive Gaussian Processes (GP) and a unique dataset of 95 image-localized biopsies with spatially matched MRI from 25 untreated Glioblastoma (GBM) patients. The model generated predictions for regional EGFR amplification status (a common and important target in GBM) to resolve the intratumoral genetic heterogeneity across each individual tumor—a key factor for future personalized therapeutic paradigms. The model used probability distributions for each sample prediction to quantify uncertainty, and used transductive learning to reduce the overall uncertainty. We compared predictive accuracy and uncertainty of the transductive learning GP model against a standard GP model using leave-one-patient-out cross validation. Additionally, we used a separate dataset containing 24 image-localized biopsies from 7 high-grade glioma patients to validate the model. Predictive uncertainty informed the likelihood of achieving an accurate sample prediction. When stratifying predictions based on uncertainty, we observed substantially higher performance in the group cohort (75% accuracy, n = 95) and amongst sample predictions with the lowest uncertainty (83% accuracy, n = 72) compared to predictions with higher uncertainty (48% accuracy, n = 23), due largely to data interpolation (rather than extrapolation). On the separate validation set, our model achieved 78% accuracy amongst the sample predictions with lowest uncertainty. We present a novel approach to quantify radiogenomics uncertainty to enhance model performance and clinical interpretability. This should help integrate more reliable radiogenomics models for improved medical decision-making.",

author = "Hu, {Leland S.} and Lujia Wang and Andrea Hawkins-Daarud and Eschbacher, {Jennifer M.} and Singleton, {Kyle W.} and Jackson, {Pamela R.} and Kamala Clark-Swanson and Sereduk, {Christopher P.} and Sen Peng and Panwen Wang and Junwen Wang and Baxter, {Leslie C.} and Smith, {Kris A.} and Mazza, {Gina L.} and Stokes, {Ashley M.} and Bendok, {Bernard R.} and Zimmerman, {Richard S.} and Chandan Krishna and Porter, {Alyx B.} and Mrugala, {Maciej M.} and Hoxworth, {Joseph M.} and Teresa Wu and Tran, {Nhan L.} and Swanson, {Kristin R.} and Jing Li",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2021",

month = dec,

doi = "10.1038/s41598-021-83141-z",

language = "English (US)",

volume = "11",

journal = "Scientific reports",

issn = "2045-2322",

publisher = "Nature Publishing Group",

number = "1",

}

TY - JOUR

T1 - Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma

AU - Hu, Leland S.

AU - Wang, Lujia

AU - Hawkins-Daarud, Andrea

AU - Eschbacher, Jennifer M.

AU - Singleton, Kyle W.

AU - Jackson, Pamela R.

AU - Clark-Swanson, Kamala

AU - Sereduk, Christopher P.

AU - Peng, Sen

AU - Wang, Panwen

AU - Wang, Junwen

AU - Baxter, Leslie C.

AU - Smith, Kris A.

AU - Mazza, Gina L.

AU - Stokes, Ashley M.

AU - Bendok, Bernard R.

AU - Zimmerman, Richard S.

AU - Krishna, Chandan

AU - Porter, Alyx B.

AU - Mrugala, Maciej M.

AU - Hoxworth, Joseph M.

AU - Wu, Teresa

AU - Tran, Nhan L.

AU - Swanson, Kristin R.

AU - Li, Jing

PY - 2021/12

Y1 - 2021/12

N2 - Radiogenomics uses machine-learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in the area of radiogenomics across many cancers, and its potential role in advancing clinical decision making, no published studies have directly addressed uncertainty in these model predictions. We developed a radiogenomics ML model to quantify uncertainty using transductive Gaussian Processes (GP) and a unique dataset of 95 image-localized biopsies with spatially matched MRI from 25 untreated Glioblastoma (GBM) patients. The model generated predictions for regional EGFR amplification status (a common and important target in GBM) to resolve the intratumoral genetic heterogeneity across each individual tumor—a key factor for future personalized therapeutic paradigms. The model used probability distributions for each sample prediction to quantify uncertainty, and used transductive learning to reduce the overall uncertainty. We compared predictive accuracy and uncertainty of the transductive learning GP model against a standard GP model using leave-one-patient-out cross validation. Additionally, we used a separate dataset containing 24 image-localized biopsies from 7 high-grade glioma patients to validate the model. Predictive uncertainty informed the likelihood of achieving an accurate sample prediction. When stratifying predictions based on uncertainty, we observed substantially higher performance in the group cohort (75% accuracy, n = 95) and amongst sample predictions with the lowest uncertainty (83% accuracy, n = 72) compared to predictions with higher uncertainty (48% accuracy, n = 23), due largely to data interpolation (rather than extrapolation). On the separate validation set, our model achieved 78% accuracy amongst the sample predictions with lowest uncertainty. We present a novel approach to quantify radiogenomics uncertainty to enhance model performance and clinical interpretability. This should help integrate more reliable radiogenomics models for improved medical decision-making.

AB - Radiogenomics uses machine-learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in the area of radiogenomics across many cancers, and its potential role in advancing clinical decision making, no published studies have directly addressed uncertainty in these model predictions. We developed a radiogenomics ML model to quantify uncertainty using transductive Gaussian Processes (GP) and a unique dataset of 95 image-localized biopsies with spatially matched MRI from 25 untreated Glioblastoma (GBM) patients. The model generated predictions for regional EGFR amplification status (a common and important target in GBM) to resolve the intratumoral genetic heterogeneity across each individual tumor—a key factor for future personalized therapeutic paradigms. The model used probability distributions for each sample prediction to quantify uncertainty, and used transductive learning to reduce the overall uncertainty. We compared predictive accuracy and uncertainty of the transductive learning GP model against a standard GP model using leave-one-patient-out cross validation. Additionally, we used a separate dataset containing 24 image-localized biopsies from 7 high-grade glioma patients to validate the model. Predictive uncertainty informed the likelihood of achieving an accurate sample prediction. When stratifying predictions based on uncertainty, we observed substantially higher performance in the group cohort (75% accuracy, n = 95) and amongst sample predictions with the lowest uncertainty (83% accuracy, n = 72) compared to predictions with higher uncertainty (48% accuracy, n = 23), due largely to data interpolation (rather than extrapolation). On the separate validation set, our model achieved 78% accuracy amongst the sample predictions with lowest uncertainty. We present a novel approach to quantify radiogenomics uncertainty to enhance model performance and clinical interpretability. This should help integrate more reliable radiogenomics models for improved medical decision-making.

UR - http://www.scopus.com/inward/record.url?scp=85100934663&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85100934663&partnerID=8YFLogxK

U2 - 10.1038/s41598-021-83141-z

DO - 10.1038/s41598-021-83141-z

M3 - Article

C2 - 33594116

AN - SCOPUS:85100934663

SN - 2045-2322

VL - 11

JO - Scientific reports

JF - Scientific reports

IS - 1

M1 - 3932

ER -

Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this