TY - JOUR
T1 - Nonparametric failure time
T2 - Time-to-event machine learning with heteroskedastic Bayesian additive regression trees and low information omnibus Dirichlet process mixtures
AU - Sparapani, Rodney A.
AU - Logan, Brent R.
AU - Maiers, Martin J.
AU - Laud, Purushottam W.
AU - McCulloch, Robert E.
N1 - Funding Information:
This research was supported by the US Office of Naval Research N00014‐18‐1‐2888 along with the assistance and computational resources provided by the Research Computing Center of the Medical College of Wisconsin. Dr. Logan's research with the Center for International Blood and Marrow Transplant Research (CIBMTR) is supported in part by Public Health Service U24CA076518 from the National Cancer Institute (NCI), the National Heart, Lung and Blood Institute (NHLBI), and the National Institute of Allergy and Infectious Diseases (NIAID).
Publisher Copyright:
© 2023 The International Biometric Society.
PY - 2023
Y1 - 2023
N2 - Many popular survival models rely on restrictive parametric, or semiparametric, assumptions that could provide erroneous predictions when the effects of covariates are complex. Modern advances in computational hardware have led to an increasing interest in flexible Bayesian nonparametric methods for time-to-event data such as Bayesian additive regression trees (BART). We propose a novel approach that we call nonparametric failure time (NFT) BART in order to increase the flexibility beyond accelerated failure time (AFT) and proportional hazard models. NFT BART has three key features: (1) a BART prior for the mean function of the event time logarithm; (2) a heteroskedastic BART prior to deduce a covariate-dependent variance function; and (3) a flexible nonparametric error distribution using Dirichlet process mixtures (DPM). Our proposed approach widens the scope of hazard shapes including nonproportional hazards, can be scaled up to large sample sizes, naturally provides estimates of uncertainty via the posterior and can be seamlessly employed for variable selection. We provide convenient, user-friendly, computer software that is freely available as a reference implementation. Simulations demonstrate that NFT BART maintains excellent performance for survival prediction especially when AFT assumptions are violated by heteroskedasticity. We illustrate the proposed approach on a study examining predictors for mortality risk in patients undergoing hematopoietic stem cell transplant (HSCT) for blood-borne cancer, where heteroskedasticity and nonproportional hazards are likely present.
AB - Many popular survival models rely on restrictive parametric, or semiparametric, assumptions that could provide erroneous predictions when the effects of covariates are complex. Modern advances in computational hardware have led to an increasing interest in flexible Bayesian nonparametric methods for time-to-event data such as Bayesian additive regression trees (BART). We propose a novel approach that we call nonparametric failure time (NFT) BART in order to increase the flexibility beyond accelerated failure time (AFT) and proportional hazard models. NFT BART has three key features: (1) a BART prior for the mean function of the event time logarithm; (2) a heteroskedastic BART prior to deduce a covariate-dependent variance function; and (3) a flexible nonparametric error distribution using Dirichlet process mixtures (DPM). Our proposed approach widens the scope of hazard shapes including nonproportional hazards, can be scaled up to large sample sizes, naturally provides estimates of uncertainty via the posterior and can be seamlessly employed for variable selection. We provide convenient, user-friendly, computer software that is freely available as a reference implementation. Simulations demonstrate that NFT BART maintains excellent performance for survival prediction especially when AFT assumptions are violated by heteroskedasticity. We illustrate the proposed approach on a study examining predictors for mortality risk in patients undergoing hematopoietic stem cell transplant (HSCT) for blood-borne cancer, where heteroskedasticity and nonproportional hazards are likely present.
KW - accelerated failure time
KW - BART
KW - constrained DPM
KW - hematopoietic stem cell transplant
KW - LIO prior hierarchy
KW - nonproportional hazards
KW - survival analysis
KW - Thompson sampling variable selection
UR - http://www.scopus.com/inward/record.url?scp=85153305626&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85153305626&partnerID=8YFLogxK
U2 - 10.1111/biom.13857
DO - 10.1111/biom.13857
M3 - Article
AN - SCOPUS:85153305626
SN - 0006-341X
JO - Biometrics
JF - Biometrics
ER -