Abstract
In this article, we introduce the BART R package which is an acronym for Bayesian additive regression trees. BART is a Bayesian nonparametric, machine learning, ensemble predictive modeling method for continuous, binary, categorical and time-to-event out-comes. Furthermore, BART is a tree-based, black-box method which fits the outcome to an arbitrary random function, f, of the covariates. The BART technique is relatively computationally efficient as compared to its competitors, but large sample sizes can be demanding. Therefore, the BART package includes efficient state-of-the-art implemen-tations for continuous, binary, categorical and time-to-event outcomes that can take advantage of modern off-the-shelf hardware and software multi-threading technology. The BART package is written in C++ for both programmer and execution efficiency. The BART package takes advantage of multi-threading via forking as provided by the parallel package and OpenMP when available and supported by the platform. The ensemble of binary trees produced by a BART fit can be stored and re-used later via the R predict function. In addition to being an R package, the installed BART routines can be called directly from C++. The BART package provides the tools for your BART toolbox.
Original language | English (US) |
---|---|
Pages (from-to) | 1-66 |
Number of pages | 66 |
Journal | Journal of Statistical Software |
Volume | 97 |
Issue number | 1 |
DOIs | |
State | Published - 2021 |
Keywords
- Binary trees
- Black-box
- Categorical
- Competing risks
- Continuous
- Ensemble predictive model
- Forking
- Multi-threading
- Multinomial
- OpenMP
- Recurrent events
- Survival analysis
ASJC Scopus subject areas
- Software
- Statistics and Probability
- Statistics, Probability and Uncertainty