Abstract
Predictive models benefit from a compact, non-redundant subset of features that improves interpretability and generalization. Modern data sets are wide, dirty, mixed with both numerical and categorical predictors, and may contain interactive effects that require complex models. This is a challenge for filters, wrappers, and embedded feature selection methods. We describe details of an algorithm using tree-based ensembles to generate a compact subset of non-redundant features. Parallel and serial ensembles of trees are combined into a mixed method that can uncover masking and detect features of secondary effect. Simulated and actual examples illustrate the effectiveness of the approach.
Original language | English (US) |
---|---|
Pages (from-to) | 1341-1366 |
Number of pages | 26 |
Journal | Journal of Machine Learning Research |
Volume | 10 |
State | Published - Jul 2009 |
Keywords
- Importance
- Masking
- Resampling
- Residuals
- Trees
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Statistics and Probability
- Artificial Intelligence