Regression models involving nonlinear effects with missing data: A sequential modeling approach using Bayesian estimation

Oliver Lüdtke, Alexander Robitzsch, Stephen G. West

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

When estimating multiple regression models with incomplete predictor variables, it is necessary to specify a joint distribution for the predictor variables. A convenient assumption is that this distribution is a joint normal distribution, the default in many statistical software packages. This distribution will in general be misspecified if the predictors with missing data have nonlinear effects (e.g., x2) or are included in interaction terms (e.g., x · z). In the present article, we discuss a sequential modeling approach that can be applied to decompose the joint distribution of the variables into 2 parts: (a) a part that is due to the model of interest and (b) a part that is due to the model for the incomplete predictors. We demonstrate how the sequential modeling approach can be used to implement a multiple imputation strategy based on Bayesian estimation techniques that can accommodate rather complex substantive regression models with nonlinear effects and also allows a flexible treatment of auxiliary variables. In 4 simulation studies, we showed that the sequential modeling approach can be applied to estimate nonlinear effects in regression models with missing values on continuous, categorical, or skewed predictor variables under a broad range of conditions and investigated the robustness of the proposed approach against distributional misspecifications. We developed the R package mdmb, which facilitates a user-friendly application of the sequential modeling approach, and we present a real-data example that illustrates the flexibility of the software.

Original languageEnglish (US)
Pages (from-to)157-181
Number of pages25
JournalPsychological Methods
Volume25
Issue number2
DOIs
StatePublished - Apr 2020

Keywords

  • Interaction effects
  • Missing data
  • Multiple imputation
  • Multiple regression

ASJC Scopus subject areas

  • Psychology (miscellaneous)

Fingerprint

Dive into the research topics of 'Regression models involving nonlinear effects with missing data: A sequential modeling approach using Bayesian estimation'. Together they form a unique fingerprint.

Cite this