Propensity score analysis with missing data

Heining Cham; Stephen West

doi:10.1037/met0000076

Propensity score analysis with missing data

Heining Cham, Stephen West

Research output: Contribution to journal › Article › peer-review

41 Scopus citations

Abstract

Propensity score analysis is a method that equates treatment and control groups on a comprehensive set of measured confounders in observational (nonrandomized) studies. A successful propensity score analysis reduces bias in the estimate of the average treatment effect in a nonrandomized study, making the estimate more comparable with that obtained from a randomized experiment. This article reviews and discusses an important practical issue in propensity analysis, in which the baseline covariates (potential confounders) and the outcome have missing values (incompletely observed). We review the statistical theory of propensity score analysis and estimation methods for propensity scores with incompletely observed covariates. Traditional logistic regression and modern machine learning methods (e.g., random forests, generalized boosted modeling) as estimation methods for incompletely observed covariates are reviewed. Balance diagnostics and equating methods for incompletely observed covariates are briefly described. Using an empirical example, the propensity score estimation methods for incompletely observed covariates are illustrated and compared.

Original language	English (US)
Pages (from-to)	427-445
Number of pages	19
Journal	Psychological Methods
Volume	21
Issue number	3
DOIs	https://doi.org/10.1037/met0000076
State	Published - Sep 1 2016

Keywords

Machine learning
Missing data
Nonrandomization
Propensity score

ASJC Scopus subject areas

Psychology (miscellaneous)

Access to Document

10.1037/met0000076

Cite this

@article{2d56003883b34bd98f8230565b399ee1,

title = "Propensity score analysis with missing data",

abstract = "Propensity score analysis is a method that equates treatment and control groups on a comprehensive set of measured confounders in observational (nonrandomized) studies. A successful propensity score analysis reduces bias in the estimate of the average treatment effect in a nonrandomized study, making the estimate more comparable with that obtained from a randomized experiment. This article reviews and discusses an important practical issue in propensity analysis, in which the baseline covariates (potential confounders) and the outcome have missing values (incompletely observed). We review the statistical theory of propensity score analysis and estimation methods for propensity scores with incompletely observed covariates. Traditional logistic regression and modern machine learning methods (e.g., random forests, generalized boosted modeling) as estimation methods for incompletely observed covariates are reviewed. Balance diagnostics and equating methods for incompletely observed covariates are briefly described. Using an empirical example, the propensity score estimation methods for incompletely observed covariates are illustrated and compared.",

keywords = "Machine learning, Missing data, Nonrandomization, Propensity score",

author = "Heining Cham and Stephen West",

note = "Publisher Copyright: {\textcopyright} 2016 American Psychological Association.",

year = "2016",

month = sep,

day = "1",

doi = "10.1037/met0000076",

language = "English (US)",

volume = "21",

pages = "427--445",

journal = "Psychological Methods",

issn = "1082-989X",

publisher = "American Psychological Association Inc.",

number = "3",

}

TY - JOUR

T1 - Propensity score analysis with missing data

AU - Cham, Heining

AU - West, Stephen

PY - 2016/9/1

Y1 - 2016/9/1

N2 - Propensity score analysis is a method that equates treatment and control groups on a comprehensive set of measured confounders in observational (nonrandomized) studies. A successful propensity score analysis reduces bias in the estimate of the average treatment effect in a nonrandomized study, making the estimate more comparable with that obtained from a randomized experiment. This article reviews and discusses an important practical issue in propensity analysis, in which the baseline covariates (potential confounders) and the outcome have missing values (incompletely observed). We review the statistical theory of propensity score analysis and estimation methods for propensity scores with incompletely observed covariates. Traditional logistic regression and modern machine learning methods (e.g., random forests, generalized boosted modeling) as estimation methods for incompletely observed covariates are reviewed. Balance diagnostics and equating methods for incompletely observed covariates are briefly described. Using an empirical example, the propensity score estimation methods for incompletely observed covariates are illustrated and compared.

AB - Propensity score analysis is a method that equates treatment and control groups on a comprehensive set of measured confounders in observational (nonrandomized) studies. A successful propensity score analysis reduces bias in the estimate of the average treatment effect in a nonrandomized study, making the estimate more comparable with that obtained from a randomized experiment. This article reviews and discusses an important practical issue in propensity analysis, in which the baseline covariates (potential confounders) and the outcome have missing values (incompletely observed). We review the statistical theory of propensity score analysis and estimation methods for propensity scores with incompletely observed covariates. Traditional logistic regression and modern machine learning methods (e.g., random forests, generalized boosted modeling) as estimation methods for incompletely observed covariates are reviewed. Balance diagnostics and equating methods for incompletely observed covariates are briefly described. Using an empirical example, the propensity score estimation methods for incompletely observed covariates are illustrated and compared.

KW - Machine learning

KW - Missing data

KW - Nonrandomization

KW - Propensity score

UR - http://www.scopus.com/inward/record.url?scp=84959865972&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959865972&partnerID=8YFLogxK

U2 - 10.1037/met0000076

DO - 10.1037/met0000076

M3 - Article

C2 - 26962757

AN - SCOPUS:84959865972

SN - 1082-989X

VL - 21

SP - 427

EP - 445

JO - Psychological Methods

JF - Psychological Methods

IS - 3

ER -

Propensity score analysis with missing data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this