The use of propensity scores for nonrandomized designs with clustered data

Felix J. Thoemmes; Stephen West

doi:10.1080/00273171.2011.569395

The use of propensity scores for nonrandomized designs with clustered data

Felix J. Thoemmes, Stephen West

Psychology

Research output: Contribution to journal › Article › peer-review

67 Scopus citations

Abstract

In this article we propose several modeling choices to extend propensity score analysis to clustered data. We describe different possible model specifications for estimation of the propensity score: single-level model, fixed effects model, and two random effects models. We also consider both conditioning within clusters and conditioning across clusters. We examine the underlying assumptions of these modeling choices and the type of randomized experiment approximated by each approach. Using a simulation study, we compare the relative performance of these modeling and conditioning choices in reducing bias due to confounding variables at both the person and cluster levels. An applied example based on a study by Hughes, Chen, Thoemmes, and Kwok (2010) is provided in which the effect of retention in Grade 1 on passing an achievement test in Grade 3 is evaluated. We find that models that consider the clustered nature of the data both in estimation of the propensity score and conditioning on the propensity score performed best in our simulation study; however, other modeling choices also performed well. The applied example illustrates practical limitations of these models when cluster sizes are small.

Original language	English (US)
Pages (from-to)	514-543
Number of pages	30
Journal	Multivariate Behavioral Research
Volume	46
Issue number	3
DOIs	https://doi.org/10.1080/00273171.2011.569395
State	Published - May 2011

ASJC Scopus subject areas

Statistics and Probability
Experimental and Cognitive Psychology
Arts and Humanities (miscellaneous)

Access to Document

10.1080/00273171.2011.569395

Cite this

@article{5068f346604d4fbda712ff07ba2bee93,

title = "The use of propensity scores for nonrandomized designs with clustered data",

abstract = "In this article we propose several modeling choices to extend propensity score analysis to clustered data. We describe different possible model specifications for estimation of the propensity score: single-level model, fixed effects model, and two random effects models. We also consider both conditioning within clusters and conditioning across clusters. We examine the underlying assumptions of these modeling choices and the type of randomized experiment approximated by each approach. Using a simulation study, we compare the relative performance of these modeling and conditioning choices in reducing bias due to confounding variables at both the person and cluster levels. An applied example based on a study by Hughes, Chen, Thoemmes, and Kwok (2010) is provided in which the effect of retention in Grade 1 on passing an achievement test in Grade 3 is evaluated. We find that models that consider the clustered nature of the data both in estimation of the propensity score and conditioning on the propensity score performed best in our simulation study; however, other modeling choices also performed well. The applied example illustrates practical limitations of these models when cluster sizes are small.",

author = "Thoemmes, {Felix J.} and Stephen West",

note = "Funding Information: Stephen G. West was supported by a Forschungspreis (research prize) from the Alexander von Humboldt Foundation. The data collected in the empirical example was supported by a grant from the National Institute of Child Health and Development (5 R01 HD39367) to Jan N. Hughes (Principal Investigator), Department of Educational Psychology, Texas A & M University.",

year = "2011",

month = may,

doi = "10.1080/00273171.2011.569395",

language = "English (US)",

volume = "46",

pages = "514--543",

journal = "Multivariate Behavioral Research",

issn = "0027-3171",

publisher = "Psychology Press Ltd",

number = "3",

}

TY - JOUR

T1 - The use of propensity scores for nonrandomized designs with clustered data

AU - Thoemmes, Felix J.

AU - West, Stephen

N1 - Funding Information: Stephen G. West was supported by a Forschungspreis (research prize) from the Alexander von Humboldt Foundation. The data collected in the empirical example was supported by a grant from the National Institute of Child Health and Development (5 R01 HD39367) to Jan N. Hughes (Principal Investigator), Department of Educational Psychology, Texas A & M University.

PY - 2011/5

Y1 - 2011/5

N2 - In this article we propose several modeling choices to extend propensity score analysis to clustered data. We describe different possible model specifications for estimation of the propensity score: single-level model, fixed effects model, and two random effects models. We also consider both conditioning within clusters and conditioning across clusters. We examine the underlying assumptions of these modeling choices and the type of randomized experiment approximated by each approach. Using a simulation study, we compare the relative performance of these modeling and conditioning choices in reducing bias due to confounding variables at both the person and cluster levels. An applied example based on a study by Hughes, Chen, Thoemmes, and Kwok (2010) is provided in which the effect of retention in Grade 1 on passing an achievement test in Grade 3 is evaluated. We find that models that consider the clustered nature of the data both in estimation of the propensity score and conditioning on the propensity score performed best in our simulation study; however, other modeling choices also performed well. The applied example illustrates practical limitations of these models when cluster sizes are small.

AB - In this article we propose several modeling choices to extend propensity score analysis to clustered data. We describe different possible model specifications for estimation of the propensity score: single-level model, fixed effects model, and two random effects models. We also consider both conditioning within clusters and conditioning across clusters. We examine the underlying assumptions of these modeling choices and the type of randomized experiment approximated by each approach. Using a simulation study, we compare the relative performance of these modeling and conditioning choices in reducing bias due to confounding variables at both the person and cluster levels. An applied example based on a study by Hughes, Chen, Thoemmes, and Kwok (2010) is provided in which the effect of retention in Grade 1 on passing an achievement test in Grade 3 is evaluated. We find that models that consider the clustered nature of the data both in estimation of the propensity score and conditioning on the propensity score performed best in our simulation study; however, other modeling choices also performed well. The applied example illustrates practical limitations of these models when cluster sizes are small.

UR - http://www.scopus.com/inward/record.url?scp=79958709433&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79958709433&partnerID=8YFLogxK

U2 - 10.1080/00273171.2011.569395

DO - 10.1080/00273171.2011.569395

M3 - Article

AN - SCOPUS:79958709433

SN - 0027-3171

VL - 46

SP - 514

EP - 543

JO - Multivariate Behavioral Research

JF - Multivariate Behavioral Research

IS - 3

ER -

The use of propensity scores for nonrandomized designs with clustered data

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this