Clustered data with small sample sizes: Comparing the performance of model-based and design-based approaches

Daniel M. McNeish, Jeffery R. Harring

Research output: Contribution to journalArticlepeer-review

51 Scopus citations

Abstract

Two classes of methods properly account for clustering of data: design-based methods and model-based methods. Estimates from both methods have been shown to be approximately equal with large samples. However, both classes are known to produce biased standard error estimates with small samples. This paper compares the bias of standard errors and statistical power of marginal effects for generalized estimating equations (a design-based method) and generalized/linear mixed effects models (model-based methods) with small sample sizes via a simulation study. Provided that the distributional assumptions are met, model-based methods produced the least-biased standard error estimates and greater relative statistical power.

Original languageEnglish (US)
Pages (from-to)855-869
Number of pages15
JournalCommunications in Statistics: Simulation and Computation
Volume46
Issue number2
DOIs
StatePublished - Feb 7 2017
Externally publishedYes

Keywords

  • GEE
  • Kenward-Roger
  • Mixed model
  • Multilevel model
  • Small sample size

ASJC Scopus subject areas

  • Statistics and Probability
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Clustered data with small sample sizes: Comparing the performance of model-based and design-based approaches'. Together they form a unique fingerprint.

Cite this