Hierarchical models for cross-classified overdispersed multinomial data

Jeffrey Wilson; Kenneth J. Koehler

doi:10.1080/07350015.1991.10509832

Hierarchical models for cross-classified overdispersed multinomial data

Jeffrey Wilson, Kenneth J. Koehler

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

When a vector of sample proportions is not obtained through a simple random sampling, the covariance matrix for the sample vector can differ substantially from the one corresponding to the multinomial model (Wilson 1989). For example, clustering effects of subject effects in repeated-measure experiments can cause the variance of the observed proportions to be much larger than variances under the multinomial model. The phenomenon is generally referred to as overdispersion. Tallis (1962) proposed a model for identically distributed multinomials with a common measure of correlation and referred to it as the generalized multinomial model. This generalized multinomial model is extended in this article to account for overdispersion by allowing the vectors of proportions to vary according to a Dirichlet distribution. The generalized Dirichlet- multinomial model (as it is referred to here) allows for a second order of pairwise correlation among units, a type of assumption found reasonable in some biological data (Kupper and Haseman 1978) and introduced here to business data. An alternative derivation allowing for two kinds of variation is also considered. Asymptotic normal properties of parameter estimators are used to construct Wald statistics for testing hypotheses. The methods are illustrated with applications to performance evaluation monthly data and an integrated circuit yield analysis.

Original language	English (US)
Pages (from-to)	103-110
Number of pages	8
Journal	Journal of Business and Economic Statistics
Volume	9
Issue number	1
DOIs	https://doi.org/10.1080/07350015.1991.10509832
State	Published - Jan 1991

Keywords

Correlated
Crossed
Dirichlet
Generalized multinomial model
Nested

ASJC Scopus subject areas

Statistics and Probability
Social Sciences (miscellaneous)
Economics and Econometrics
Statistics, Probability and Uncertainty

Access to Document

10.1080/07350015.1991.10509832

Cite this

@article{b77559e88e9b433d83878e647cef1758,

title = "Hierarchical models for cross-classified overdispersed multinomial data",

abstract = "When a vector of sample proportions is not obtained through a simple random sampling, the covariance matrix for the sample vector can differ substantially from the one corresponding to the multinomial model (Wilson 1989). For example, clustering effects of subject effects in repeated-measure experiments can cause the variance of the observed proportions to be much larger than variances under the multinomial model. The phenomenon is generally referred to as overdispersion. Tallis (1962) proposed a model for identically distributed multinomials with a common measure of correlation and referred to it as the generalized multinomial model. This generalized multinomial model is extended in this article to account for overdispersion by allowing the vectors of proportions to vary according to a Dirichlet distribution. The generalized Dirichlet- multinomial model (as it is referred to here) allows for a second order of pairwise correlation among units, a type of assumption found reasonable in some biological data (Kupper and Haseman 1978) and introduced here to business data. An alternative derivation allowing for two kinds of variation is also considered. Asymptotic normal properties of parameter estimators are used to construct Wald statistics for testing hypotheses. The methods are illustrated with applications to performance evaluation monthly data and an integrated circuit yield analysis.",

keywords = "Correlated, Crossed, Dirichlet, Generalized multinomial model, Nested",

author = "Jeffrey Wilson and Koehler, {Kenneth J.}",

note = "Funding Information: This work was partially supported by the Council 100 (1987) College of Business program at Arizona State University. We thank the reviewers for helpful comments on an earlier draft.",

year = "1991",

month = jan,

doi = "10.1080/07350015.1991.10509832",

language = "English (US)",

volume = "9",

pages = "103--110",

journal = "Journal of Business and Economic Statistics",

issn = "0735-0015",

publisher = "American Statistical Association",

number = "1",

}

TY - JOUR

T1 - Hierarchical models for cross-classified overdispersed multinomial data

AU - Wilson, Jeffrey

AU - Koehler, Kenneth J.

N1 - Funding Information: This work was partially supported by the Council 100 (1987) College of Business program at Arizona State University. We thank the reviewers for helpful comments on an earlier draft.

PY - 1991/1

Y1 - 1991/1

N2 - When a vector of sample proportions is not obtained through a simple random sampling, the covariance matrix for the sample vector can differ substantially from the one corresponding to the multinomial model (Wilson 1989). For example, clustering effects of subject effects in repeated-measure experiments can cause the variance of the observed proportions to be much larger than variances under the multinomial model. The phenomenon is generally referred to as overdispersion. Tallis (1962) proposed a model for identically distributed multinomials with a common measure of correlation and referred to it as the generalized multinomial model. This generalized multinomial model is extended in this article to account for overdispersion by allowing the vectors of proportions to vary according to a Dirichlet distribution. The generalized Dirichlet- multinomial model (as it is referred to here) allows for a second order of pairwise correlation among units, a type of assumption found reasonable in some biological data (Kupper and Haseman 1978) and introduced here to business data. An alternative derivation allowing for two kinds of variation is also considered. Asymptotic normal properties of parameter estimators are used to construct Wald statistics for testing hypotheses. The methods are illustrated with applications to performance evaluation monthly data and an integrated circuit yield analysis.

AB - When a vector of sample proportions is not obtained through a simple random sampling, the covariance matrix for the sample vector can differ substantially from the one corresponding to the multinomial model (Wilson 1989). For example, clustering effects of subject effects in repeated-measure experiments can cause the variance of the observed proportions to be much larger than variances under the multinomial model. The phenomenon is generally referred to as overdispersion. Tallis (1962) proposed a model for identically distributed multinomials with a common measure of correlation and referred to it as the generalized multinomial model. This generalized multinomial model is extended in this article to account for overdispersion by allowing the vectors of proportions to vary according to a Dirichlet distribution. The generalized Dirichlet- multinomial model (as it is referred to here) allows for a second order of pairwise correlation among units, a type of assumption found reasonable in some biological data (Kupper and Haseman 1978) and introduced here to business data. An alternative derivation allowing for two kinds of variation is also considered. Asymptotic normal properties of parameter estimators are used to construct Wald statistics for testing hypotheses. The methods are illustrated with applications to performance evaluation monthly data and an integrated circuit yield analysis.

KW - Correlated

KW - Crossed

KW - Dirichlet

KW - Generalized multinomial model

KW - Nested

UR - http://www.scopus.com/inward/record.url?scp=0038842365&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0038842365&partnerID=8YFLogxK

U2 - 10.1080/07350015.1991.10509832

DO - 10.1080/07350015.1991.10509832

M3 - Article

AN - SCOPUS:0038842365

SN - 0735-0015

VL - 9

SP - 103

EP - 110

JO - Journal of Business and Economic Statistics

JF - Journal of Business and Economic Statistics

IS - 1

ER -

Hierarchical models for cross-classified overdispersed multinomial data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this