Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means

Roger L. Berger

doi:10.1080/01621459.1989.10478755

Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means

Roger L. Berger

Arizona State University

Research output: Contribution to journal › Article › peer-review

47 Scopus citations

Abstract

This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test’s power is 1/2α times bigger (10 times bigger for α = .05) than the LRT’s power for some parameter points in a simple example. Specifically, let X = (X₁, …, X_p)′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ₁, …, μ_p)′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H₀: B′_i,μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H₁: b′_iμ > 0 for all i = 1, …, k. Here b₁, …, b_k (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μ_i = v_2i – v_1i, where v_ji is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μ_i are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis. Rejection of H₀ by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true. Sasabuchi (1980) showed that the size-α LRT of H₀ versus H₁ is the test that rejects H₀ if Z_i = b′_iX/(b′_iΣb_i)^1/2 ≥ z_α for all i = 1, …, k, where z_α is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′_iμ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c₀, …, c_J that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H₀ if X ∈ R₁ ∪ ··· ∪ R_J, where R_j = {x: c_j ≤ z_i ≤ c_j_–1, i = 1, …, k} and z_i = b′_ix/(b′_iΣb_i)^1/2 is the LRT statistic. The set R₁ is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b,′_iΣb_m ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H₁ hypotheses, except the simple tree, if Σ is diagonal. Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test’s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest. All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and z_α is replaced by t_α, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test. But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ becomes large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT. A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H₁ is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Z_i ≥ c for all i = 1, …, k or Z_i ≤ – c for all i = 1, …, k, Sasabuchi gave some conditions under which c = z_α gives a size-α test. We consider only the special case in which H₁ is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c₀, …, c_2J, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R₁ ∪ ··· ∪ R_2J, where R_j = {x: c_j ≤ x_i/σ_i ≤ c_j_–1, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.

Original language	English (US)
Pages (from-to)	192-199
Number of pages	8
Journal	Journal of the American Statistical Association
Volume	84
Issue number	405
DOIs	https://doi.org/10.1080/01621459.1989.10478755
State	Published - Mar 1989

Keywords

Likelihood ratio test
Majorization
Polyhedral cone
Qualitative interaction

ASJC Scopus subject areas

Statistics and Probability
Statistics, Probability and Uncertainty

Access to Document

10.1080/01621459.1989.10478755

Cite this

@article{db225ee78511463d93034c6a6ad1efe5,

title = "Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means",

abstract = "This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test{\textquoteright}s power is 1/2α times bigger (10 times bigger for α = .05) than the LRT{\textquoteright}s power for some parameter points in a simple example. Specifically, let X = (X1, …, Xp)′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ1, …, μp)′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H0: B′i,μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H1: b′iμ > 0 for all i = 1, …, k. Here b1, …, bk (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μi = v2i – v1i, where vji is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μi are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis. Rejection of H0 by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true. Sasabuchi (1980) showed that the size-α LRT of H0 versus H1 is the test that rejects H0 if Zi = b′iX/(b′iΣbi)1/2 ≥ zα for all i = 1, …, k, where zα is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′iμ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c0, …, cJ that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H0 if X ∈ R1 ∪ ··· ∪ RJ, where Rj = {x: cj ≤ zi ≤ cj–1, i = 1, …, k} and zi = b′ix/(b′iΣbi)1/2 is the LRT statistic. The set R1 is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b,′iΣbm ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H1 hypotheses, except the simple tree, if Σ is diagonal. Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test{\textquoteright}s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest. All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and zα is replaced by tα, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test. But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ becomes large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT. A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H1 is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Zi ≥ c for all i = 1, …, k or Zi ≤ – c for all i = 1, …, k, Sasabuchi gave some conditions under which c = zα gives a size-α test. We consider only the special case in which H1 is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c0, …, c2J, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R1 ∪ ··· ∪ R2J, where Rj = {x: cj ≤ xi/σi ≤ cj–1, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.",

keywords = "Likelihood ratio test, Majorization, Polyhedral cone, Qualitative interaction",

author = "Berger, {Roger L.}",

year = "1989",

month = mar,

doi = "10.1080/01621459.1989.10478755",

language = "English (US)",

volume = "84",

pages = "192--199",

journal = "Journal of the American Statistical Association",

issn = "0162-1459",

publisher = "Taylor and Francis Ltd.",

number = "405",

}

TY - JOUR

T1 - Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means

AU - Berger, Roger L.

PY - 1989/3

Y1 - 1989/3

N2 - This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test’s power is 1/2α times bigger (10 times bigger for α = .05) than the LRT’s power for some parameter points in a simple example. Specifically, let X = (X1, …, Xp)′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ1, …, μp)′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H0: B′i,μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H1: b′iμ > 0 for all i = 1, …, k. Here b1, …, bk (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μi = v2i – v1i, where vji is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μi are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis. Rejection of H0 by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true. Sasabuchi (1980) showed that the size-α LRT of H0 versus H1 is the test that rejects H0 if Zi = b′iX/(b′iΣbi)1/2 ≥ zα for all i = 1, …, k, where zα is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′iμ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c0, …, cJ that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H0 if X ∈ R1 ∪ ··· ∪ RJ, where Rj = {x: cj ≤ zi ≤ cj–1, i = 1, …, k} and zi = b′ix/(b′iΣbi)1/2 is the LRT statistic. The set R1 is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b,′iΣbm ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H1 hypotheses, except the simple tree, if Σ is diagonal. Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test’s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest. All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and zα is replaced by tα, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test. But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ becomes large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT. A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H1 is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Zi ≥ c for all i = 1, …, k or Zi ≤ – c for all i = 1, …, k, Sasabuchi gave some conditions under which c = zα gives a size-α test. We consider only the special case in which H1 is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c0, …, c2J, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R1 ∪ ··· ∪ R2J, where Rj = {x: cj ≤ xi/σi ≤ cj–1, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.

AB - This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test’s power is 1/2α times bigger (10 times bigger for α = .05) than the LRT’s power for some parameter points in a simple example. Specifically, let X = (X1, …, Xp)′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ1, …, μp)′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H0: B′i,μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H1: b′iμ > 0 for all i = 1, …, k. Here b1, …, bk (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μi = v2i – v1i, where vji is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μi are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis. Rejection of H0 by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true. Sasabuchi (1980) showed that the size-α LRT of H0 versus H1 is the test that rejects H0 if Zi = b′iX/(b′iΣbi)1/2 ≥ zα for all i = 1, …, k, where zα is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′iμ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c0, …, cJ that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H0 if X ∈ R1 ∪ ··· ∪ RJ, where Rj = {x: cj ≤ zi ≤ cj–1, i = 1, …, k} and zi = b′ix/(b′iΣbi)1/2 is the LRT statistic. The set R1 is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b,′iΣbm ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H1 hypotheses, except the simple tree, if Σ is diagonal. Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test’s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest. All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and zα is replaced by tα, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test. But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ becomes large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT. A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H1 is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Zi ≥ c for all i = 1, …, k or Zi ≤ – c for all i = 1, …, k, Sasabuchi gave some conditions under which c = zα gives a size-α test. We consider only the special case in which H1 is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c0, …, c2J, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R1 ∪ ··· ∪ R2J, where Rj = {x: cj ≤ xi/σi ≤ cj–1, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.

KW - Likelihood ratio test

KW - Majorization

KW - Polyhedral cone

KW - Qualitative interaction

UR - http://www.scopus.com/inward/record.url?scp=0001474912&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0001474912&partnerID=8YFLogxK

U2 - 10.1080/01621459.1989.10478755

DO - 10.1080/01621459.1989.10478755

M3 - Article

AN - SCOPUS:0001474912

SN - 0162-1459

VL - 84

SP - 192

EP - 199

JO - Journal of the American Statistical Association

JF - Journal of the American Statistical Association

IS - 405

ER -

Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this