Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means

Roger L. Berger

Research output: Contribution to journalArticlepeer-review

46 Scopus citations


This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test’s power is 1/2α times bigger (10 times bigger for α = .05) than the LRT’s power for some parameter points in a simple example. Specifically, let X = (X1, …, Xp)′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ1, …, μp)′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H0: B′i,μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H1: b′iμ > 0 for all i = 1, …, k. Here b1, …, bk (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μi = v2i – v1i, where vji is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μi are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis. Rejection of H0 by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true. Sasabuchi (1980) showed that the size-α LRT of H0 versus H1 is the test that rejects H0 if Zi = b′iX/(b′iΣbi)1/2 ≥ zα for all i = 1, …, k, where zα is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′iμ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c0, …, cJ that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H0 if X ∈ R1 ∪ ··· ∪ RJ, where Rj = {x: cj ≤ zi ≤ cj–1, i = 1, …, k} and zi = b′ix/(b′iΣbi)1/2 is the LRT statistic. The set R1 is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b,′iΣbm ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H1 hypotheses, except the simple tree, if Σ is diagonal. Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test’s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest. All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and zα is replaced by tα, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test. But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ becomes large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT. A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H1 is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Zi ≥ c for all i = 1, …, k or Zi ≤ – c for all i = 1, …, k, Sasabuchi gave some conditions under which c = zα gives a size-α test. We consider only the special case in which H1 is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c0, …, c2J, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R1 ∪ ··· ∪ R2J, where Rj = {x: cj ≤ xii ≤ cj–1, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.

Original languageEnglish (US)
Pages (from-to)192-199
Number of pages8
JournalJournal of the American Statistical Association
Issue number405
StatePublished - Mar 1989


  • Likelihood ratio test
  • Majorization
  • Polyhedral cone
  • Qualitative interaction

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint Dive into the research topics of 'Uniformly more powerful tests for hypotheses concerning linear inequalities and normal means'. Together they form a unique fingerprint.

Cite this