## Abstract

This article considers some hypothesis-testing problems regarding normal means. In these problems, the hypotheses are defined by linear inequalities on the means. We show that in certain problems the likelihood ratio test (LRT) is not very powerful. We describe a test that has the same size, α, as the LRT and is uniformly more powerful. The test is easily implemented, since its critical values are standard normal percentiles. The increase in power with the new test can be substantial. For example, the new test’s power is 1/(2α) times larger (10 times larger for α = .05) than the LRT’s power for some parameter points in a simple example.

Specifically, let X = (X_{1}, …, X_{p})′ (p ≥ 2) be a multivariate normal random vector with unknown mean μ = (μ_{1}, …, μ_{p})′ and known, nonsingular covariance matrix Σ. We consider testing the null hypothesis H_{0}: b′_{i}μ ≤ 0 for some i = 1, …, k versus the alternative hypothesis H_{1}: b′_{i}μ > 0 for all i = 1, …, k. Here b_{1}, …, b_{k} (k ≥ 2) are specified p-dimensional vectors that define the hypotheses. Many types of relationships among the means may be described with the linear inequalities. Two interesting types are those that specify the signs of the means and those that describe an order relationship. Some examples of alternative hypotheses that can be specified in this way are these: (Equation presented) (sign testing), (Equation presented) (simple order), (Equation presented) (simple loop), and (Equation presented) (simple tree). If μ_{i} = v_{2i} – v_{1i}, where v_{ji} is the average response of the ith patient subset to the jth treatment, then (Equation presented) states that Treatment 2 is better than Treatment 1 for all subsets. If the μ_{i} are regression coefficients, then (Equation presented) states that the mean response increases with each independent variable. In any case, these relationships would be the alternative hypothesis.
Rejection of H_{0} by a test with small size would be taken as strong evidence confirming that the specified sign or order relationship is true.

Sasabuchi (1980) showed that the size-α LRT of H_{0} versus H_{1} is the test that rejects H_{0} if Z_{i} = b′_{i}X/(b′_{i}Σb_{i})^{1/2} ≥ z_{α} for all i = 1, …, k, where z_{α} is the upper 100α percentile of a standard normal distribution. This test is biased and has very low power if all of the values b′_{i}μ (i = 1, …, k) are only slightly bigger than 0. We define an integer J and constants c_{0}, …, c_{J} that are certain standard normal percentiles. We show that, in many cases, a size-α test that is uniformly more powerful than the LRT is the test that rejects H_{0} if X ∈ R_{1} ∪ ··· ∪ R_{J}, where R_{j} = {x: c_{j} ≤ z_{i} ≤ c_{j–1}, i = 1, …, k} and z_{i} = b′_{i}x/(b′_{i}Σb_{i})^{1/2} is the LRT statistic. The set R_{1} is the rejection region of the LRT, so this test is obviously more powerful than the LRT. But we show that if, for each i = 1, …, k, there exists an m ≠ i such that b′_{i}Σb_{m} ≤ 0, then this test is also a size-α test. It is easy to verify that this condition is satisfied, for example, for all of the aforementioned H_{1} hypotheses, except the simple tree, if Σ is diagonal.

Tests that are even more powerful than those just described might exist. We discuss an example of such a test. But despite this test’s superior power properties, it has some counterintuitive properties. Thus tests such as in this example may be primarily of theoretical interest.

All of the previously mentioned results are derived in the Σ-known case. Sasabuchi (1980) showed that, if Σ is unknown, the LRT is very similar. The differences are that Σ is replaced by an estimate and z_{α} is replaced by t_{α}, a t-distribution percentile. We show, in an example, that making the same modifications to this test does not give a size-α test.
But in the example the size of the test converges to α quickly as the degrees of freedom for the estimate of Σ become large. So even for moderate degrees of freedom (≥ 10), this test might be preferable to the LRT, since its size is approximately α and it is much more powerful than the LRT.

A two-sided version of this problem is obtained if we test (Equation presented) versus (Equation presented), where H_{1} is a one-sided alternative as defined above. Sasabuchi (1980) showed that the LRT rejects (Equation presented) if Z_{i} ≥ c for all i = 1, …, k or Z_{i} ≤ –c for all i = 1, …, k. Sasabuchi gave some conditions under which c = z_{α} gives a size-α test. We consider only the special case in which H_{1} is the sign-testing alternative and (Equation presented), a diagonal matrix. For constants c_{0}, …, c_{2J}, similar to those above, we show that the test that rejects (Equation presented) if X ∈ R_{1} ∪ ··· ∪ R_{2J}, where R_{j} = {x: c_{j} ≤ x_{i}/σ_{i} ≤ c_{j–1}, i = 1, …, p}, is a size-α test that is uniformly more powerful than the LRT. For the special case of p = 2, this provides a test that is uniformly more powerful than a test proposed by Gail and Simon (1985) for testing for a qualitative interaction.
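The one-sided rejection rules described above can be made concrete with a small sketch. The abstract does not state J or the constants c_{0}, …, c_{J}, so the code below assumes one choice that is consistent with the description: J is the smallest integer with J ≥ 1/(2α), c_{0} = ∞, c_{j} = z_{jα} for 0 < j < J, and c_{J} = 0. All function names are hypothetical, and the power calculation further assumes the independent case (Σ = I with b_{i} the ith unit vector, so that Z_{i} ~ N(θ_{i}, 1) independently).

```python
from math import ceil, inf, sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, N(0, 1)


def upper_percentile(a):
    """Return z_a, the point with upper-tail probability a under N(0, 1)."""
    return STD_NORMAL.inv_cdf(1.0 - a)


def cutoffs(alpha):
    """ASSUMED constants (not given in the abstract):
    c_0 = inf, c_j = z_{j*alpha} for 0 < j < J, c_J = 0,
    with J the smallest integer satisfying J >= 1/(2*alpha)."""
    J = ceil(1.0 / (2.0 * alpha) - 1e-9)  # small guard against float round-off
    return [inf] + [upper_percentile(j * alpha) for j in range(1, J)] + [0.0]


def z_statistics(x, B, Sigma):
    """z_i = b_i'x / sqrt(b_i' Sigma b_i) for each vector b_i in B."""
    zs = []
    for b in B:
        num = sum(bj * xj for bj, xj in zip(b, x))
        n = len(b)
        var = sum(b[r] * Sigma[r][s] * b[s] for r in range(n) for s in range(n))
        zs.append(num / sqrt(var))
    return zs


def lrt_rejects(zs, alpha):
    """LRT: reject H_0 when every z_i is at least z_alpha."""
    return min(zs) >= upper_percentile(alpha)


def new_test_rejects(zs, alpha):
    """Reject H_0 when all z_i fall in one interval [c_j, c_{j-1}]."""
    c = cutoffs(alpha)
    lo, hi = min(zs), max(zs)
    return any(c[j] <= lo and hi <= c[j - 1] for j in range(1, len(c)))


def box_prob(lo, hi, thetas):
    """P(lo <= Z_i <= hi for all i), Z_i ~ N(theta_i, 1) independent."""
    p = 1.0
    for t in thetas:
        p *= STD_NORMAL.cdf(hi - t) - STD_NORMAL.cdf(lo - t)
    return p


def power_lrt(thetas, alpha):
    """Rejection probability of the LRT in the independent case."""
    return box_prob(upper_percentile(alpha), inf, thetas)


def power_new(thetas, alpha):
    """Rejection probability of the union R_1, ..., R_J (independent case).
    The regions overlap only on boundaries, so probabilities add."""
    c = cutoffs(alpha)
    return sum(box_prob(c[j], c[j - 1], thetas) for j in range(1, len(c)))


if __name__ == "__main__":
    alpha = 0.05
    identity = [[1.0, 0.0], [0.0, 1.0]]
    zs = z_statistics([1.1, 1.2], identity, identity)
    print("LRT rejects:", lrt_rejects(zs, alpha))
    print("new test rejects:", new_test_rejects(zs, alpha))
    ratio = power_new([0.0, 0.0], alpha) / power_lrt([0.0, 0.0], alpha)
    print("rejection-probability ratio at theta = (0, 0):", ratio)
```

Under these assumptions, with k = 2 and α = .05, the ratio of rejection probabilities at θ = (0, 0) comes out at about 10, in line with the 1/(2α) figure quoted above. The first region, [c_{1}, c_{0}] = [z_{α}, ∞), reproduces the LRT, so the new rule rejects whenever the LRT does.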

Original language | English (US) |
---|---|
Pages (from-to) | 192-199 |
Number of pages | 8 |
Journal | Journal of the American Statistical Association |
Volume | 84 |
Issue number | 405 |
State | Published - Mar 1989 |

## Keywords

- Likelihood ratio test
- Majorization
- Polyhedral cone
- Qualitative interaction

## ASJC Scopus subject areas

- Statistics and Probability
- Statistics, Probability and Uncertainty