The intermediate endpoint effect in logistic and probit regression

David Mackinnon, C. M. Lockwood, C. H. Brown, W. Wang, J. M. Hoffman

Research output: Contribution to journalArticle

130 Citations (Scopus)

Abstract

Background: An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose: The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods: The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results: Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations: More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions: Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect.

Original languageEnglish (US)
Pages (from-to)499-513
Number of pages15
JournalClinical Trials
Volume4
Issue number5
DOIs
StatePublished - 2007

Fingerprint

Logistic Models
Biomarkers
Research Personnel

ASJC Scopus subject areas

  • Medicine(all)

Cite this

The intermediate endpoint effect in logistic and probit regression. / Mackinnon, David; Lockwood, C. M.; Brown, C. H.; Wang, W.; Hoffman, J. M.

In: Clinical Trials, Vol. 4, No. 5, 2007, p. 499-513.

Research output: Contribution to journalArticle

Mackinnon, D, Lockwood, CM, Brown, CH, Wang, W & Hoffman, JM 2007, 'The intermediate endpoint effect in logistic and probit regression', Clinical Trials, vol. 4, no. 5, pp. 499-513. https://doi.org/10.1177/1740774507083434
Mackinnon, David ; Lockwood, C. M. ; Brown, C. H. ; Wang, W. ; Hoffman, J. M. / The intermediate endpoint effect in logistic and probit regression. In: Clinical Trials. 2007 ; Vol. 4, No. 5. pp. 499-513.
@article{a2d0b2d51fe24ba4a754b73379480e1c,
title = "The intermediate endpoint effect in logistic and probit regression",
abstract = "Background: An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose: The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods: The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results: Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations: More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions: Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect.",
author = "David Mackinnon and Lockwood, {C. M.} and Brown, {C. H.} and W. Wang and Hoffman, {J. M.}",
year = "2007",
doi = "10.1177/1740774507083434",
language = "English (US)",
volume = "4",
pages = "499--513",
journal = "Clinical Trials",
issn = "1740-7745",
publisher = "SAGE Publications Ltd",
number = "5",

}

TY - JOUR

T1 - The intermediate endpoint effect in logistic and probit regression

AU - Mackinnon, David

AU - Lockwood, C. M.

AU - Brown, C. H.

AU - Wang, W.

AU - Hoffman, J. M.

PY - 2007

Y1 - 2007

N2 - Background: An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose: The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods: The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results: Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations: More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions: Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect.

AB - Background: An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose: The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods: The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results: Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations: More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions: Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect.

UR - http://www.scopus.com/inward/record.url?scp=36549038047&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36549038047&partnerID=8YFLogxK

U2 - 10.1177/1740774507083434

DO - 10.1177/1740774507083434

M3 - Article

C2 - 17942466

AN - SCOPUS:36549038047

VL - 4

SP - 499

EP - 513

JO - Clinical Trials

JF - Clinical Trials

SN - 1740-7745

IS - 5

ER -