A multivariate procedure for identifying correlations between diagnoses and over-the-counter products from historical datasets.

Ran Li, Garrick L. Wallstrom, William R. Hogan

Research output: Contribution to journal, Article

5 Citations (Scopus)

Abstract

A general problem in biosurveillance is finding the optimal aggregates of more basic data to monitor for the detection of disease outbreaks. We developed a multivariate procedure for identifying the set of over-the-counter (OTC) healthcare products that correlates best with a set of diagnoses. To ensure that the procedure produces results that agree with clinical knowledge of diseases and OTC products, we applied it to a set of products and a set of diagnoses for which the correlation was known to be high. Our hypothesis was that the model could achieve parsimony in the set of diagnoses that correlate with sales of pediatric electrolytes while still producing a high correlation. The procedure narrowed the set of diagnoses that correlate with pediatric electrolyte sales from 51 to 8. The correlation with electrolyte sales was 0.95 for the full set of 51 diagnoses and 0.96 for the reduced set of 8. We conclude that the procedure functions as intended and is suitable for further testing on other problems of finding optimal aggregates of OTC products, and more generally of other types of biosurveillance data, to monitor for the detection of various disease outbreaks.
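The abstract does not spell out the procedure itself, so the following is only an illustrative sketch of the general idea: aggregate daily counts over a set of diagnoses, correlate the aggregate with a sales series, and narrow the set while the correlation stays high. The greedy backward elimination, the function names (`pearson`, `aggregate`, `narrow`), and the toy numbers are all assumptions made for illustration; they are not the authors' method or data.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def aggregate(counts, names):
    """Sum the daily counts over the selected diagnoses."""
    return [sum(day) for day in zip(*(counts[n] for n in names))]

def narrow(counts, sales, min_size=1):
    """Greedy backward elimination (an assumed stand-in technique).

    At each step, drop the diagnosis whose removal yields the highest
    correlation; stop when even the best removal would lower it.
    """
    selected = list(counts)
    best = pearson(aggregate(counts, selected), sales)
    while len(selected) > min_size:
        candidates = []
        for name in selected:
            rest = [n for n in selected if n != name]
            candidates.append((pearson(aggregate(counts, rest), sales), name))
        r, drop = max(candidates)
        if r < best:
            break
        best = r
        selected.remove(drop)
    return selected, best

# Hypothetical toy data: daily visit counts per diagnosis and daily unit sales.
counts = {
    "gastroenteritis": [2, 4, 6, 8, 6, 4],
    "dehydration": [1, 2, 3, 5, 3, 2],
    "fracture": [5, 1, 4, 2, 5, 1],
}
sales = [30, 60, 90, 120, 90, 60]

selected, best = narrow(counts, sales)
print(selected, round(best, 2))
```

A greedy search like this is cheap but can miss the best subset; an exhaustive or regression-based selection would be a reasonable alternative under the same framing.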

Original language: English (US)
Pages (from-to): 450-454
Number of pages: 5
Journal: AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
State: Published - 2005
Externally published: Yes

Fingerprint

Electrolytes
Biosurveillance
Disease Outbreaks
Pediatrics
Datasets
Delivery of Health Care

ASJC Scopus subject areas

  • Medicine (all)

Cite this

@article{955612f4e3e64fd690e30100c58753b5,
title = "A multivariate procedure for identifying correlations between diagnoses and over-the-counter products from historical datasets",
abstract = "A general problem in biosurveillance is finding the optimal aggregates of more basic data to monitor for the detection of disease outbreaks. We developed a multivariate procedure for identifying the set of over-the-counter (OTC) healthcare products that correlates best with a set of diagnoses. To ensure that the procedure produces results that agree with clinical knowledge of diseases and OTC products, we applied it to a set of products and a set of diagnoses for which the correlation was known to be high. Our hypothesis was that the model could achieve parsimony in the set of diagnoses that correlate with sales of pediatric electrolytes while still producing a high correlation. The procedure narrowed the set of diagnoses that correlate with pediatric electrolyte sales from 51 to 8. The correlation with electrolyte sales was 0.95 for the full set of 51 diagnoses and 0.96 for the reduced set of 8. We conclude that the procedure functions as intended and is suitable for further testing on other problems of finding optimal aggregates of OTC products, and more generally of other types of biosurveillance data, to monitor for the detection of various disease outbreaks.",
author = "Li, Ran and Wallstrom, {Garrick L.} and Hogan, {William R.}",
year = "2005",
language = "English (US)",
pages = "450--454",
journal = "AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium",
issn = "1559-4076",
publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - A multivariate procedure for identifying correlations between diagnoses and over-the-counter products from historical datasets.

AU - Li, Ran

AU - Wallstrom, Garrick L.

AU - Hogan, William R.

PY - 2005

Y1 - 2005

N2 - A general problem in biosurveillance is finding the optimal aggregates of more basic data to monitor for the detection of disease outbreaks. We developed a multivariate procedure for identifying the set of over-the-counter (OTC) healthcare products that correlates best with a set of diagnoses. To ensure that the procedure produces results that agree with clinical knowledge of diseases and OTC products, we applied it to a set of products and a set of diagnoses for which the correlation was known to be high. Our hypothesis was that the model could achieve parsimony in the set of diagnoses that correlate with sales of pediatric electrolytes while still producing a high correlation. The procedure narrowed the set of diagnoses that correlate with pediatric electrolyte sales from 51 to 8. The correlation with electrolyte sales was 0.95 for the full set of 51 diagnoses and 0.96 for the reduced set of 8. We conclude that the procedure functions as intended and is suitable for further testing on other problems of finding optimal aggregates of OTC products, and more generally of other types of biosurveillance data, to monitor for the detection of various disease outbreaks.

UR - http://www.scopus.com/inward/record.url?scp=39049175698&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=39049175698&partnerID=8YFLogxK

M3 - Article

C2 - 16779080

AN - SCOPUS:39049175698

SP - 450

EP - 454

JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

SN - 1559-4076

ER -