Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients

Sai T. Moturu; Huan Liu; William Johnson

doi:10.1007/978-3-540-92219-3_37

Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients

Sai T. Moturu, Huan Liu, William Johnson

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Rapidly rising healthcare costs represent one of the major issues plaguing the healthcare system. Data from the Arizona Health Care Cost Containment System, Arizona's Medicaid program provide a unique opportunity to exploit state-of-the-art machine learning and data mining algorithms to analyze data and provide actionable findings that can aid cost containment. Our work addresses specific challenges in this real-life healthcare application with respect to data imbalance in the process of building predictive risk models for forecasting high-cost patients. We survey the literature and propose novel data mining approaches customized for this compelling application with specific focus on non-random sampling. Our empirical study indicates that the proposed approach is highly effective and can benefit further research on cost containment in the healthcare industry.

Original language	English (US)
Title of host publication	Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers
Pages	493-506
Number of pages	14
DOIs	https://doi.org/10.1007/978-3-540-92219-3_37
State	Published - 2008
Event	1st International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2008 - Funchal, Madeira, Portugal Duration: Jan 28 2008 → Jan 31 2008

Publication series

Name	Communications in Computer and Information Science
Volume	25 CCIS
ISSN (Print)	1865-0929

Other

Other	1st International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2008
Country/Territory	Portugal
City	Funchal, Madeira
Period	1/28/08 → 1/31/08

Keywords

Medicaid
Predictive risk modeling
data mining
future high-cost patients
health care expenditures
imbalanced data classification
non-random sampling
risk adjustment
skewed data

ASJC Scopus subject areas

General Computer Science
General Mathematics

Access to Document

10.1007/978-3-540-92219-3_37

Cite this

Moturu, S. T., Liu, H., & Johnson, W. (2008). Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients. In Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers (pp. 493-506). (Communications in Computer and Information Science; Vol. 25 CCIS). https://doi.org/10.1007/978-3-540-92219-3_37

Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients. / Moturu, Sai T.; Liu, Huan; Johnson, William.
Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers. 2008. p. 493-506 (Communications in Computer and Information Science; Vol. 25 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Moturu, ST, Liu, H & Johnson, W 2008, Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients. in Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers. Communications in Computer and Information Science, vol. 25 CCIS, pp. 493-506, 1st International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2008, Funchal, Madeira, Portugal, 1/28/08. https://doi.org/10.1007/978-3-540-92219-3_37

Moturu ST, Liu H, Johnson W. Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients. In Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers. 2008. p. 493-506. (Communications in Computer and Information Science). doi: 10.1007/978-3-540-92219-3_37

@inproceedings{5d2a0da4e4cb43cb83e5d040e5db0112,

title = "Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients",

abstract = "Rapidly rising healthcare costs represent one of the major issues plaguing the healthcare system. Data from the Arizona Health Care Cost Containment System, Arizona's Medicaid program provide a unique opportunity to exploit state-of-the-art machine learning and data mining algorithms to analyze data and provide actionable findings that can aid cost containment. Our work addresses specific challenges in this real-life healthcare application with respect to data imbalance in the process of building predictive risk models for forecasting high-cost patients. We survey the literature and propose novel data mining approaches customized for this compelling application with specific focus on non-random sampling. Our empirical study indicates that the proposed approach is highly effective and can benefit further research on cost containment in the healthcare industry.",

keywords = "Medicaid, Predictive risk modeling, data mining, future high-cost patients, health care expenditures, imbalanced data classification, non-random sampling, risk adjustment, skewed data",

author = "Moturu, {Sai T.} and Huan Liu and William Johnson",

year = "2008",

doi = "10.1007/978-3-540-92219-3_37",

language = "English (US)",

isbn = "3540922180",

series = "Communications in Computer and Information Science",

pages = "493--506",

booktitle = "Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers",

note = "1st International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2008 ; Conference date: 28-01-2008 Through 31-01-2008",

}

TY - GEN

T1 - Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients

AU - Moturu, Sai T.

AU - Liu, Huan

AU - Johnson, William

PY - 2008

Y1 - 2008

N2 - Rapidly rising healthcare costs represent one of the major issues plaguing the healthcare system. Data from the Arizona Health Care Cost Containment System, Arizona's Medicaid program provide a unique opportunity to exploit state-of-the-art machine learning and data mining algorithms to analyze data and provide actionable findings that can aid cost containment. Our work addresses specific challenges in this real-life healthcare application with respect to data imbalance in the process of building predictive risk models for forecasting high-cost patients. We survey the literature and propose novel data mining approaches customized for this compelling application with specific focus on non-random sampling. Our empirical study indicates that the proposed approach is highly effective and can benefit further research on cost containment in the healthcare industry.

AB - Rapidly rising healthcare costs represent one of the major issues plaguing the healthcare system. Data from the Arizona Health Care Cost Containment System, Arizona's Medicaid program provide a unique opportunity to exploit state-of-the-art machine learning and data mining algorithms to analyze data and provide actionable findings that can aid cost containment. Our work addresses specific challenges in this real-life healthcare application with respect to data imbalance in the process of building predictive risk models for forecasting high-cost patients. We survey the literature and propose novel data mining approaches customized for this compelling application with specific focus on non-random sampling. Our empirical study indicates that the proposed approach is highly effective and can benefit further research on cost containment in the healthcare industry.

KW - Medicaid

KW - Predictive risk modeling

KW - data mining

KW - future high-cost patients

KW - health care expenditures

KW - imbalanced data classification

KW - non-random sampling

KW - risk adjustment

KW - skewed data

UR - http://www.scopus.com/inward/record.url?scp=78049402991&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78049402991&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-92219-3_37

DO - 10.1007/978-3-540-92219-3_37

M3 - Conference contribution

AN - SCOPUS:78049402991

SN - 3540922180

SN - 9783540922186

T3 - Communications in Computer and Information Science

SP - 493

EP - 506

BT - Biomedical Engineering Systems and Technologies - International Joint Conference, BIOSTEC 2008, Revised Selected Papers

T2 - 1st International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2008

Y2 - 28 January 2008 through 31 January 2008

ER -

Understanding the effects of sampling on healthcare risk modeling for the prediction of future high-cost patients

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this