Balancing Performance Measures in Classification Using Ensemble Learning Methods

Neeraj Bahl; Ajay Bansal

doi:10.1007/978-3-030-20482-2_25

Software Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

Ensemble learning methods have recently been widely used in various domains and applications owing to improvements in computational efficiency and advances in distributed computing. However, with the advent of a wide variety of applications of machine learning techniques to class imbalance problems, further focus is needed to evaluate, improve and balance other performance measures such as sensitivity (true positive rate) and specificity (true negative rate) in classification. This paper demonstrates an approach to evaluate and balance the performance measures (specifically sensitivity and specificity) using ensemble learning methods for classification that can be especially useful for class-imbalanced datasets. In this paper, ensemble learning methods (specifically bagging and boosting) are used to balance the performance measures (sensitivity and specificity) on a diabetes dataset to predict whether a patient will be readmitted to the hospital based on various feature vectors. From the experiments conducted, it can be empirically concluded that, by using ensemble learning methods, although accuracy does improve by some margin, both sensitivity and specificity are balanced significantly and consistently across different cross-validation approaches.
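
The two measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' code or data: the label vectors below are hypothetical, and the helper names `sensitivity_specificity` and `accuracy` are assumptions introduced only for this example. It shows why accuracy alone can mislead on an imbalanced dataset, where a classifier that always predicts the majority class scores high accuracy while its sensitivity is zero.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, with 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    # Guard against a class being absent from y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true label."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


# Hypothetical imbalanced labels: 10 positives followed by 90 negatives.
y_true = [1] * 10 + [0] * 90

# Hypothetical classifier A: always predicts the majority (negative) class.
always_negative = [0] * 100

# Hypothetical classifier B: finds 8 of 10 positives and 72 of 90 negatives.
balanced = [1] * 8 + [0] * 2 + [0] * 72 + [1] * 18

for name, y_pred in [("always_negative", always_negative), ("balanced", balanced)]:
    sens, spec = sensitivity_specificity(y_true, y_pred)
    print(f"{name}: accuracy={accuracy(y_true, y_pred):.2f} "
          f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

In this made-up case classifier A has the higher accuracy (0.90 versus 0.80) yet never detects a positive, whereas classifier B has sensitivity and specificity both at 0.80, which is the kind of balance the abstract refers to.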

Original language	English (US)
Title of host publication	Business Information Systems - 22nd International Conference, BIS 2019, Proceedings
Editors	Witold Abramowicz, Rafael Corchuelo
Publisher	Springer Verlag
Pages	311-324
Number of pages	14
ISBN (Print)	9783030204815
DOIs	https://doi.org/10.1007/978-3-030-20482-2_25
State	Published - Jan 1 2019
Event	22nd International Conference on Business Information Systems, BIS 2019, Seville, Spain (Jun 26 2019 → Jun 28 2019)

Publication series

Name	Lecture Notes in Business Information Processing
Volume	354
ISSN (Print)	1865-1348

Conference

Conference	22nd International Conference on Business Information Systems, BIS 2019
Country/Territory	Spain
City	Seville
Period	6/26/19 → 6/28/19

Keywords

Balancing
Boosting
Classification
Ensemble methods

ASJC Scopus subject areas

Management Information Systems
Control and Systems Engineering
Business and International Management
Information Systems
Modeling and Simulation
Information Systems and Management

Access to Document

10.1007/978-3-030-20482-2_25

Cite this

Bahl, N., & Bansal, A. (2019). Balancing Performance Measures in Classification Using Ensemble Learning Methods. In W. Abramowicz, & R. Corchuelo (Eds.), Business Information Systems - 22nd International Conference, BIS 2019, Proceedings (pp. 311-324). (Lecture Notes in Business Information Processing; Vol. 354). Springer Verlag. https://doi.org/10.1007/978-3-030-20482-2_25

Balancing Performance Measures in Classification Using Ensemble Learning Methods. / Bahl, Neeraj; Bansal, Ajay.
Business Information Systems - 22nd International Conference, BIS 2019, Proceedings. ed. / Witold Abramowicz; Rafael Corchuelo. Springer Verlag, 2019. p. 311-324 (Lecture Notes in Business Information Processing; Vol. 354).

Bahl, N & Bansal, A 2019, Balancing Performance Measures in Classification Using Ensemble Learning Methods. in W Abramowicz & R Corchuelo (eds), Business Information Systems - 22nd International Conference, BIS 2019, Proceedings. Lecture Notes in Business Information Processing, vol. 354, Springer Verlag, pp. 311-324, 22nd International Conference on Business Information Systems, BIS 2019, Seville, Spain, 6/26/19. https://doi.org/10.1007/978-3-030-20482-2_25

@inproceedings{752d1aced2a94a7c9ca2344b280fa5b9,

title = "Balancing Performance Measures in Classification Using Ensemble Learning Methods",

abstract = "Ensemble learning methods have recently been widely used in various domains and applications owing to the improvements in computational efficiency and distributed computing advances. However, with the advent of wide variety of applications of machine learning techniques to class imbalance problems, further focus is needed to evaluate, improve and balance other performance measures such as sensitivity (true positive rate) and specificity (true negative rate) in classification. This paper demonstrates an approach to evaluate and balance the performance measures (specifically sensitivity and specificity) using ensemble learning methods for classification that can be especially useful in class imbalanced datasets. In this paper, ensemble learning methods (specifically bagging and boosting) are used to balance the performance measures (sensitivity and specificity) on a diabetes dataset to predict if a patient will be readmitted to the hospital based on various feature vectors. From the experiments conducted, it can be empirically concluded that, by using ensemble learning methods, although accuracy does improve to some margin, both sensitivity and specificity are balanced significantly and consistently over different cross validation approaches.",

keywords = "Balancing, Boosting, Classification, Ensemble methods",

author = "Neeraj Bahl and Ajay Bansal",

year = "2019",

month = jan,

day = "1",

doi = "10.1007/978-3-030-20482-2_25",

language = "English (US)",

isbn = "9783030204815",

series = "Lecture Notes in Business Information Processing",

volume = "354",

publisher = "Springer Verlag",

pages = "311--324",

editor = "Witold Abramowicz and Rafael Corchuelo",

booktitle = "Business Information Systems - 22nd International Conference, BIS 2019, Proceedings",

note = "22nd International Conference on Business Information Systems, BIS 2019 ; Conference date: 26-06-2019 Through 28-06-2019",

}

TY - GEN

T1 - Balancing Performance Measures in Classification Using Ensemble Learning Methods

AU - Bahl, Neeraj

AU - Bansal, Ajay

PY - 2019/1/1

Y1 - 2019/1/1

AB - Ensemble learning methods have recently been widely used in various domains and applications owing to the improvements in computational efficiency and distributed computing advances. However, with the advent of wide variety of applications of machine learning techniques to class imbalance problems, further focus is needed to evaluate, improve and balance other performance measures such as sensitivity (true positive rate) and specificity (true negative rate) in classification. This paper demonstrates an approach to evaluate and balance the performance measures (specifically sensitivity and specificity) using ensemble learning methods for classification that can be especially useful in class imbalanced datasets. In this paper, ensemble learning methods (specifically bagging and boosting) are used to balance the performance measures (sensitivity and specificity) on a diabetes dataset to predict if a patient will be readmitted to the hospital based on various feature vectors. From the experiments conducted, it can be empirically concluded that, by using ensemble learning methods, although accuracy does improve to some margin, both sensitivity and specificity are balanced significantly and consistently over different cross validation approaches.

KW - Balancing

KW - Boosting

KW - Classification

KW - Ensemble methods

UR - http://www.scopus.com/inward/record.url?scp=85068141788&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85068141788&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-20482-2_25

DO - 10.1007/978-3-030-20482-2_25

M3 - Conference contribution

AN - SCOPUS:85068141788

SN - 9783030204815

T3 - Lecture Notes in Business Information Processing

VL - 354

SP - 311

EP - 324

BT - Business Information Systems - 22nd International Conference, BIS 2019, Proceedings

A2 - Abramowicz, Witold

A2 - Corchuelo, Rafael

PB - Springer Verlag

T2 - 22nd International Conference on Business Information Systems, BIS 2019

Y2 - 26 June 2019 through 28 June 2019

ER -
