A Tunable Loss Function for Binary Classification

Tyler Sypherd; Mario Diaz; Lalitha Sankar; Peter Kairouz

doi:10.1109/ISIT.2019.8849796

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

18 Scopus citations

Abstract

We present α-loss, α ∈ [1, ∞], a tunable loss function for binary classification that bridges log-loss (α = 1) and 0-1 loss (α = ∞). We prove that α-loss has an equivalent margin-based form and is classification-calibrated, two desirable properties for a good surrogate loss function for the ideal yet intractable 0-1 loss. For logistic regression-based classification, we provide an upper bound on the difference between the empirical and expected risk for α-loss at the critical points of the empirical risk by exploiting its Lipschitzianity along with recent results on the landscape features of empirical risk functions. Finally, we show that α-loss with α = 2 performs better than log-loss on MNIST for logistic regression.
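The abstract does not state the loss itself. As a minimal sketch, assuming the definition commonly given for α-loss, ℓ_α(p) = α/(α − 1) · (1 − p^(1 − 1/α)), where p is the probability assigned to the correct label, the Python below shows how the two endpoints arise. The formula, the function names `alpha_loss`, `sigmoid`, and `margin_alpha_loss`, and the choice of a sigmoid for the margin-based form are assumptions that are not part of this record. Under this assumed form, the α = ∞ endpoint is 1 − p, a probability-of-error version of the 0-1 loss.

```python
import math


def alpha_loss(p_true: float, alpha: float) -> float:
    """Assumed alpha-loss for probability p_true placed on the correct label."""
    if not 0.0 < p_true <= 1.0:
        raise ValueError("p_true must lie in (0, 1]")
    if alpha < 1.0:
        raise ValueError("alpha must lie in [1, inf]")
    if alpha == 1.0:
        # Continuous extension at alpha = 1: log-loss.
        return -math.log(p_true)
    if math.isinf(alpha):
        # Continuous extension at alpha = inf: probability of error.
        return 1.0 - p_true
    return alpha / (alpha - 1.0) * (1.0 - p_true ** ((alpha - 1.0) / alpha))


def sigmoid(z: float) -> float:
    """Logistic function, used here to map a margin to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def margin_alpha_loss(margin: float, alpha: float) -> float:
    """Margin-based form (assumed): alpha-loss applied to sigmoid(margin)."""
    return alpha_loss(sigmoid(margin), alpha)


if __name__ == "__main__":
    # Compare the loss of a fairly confident correct prediction across alpha.
    for a in (1.0, 2.0, 10.0, math.inf):
        print(a, alpha_loss(0.8, a))
```

Here `margin` stands for the label in {−1, +1} multiplied by the classifier's real-valued score, so a larger positive margin means a more confident correct prediction and a smaller loss for every α.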

Original language	English (US)
Title of host publication	2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	2479-2483
Number of pages	5
ISBN (Electronic)	9781538692912
DOIs	https://doi.org/10.1109/ISIT.2019.8849796
State	Published - Jul 2019
Event	2019 IEEE International Symposium on Information Theory, ISIT 2019 - Paris, France
Duration	Jul 7 2019 → Jul 12 2019

Publication series

Name	IEEE International Symposium on Information Theory - Proceedings
Volume	2019-July
ISSN (Print)	2157-8095

Conference

Conference	2019 IEEE International Symposium on Information Theory, ISIT 2019
Country/Territory	France
City	Paris
Period	7/7/19 → 7/12/19

ASJC Scopus subject areas

Theoretical Computer Science
Information Systems
Modeling and Simulation
Applied Mathematics

Access to Document

10.1109/ISIT.2019.8849796

Cite this

Sypherd, T., Diaz, M., Sankar, L., & Kairouz, P. (2019). A Tunable Loss Function for Binary Classification. In 2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings (pp. 2479-2483). Article 8849796 (IEEE International Symposium on Information Theory - Proceedings; Vol. 2019-July). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ISIT.2019.8849796

A Tunable Loss Function for Binary Classification. / Sypherd, Tyler; Diaz, Mario; Sankar, Lalitha et al.
2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. p. 2479-2483 8849796 (IEEE International Symposium on Information Theory - Proceedings; Vol. 2019-July).

Sypherd, T, Diaz, M, Sankar, L & Kairouz, P 2019, A Tunable Loss Function for Binary Classification. in 2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings., 8849796, IEEE International Symposium on Information Theory - Proceedings, vol. 2019-July, Institute of Electrical and Electronics Engineers Inc., pp. 2479-2483, 2019 IEEE International Symposium on Information Theory, ISIT 2019, Paris, France, 7/7/19. https://doi.org/10.1109/ISIT.2019.8849796

@inproceedings{15c7fc07013441efbb8e1bbd0d0f3626,

title = "A Tunable Loss Function for Binary Classification",

abstract = "We present α-loss, α ∈ [1, ∞], a tunable loss function for binary classification that bridges log-loss (α = 1) and 0-1 loss (α = ∞). We prove that α-loss has an equivalent margin-based form and is classification-calibrated, two desirable properties for a good surrogate loss function for the ideal yet intractable 0-1 loss. For logistic regression-based classification, we provide an upper bound on the difference between the empirical and expected risk for α-loss at the critical points of the empirical risk by exploiting its Lipschitzianity along with recent results on the landscape features of empirical risk functions. Finally, we show that α-loss with α = 2 performs better than log-loss on MNIST for logistic regression.",

author = "Tyler Sypherd and Mario Diaz and Lalitha Sankar and Peter Kairouz",

note = "Funding Information: This material is based upon work supported by the National Science Foundation under Grant Nos. CCF-1350914 and CIF-1815261. Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 IEEE International Symposium on Information Theory, ISIT 2019 ; Conference date: 07-07-2019 Through 12-07-2019",

year = "2019",

month = jul,

doi = "10.1109/ISIT.2019.8849796",

language = "English (US)",

series = "IEEE International Symposium on Information Theory - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "2479--2483",

booktitle = "2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings",

}

TY - GEN

T1 - A Tunable Loss Function for Binary Classification

AU - Sypherd, Tyler

AU - Diaz, Mario

AU - Sankar, Lalitha

AU - Kairouz, Peter

PY - 2019/7

Y1 - 2019/7

N2 - We present α-loss, α ∈ [1, ∞], a tunable loss function for binary classification that bridges log-loss (α = 1) and 0-1 loss (α = ∞). We prove that α-loss has an equivalent margin-based form and is classification-calibrated, two desirable properties for a good surrogate loss function for the ideal yet intractable 0-1 loss. For logistic regression-based classification, we provide an upper bound on the difference between the empirical and expected risk for α-loss at the critical points of the empirical risk by exploiting its Lipschitzianity along with recent results on the landscape features of empirical risk functions. Finally, we show that α-loss with α = 2 performs better than log-loss on MNIST for logistic regression.

AB - We present α-loss, α ∈ [1, ∞], a tunable loss function for binary classification that bridges log-loss (α = 1) and 0-1 loss (α = ∞). We prove that α-loss has an equivalent margin-based form and is classification-calibrated, two desirable properties for a good surrogate loss function for the ideal yet intractable 0-1 loss. For logistic regression-based classification, we provide an upper bound on the difference between the empirical and expected risk for α-loss at the critical points of the empirical risk by exploiting its Lipschitzianity along with recent results on the landscape features of empirical risk functions. Finally, we show that α-loss with α = 2 performs better than log-loss on MNIST for logistic regression.

UR - http://www.scopus.com/inward/record.url?scp=85073160283&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073160283&partnerID=8YFLogxK

U2 - 10.1109/ISIT.2019.8849796

DO - 10.1109/ISIT.2019.8849796

M3 - Conference contribution

AN - SCOPUS:85073160283

T3 - IEEE International Symposium on Information Theory - Proceedings

SP - 2479

EP - 2483

BT - 2019 IEEE International Symposium on Information Theory, ISIT 2019 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2019 IEEE International Symposium on Information Theory, ISIT 2019

Y2 - 7 July 2019 through 12 July 2019

ER -
