Hypothesis testing in the high privacy limit

Jiachun Liao; Lalitha Sankar; Vincent Y F Tan; Flavio P. Calmon

doi:10.1109/ALLERTON.2016.7852293

Hypothesis testing in the high privacy limit

Jiachun Liao, Lalitha Sankar, Vincent Y F Tan, Flavio P. Calmon

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

14 Scopus citations

Abstract

Binary hypothesis testing under the Neyman-Pearson formalism is a statistical inference framework for distinguishing data generated by two different source distributions. Privacy restrictions may require the curator of the data or the data respondents themselves to share data with the test only after applying a randomizing privacy mechanism. Using mutual information as the privacy metric and the relative entropy between the two distributions of the output (post-randomization) source classes as the utility metric (motivated by the Chernoff-Stein Lemma), this work focuses on finding an optimal mechanism that maximizes the chosen utility function while ensuring that the mutual information based leakage for both source distributions is bounded. Focusing on the high privacy regime, an Euclidean information-theoretic (E-IT) approximation to the tradeoff problem is presented. It is shown that the solution to the E-IT approximation is independent of the alphabet size and clarifies that a mutual information based privacy metric preserves the privacy of the source symbols in inverse proportion to their likelihood.

Original language	English (US)
Title of host publication	54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	649-656
Number of pages	8
ISBN (Electronic)	9781509045495
DOIs	https://doi.org/10.1109/ALLERTON.2016.7852293
State	Published - Feb 10 2017
Event	54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016 - Monticello, United States Duration: Sep 27 2016 → Sep 30 2016

Publication series

Name	54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016

Other

Other	54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016
Country/Territory	United States
City	Monticello
Period	9/27/16 → 9/30/16

Keywords

Binary hypothesis testing
Euclidean information theory
Privacy

ASJC Scopus subject areas

Artificial Intelligence
Computational Theory and Mathematics
Computer Networks and Communications
Hardware and Architecture
Control and Optimization

Access to Document

10.1109/ALLERTON.2016.7852293

Cite this

Liao, J., Sankar, L., Tan, V. Y. F., & Calmon, F. P. (2017). Hypothesis testing in the high privacy limit. In 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016 (pp. 649-656). Article 7852293 (54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ALLERTON.2016.7852293

Hypothesis testing in the high privacy limit. / Liao, Jiachun; Sankar, Lalitha; Tan, Vincent Y F et al.
54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016. Institute of Electrical and Electronics Engineers Inc., 2017. p. 649-656 7852293 (54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Liao, J, Sankar, L, Tan, VYF & Calmon, FP 2017, Hypothesis testing in the high privacy limit. in 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016., 7852293, 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016, Institute of Electrical and Electronics Engineers Inc., pp. 649-656, 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016, Monticello, United States, 9/27/16. https://doi.org/10.1109/ALLERTON.2016.7852293

Liao J, Sankar L, Tan VYF, Calmon FP. Hypothesis testing in the high privacy limit. In 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016. Institute of Electrical and Electronics Engineers Inc. 2017. p. 649-656. 7852293. (54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016). doi: 10.1109/ALLERTON.2016.7852293

@inproceedings{63951a2f5e9d46b69d4461a346538194,

title = "Hypothesis testing in the high privacy limit",

abstract = "Binary hypothesis testing under the Neyman-Pearson formalism is a statistical inference framework for distinguishing data generated by two different source distributions. Privacy restrictions may require the curator of the data or the data respondents themselves to share data with the test only after applying a randomizing privacy mechanism. Using mutual information as the privacy metric and the relative entropy between the two distributions of the output (post-randomization) source classes as the utility metric (motivated by the Chernoff-Stein Lemma), this work focuses on finding an optimal mechanism that maximizes the chosen utility function while ensuring that the mutual information based leakage for both source distributions is bounded. Focusing on the high privacy regime, an Euclidean information-theoretic (E-IT) approximation to the tradeoff problem is presented. It is shown that the solution to the E-IT approximation is independent of the alphabet size and clarifies that a mutual information based privacy metric preserves the privacy of the source symbols in inverse proportion to their likelihood.",

keywords = "Binary hypothesis testing, Euclidean information theory, Privacy",

author = "Jiachun Liao and Lalitha Sankar and Tan, {Vincent Y F} and Calmon, {Flavio P.}",

note = "Funding Information: This work is supported in part by the National Science Foundation under grants CCF-1350914 and CIF-1422358 Publisher Copyright: {\textcopyright} 2016 IEEE.; 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016 ; Conference date: 27-09-2016 Through 30-09-2016",

year = "2017",

month = feb,

day = "10",

doi = "10.1109/ALLERTON.2016.7852293",

language = "English (US)",

series = "54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "649--656",

booktitle = "54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016",

}

TY - GEN

T1 - Hypothesis testing in the high privacy limit

AU - Liao, Jiachun

AU - Sankar, Lalitha

AU - Tan, Vincent Y F

AU - Calmon, Flavio P.

PY - 2017/2/10

Y1 - 2017/2/10

N2 - Binary hypothesis testing under the Neyman-Pearson formalism is a statistical inference framework for distinguishing data generated by two different source distributions. Privacy restrictions may require the curator of the data or the data respondents themselves to share data with the test only after applying a randomizing privacy mechanism. Using mutual information as the privacy metric and the relative entropy between the two distributions of the output (post-randomization) source classes as the utility metric (motivated by the Chernoff-Stein Lemma), this work focuses on finding an optimal mechanism that maximizes the chosen utility function while ensuring that the mutual information based leakage for both source distributions is bounded. Focusing on the high privacy regime, an Euclidean information-theoretic (E-IT) approximation to the tradeoff problem is presented. It is shown that the solution to the E-IT approximation is independent of the alphabet size and clarifies that a mutual information based privacy metric preserves the privacy of the source symbols in inverse proportion to their likelihood.

AB - Binary hypothesis testing under the Neyman-Pearson formalism is a statistical inference framework for distinguishing data generated by two different source distributions. Privacy restrictions may require the curator of the data or the data respondents themselves to share data with the test only after applying a randomizing privacy mechanism. Using mutual information as the privacy metric and the relative entropy between the two distributions of the output (post-randomization) source classes as the utility metric (motivated by the Chernoff-Stein Lemma), this work focuses on finding an optimal mechanism that maximizes the chosen utility function while ensuring that the mutual information based leakage for both source distributions is bounded. Focusing on the high privacy regime, an Euclidean information-theoretic (E-IT) approximation to the tradeoff problem is presented. It is shown that the solution to the E-IT approximation is independent of the alphabet size and clarifies that a mutual information based privacy metric preserves the privacy of the source symbols in inverse proportion to their likelihood.

KW - Binary hypothesis testing

KW - Euclidean information theory

KW - Privacy

UR - http://www.scopus.com/inward/record.url?scp=85015187588&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85015187588&partnerID=8YFLogxK

U2 - 10.1109/ALLERTON.2016.7852293

DO - 10.1109/ALLERTON.2016.7852293

M3 - Conference contribution

AN - SCOPUS:85015187588

T3 - 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016

SP - 649

EP - 656

BT - 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 54th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2016

Y2 - 27 September 2016 through 30 September 2016

ER -

Hypothesis testing in the high privacy limit

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this