An optimal uniform concentration inequality for discrete entropies on finite alphabets in the high-dimensional setting

Yunpeng Zhao

Research output: Contribution to journal › Article › peer-review


Abstract

We prove an exponential-decay concentration inequality to bound the tail probability of the difference between the log-likelihood of discrete random variables on a finite alphabet and the negative entropy. The concentration bound we derive holds uniformly over all parameter values. The new result improves the convergence rate in an earlier result of Zhao (2020), from (K² log K)/n = o(1) to (log K)²/n = o(1), where n is the sample size and K is the size of the alphabet. We further prove that the rate (log K)²/n = o(1) is optimal. The result is extended to misspecified log-likelihoods for grouped random variables. We give applications of the new result in information theory.
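The quantity studied in the abstract is the average log-likelihood (1/n) Σᵢ log p(Xᵢ), which by the law of large numbers concentrates around its mean, the negative entropy Σₖ pₖ log pₖ. The following minimal Python sketch illustrates this concentration numerically; it is not the paper's proof or construction, and the choice of distribution, alphabet size K, and sample size n are illustrative assumptions only. The comparison against log(K)/√n is merely suggestive of the scale in the abstract's rate (log K)²/n, ignoring constants.

```python
import numpy as np

rng = np.random.default_rng(0)

K = 1000          # alphabet size (illustrative choice)
n = 100_000       # sample size (illustrative choice)

# An arbitrary distribution on {0, ..., K-1} (illustrative; the paper's
# bound is uniform over all such parameter values).
p = rng.dirichlet(np.ones(K))

# Negative entropy: E[log p(X)] = sum_k p_k log p_k.
neg_entropy = np.sum(p * np.log(p))

# Empirical average log-likelihood of an i.i.d. sample under the true p.
x = rng.choice(K, size=n, p=p)
avg_loglik = np.mean(np.log(p[x]))

# Deviation, compared against the scale log(K)/sqrt(n), which corresponds
# to the abstract's rate (log K)^2 / n up to constants.
print(f"negative entropy       : {neg_entropy:.6f}")
print(f"average log-likelihood : {avg_loglik:.6f}")
print(f"absolute deviation     : {abs(avg_loglik - neg_entropy):.3e}")
print(f"log(K)/sqrt(n)         : {np.log(K) / np.sqrt(n):.3e}")
```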

Original language: English (US)
Pages (from-to): 1892-1911
Number of pages: 20
Journal: Bernoulli
Volume: 28
Issue number: 3
State: Published - Aug 2022

Keywords

  • Concentration inequality
  • entropy
  • log-likelihood
  • non-convex optimization
  • source coding theorem
  • typical set

ASJC Scopus subject areas

  • Statistics and Probability
