On near optimality of the set of finite-state controllers for average cost POMDP

Huizhen Yu; Dimitri P. Bertsekas

doi:10.1287/moor.1070.0279

Research output: Contribution to journal › Article › peer-review

24 Scopus citations

Abstract

We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an ε-optimal finite-state controller (FSC) functionally independent of initial distributions for any ε > 0, under the assumption that the optimal liminf average cost function of the POMDP is constant. As part of our proof, we establish that if the optimal liminf average cost function is constant, then the optimal limsup average cost function is also constant, and the two are equal. We also discuss the connection between the existence of nearly optimal finite-history controllers and two other important issues for average cost POMDP: the existence of an average cost that is independent of the initial state distribution, and the existence of a bounded solution to the constant average cost optimality equation.
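
For readers unfamiliar with the terminology, the liminf and limsup average cost functions named in the abstract can be written out as below. This is only a sketch in one common notation: the symbols g, ξ, π, S_t, U_t, J and λ* are assumptions introduced here for illustration and do not appear in the abstract, and the reading of ε-optimality in the last comment is likewise an assumption.

```latex
% Assumed notation (not from the abstract):
%   g(s,u)  per-stage cost,  \xi  initial state distribution,
%   \pi     a policy (controller),  S_t, U_t  state and control at time t.
\[
J_-^{\pi}(\xi) = \liminf_{N\to\infty} \frac{1}{N}\,
  \mathbb{E}_{\xi}^{\pi}\Big[\sum_{t=0}^{N-1} g(S_t,U_t)\Big],
\qquad
J_+^{\pi}(\xi) = \limsup_{N\to\infty} \frac{1}{N}\,
  \mathbb{E}_{\xi}^{\pi}\Big[\sum_{t=0}^{N-1} g(S_t,U_t)\Big],
\]
\[
J_-^{*}(\xi) = \inf_{\pi} J_-^{\pi}(\xi),
\qquad
J_+^{*}(\xi) = \inf_{\pi} J_+^{\pi}(\xi).
\]
% Statement in the abstract, in this notation:
%   if J_-^{*}(\xi) = \lambda^{*} for all \xi, then J_+^{*}(\xi) = \lambda^{*} for all \xi.
% One possible reading of "\epsilon-optimal FSC" (an assumption):
%   a finite-state controller \pi_\epsilon with
%   J_+^{\pi_\epsilon}(\xi) \le \lambda^{*} + \epsilon for all \xi.
```

Since liminf never exceeds limsup, J_-^* ≤ J_+^* always holds; the nontrivial direction is that constancy of J_-^* forces the two to coincide.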

Original language	English (US)
Pages (from-to)	1-11
Number of pages	11
Journal	Mathematics of Operations Research
Volume	33
Issue number	1
DOIs	https://doi.org/10.1287/moor.1070.0279
State	Published - Feb 2008
Externally published	Yes

Keywords

Average cost criterion
Finite-state and control models
Optimality conditions
Partially observable Markov decision processes

ASJC Scopus subject areas

General Mathematics
Computer Science Applications
Management Science and Operations Research

Cite this

@article{7fd7ce7e97d3496eb72c6c14fee0d7d6,
  title = "On near optimality of the set of finite-state controllers for average cost {POMDP}",
  abstract = "We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an ε-optimal finite-state controller (FSC) functionally independent of initial distributions for any ε > 0, under the assumption that the optimal liminf average cost function of the POMDP is constant. As part of our proof, we establish that if the optimal liminf average cost function is constant, then the optimal limsup average cost function is also constant, and the two are equal. We also discuss the connection between the existence of nearly optimal finite-history controllers and two other important issues for average cost POMDP: the existence of an average cost that is independent of the initial state distribution, and the existence of a bounded solution to the constant average cost optimality equation.",
  keywords = "Average cost criterion, Finite-state and control models, Optimality conditions, Partially observable Markov decision processes",
  author = "Yu, Huizhen and Bertsekas, {Dimitri P.}",
  year = "2008",
  month = feb,
  doi = "10.1287/moor.1070.0279",
  language = "English (US)",
  volume = "33",
  pages = "1--11",
  journal = "Mathematics of Operations Research",
  issn = "0364-765X",
  publisher = "INFORMS Inst. for Operations Res. and the Management Sciences",
  number = "1",
}

TY  - JOUR
T1  - On near optimality of the set of finite-state controllers for average cost POMDP
AU  - Yu, Huizhen
AU  - Bertsekas, Dimitri P.
PY  - 2008/2
Y1  - 2008/2
N2  - We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an ε-optimal finite-state controller (FSC) functionally independent of initial distributions for any ε > 0, under the assumption that the optimal liminf average cost function of the POMDP is constant. As part of our proof, we establish that if the optimal liminf average cost function is constant, then the optimal limsup average cost function is also constant, and the two are equal. We also discuss the connection between the existence of nearly optimal finite-history controllers and two other important issues for average cost POMDP: the existence of an average cost that is independent of the initial state distribution, and the existence of a bounded solution to the constant average cost optimality equation.
AB  - We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an ε-optimal finite-state controller (FSC) functionally independent of initial distributions for any ε > 0, under the assumption that the optimal liminf average cost function of the POMDP is constant. As part of our proof, we establish that if the optimal liminf average cost function is constant, then the optimal limsup average cost function is also constant, and the two are equal. We also discuss the connection between the existence of nearly optimal finite-history controllers and two other important issues for average cost POMDP: the existence of an average cost that is independent of the initial state distribution, and the existence of a bounded solution to the constant average cost optimality equation.
KW  - Average cost criterion
KW  - Finite-state and control models
KW  - Optimality conditions
KW  - Partially observable Markov decision processes
UR  - http://www.scopus.com/inward/record.url?scp=61349089285&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=61349089285&partnerID=8YFLogxK
U2  - 10.1287/moor.1070.0279
DO  - 10.1287/moor.1070.0279
M3  - Article
AN  - SCOPUS:61349089285
SN  - 0364-765X
VL  - 33
SP  - 1
EP  - 11
JO  - Mathematics of Operations Research
JF  - Mathematics of Operations Research
IS  - 1
ER  -
