TY - JOUR
T1 - Stochastic shortest path games
AU - Patek, Stephen D.
AU - Bertsekas, Dimitri P.
N1 - Copyright 2017 Elsevier B.V., All rights reserved.
PY - 1999
Y1 - 1999
N2 - We consider dynamic, two-player, zero-sum games where the 'minimizing' player seeks to drive an underlying finite-state dynamic system to a special terminal state along a least expected cost path. The 'maximizer' seeks to interfere with the minimizer's progress so as to maximize the expected total cost. We consider, for the first time, undiscounted finite-state problems, with compact action spaces, and transition costs that are not strictly positive. We admit that there are policies for the minimizer which permit the maximizer to prolong the game indefinitely. Under assumptions which generalize deterministic shortest path problems, we establish (i) the existence of a real-valued equilibrium cost vector achievable with stationary policies for the opposing players and (ii) the convergence of value iteration and policy iteration to the unique solution of Bellman's equation.
AB - We consider dynamic, two-player, zero-sum games where the 'minimizing' player seeks to drive an underlying finite-state dynamic system to a special terminal state along a least expected cost path. The 'maximizer' seeks to interfere with the minimizer's progress so as to maximize the expected total cost. We consider, for the first time, undiscounted finite-state problems, with compact action spaces, and transition costs that are not strictly positive. We admit that there are policies for the minimizer which permit the maximizer to prolong the game indefinitely. Under assumptions which generalize deterministic shortest path problems, we establish (i) the existence of a real-valued equilibrium cost vector achievable with stationary policies for the opposing players and (ii) the convergence of value iteration and policy iteration to the unique solution of Bellman's equation.
UR - http://www.scopus.com/inward/record.url?scp=0032652241&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0032652241&partnerID=8YFLogxK
U2 - 10.1137/S0363012996299557
DO - 10.1137/S0363012996299557
M3 - Article
AN - SCOPUS:0032652241
SN - 0363-0129
VL - 37
SP - 804
EP - 824
JO - SIAM Journal on Control and Optimization
JF - SIAM Journal on Control and Optimization
IS - 3
ER -