Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
This paper is the second part of our study of Blackwell optimal policies in Markov decision chains with a Borel state space and unbounded rewards. We prove that a stationary policy is Blackwell optimal in the class of all history-dependent policies if it is Blackwell optimal in the class of stationary policies. We also develop recurrence and drift conditions which ensure ergodicity and integrability assumptions made in the previous paper, and which are more suitable for applications. As an example we study a cash-balance model. Copyright Springer-Verlag Berlin Heidelberg 1999
Year of publication: |
1999
|
---|---|
Authors: | Hordijk, Arie ; Yushkevich, Alexander A. |
Published in: |
Computational Statistics. - Springer. - Vol. 50.1999, 3, p. 421-448
|
Publisher: |
Springer |
Subject: | Markov decision chains | Blackwell optimality | drift and recurrence conditions |
Saved in:
Online Resource