Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains
Year of publication: |
2002
|
---|---|
Authors: | Cavazos-Cadena, Rolando ; Cavazos-Cadena, Rolando |
Published in: |
Computational Statistics. - Springer. - Vol. 56.2002, 2, p. 181-196
|
Publisher: |
Springer |
Subject: | AMS Subject Classifications. Primary | Secondary | Key words: Successive approximations | Markov decision processes | Schweitzer's Transformation | Optimality Equation | Convergence of the value iteration approximations |
-
Cavazos-Cadena, Rolando, (2002)
-
The finiteness of the reward function and the optimal value function in Markov decision processes
Hu, Qiying, (1999)
-
The finiteness of the reward function and the optimal value function in Markov decision processes
Hu, Qiying, (1999)
- More ...
-
Cavazos-Cadena, Rolando, (2002)
-
Cavazos-Cadena, Rolando, (2002)
-
Cavazos-Cadena, Rolando, (1999)
- More ...