Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains
| Year of publication: | 2002 |
|---|---|
| Authors: | Cavazos-Cadena, Rolando |
| Published in: | Mathematical Methods of Operations Research. - Springer. - Vol. 56.2002, 2, p. 181-196 |
| Publisher: | Springer |
| Subject: | AMS Subject Classifications. Primary ; Secondary ; Key words: Successive approximations ; Markov decision processes ; Schweitzer's Transformation ; Optimality Equation ; Convergence of the value iteration approximations |
- Cavazos-Cadena, Rolando, (2002)
- The finiteness of the reward function and the optimal value function in Markov decision processes
  Hu, Qiying, (1999)
- Cavazos-Cadena, Rolando, (2002)
- Cavazos-Cadena, Rolando, (2002)
- Cavazos-Cadena, Rolando, (2009)