Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming
Year of publication: | 2014 |
---|---|
Authors: | Feinberg, Eugene A. ; Huang, Jefferson ; Scherrer, Bruno |
Published in: | Operations research letters. - Amsterdam [u.a.] : Elsevier, ISSN 0167-6377, ZDB-ID 720735-9. - Vol. 42.2014, 6/7, p. 429-431 |
Subject: | Markov decision process | Modified policy iteration | Strongly polynomial | Policy | Algorithm | Theorie | Theory | Mathematische Optimierung | Mathematical programming | Dynamische Optimierung | Dynamic programming | Algorithmus | Markov-Kette | Markov chain |
-
The value iteration algorithm is not strongly polynomial for discounted dynamic programming
Feinberg, Eugene A., (2014)
-
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes
Ma, Shuai, (2023)
-
Penalty-based algorithms for the stochastic obstacle scene problem
Aksakalli, Vural, (2014)
-
Feinberg, Eugene A., (2013)
-
The value iteration algorithm is not strongly polynomial for discounted dynamic programming
Feinberg, Eugene A., (2014)
-
Feinberg, Eugene A., (2018)