Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Year of publication: | 2013 |
---|---|
Authors: | Ortner, Ronald |
Published in: | Optimization under uncertainty ; Vol. 1. - New York, NY : Springer. - 2013, p. 321-336 |
Subject: | Entscheidung | Decision | Theorie | Theory | Markov-Kette | Markov chain | Lernprozess | Learning process |
-
On boundedness of Q-learning iterates for stochastic shortest path problems
Yu, Huizhen, (2013)
-
Springborn, Michael, (2013)
-
Strategy selection and outcome prediction in sport using dynamic learning for stochastic processes
Percy, David Frank, (2015)
-
Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Ortner, Ronald, (2013)
-
Linear dependence of stationary distributions in ergodic Markov decision processes
Ortner, Ronald, (2007)
-
A new heuristic and an exact approach for a production planning problem
Auer, Peter, (2021)