On boundedness of Q-learning iterates for stochastic shortest path problems
Yu, Huizhen, (2013)
A density projection approach for non-trivial information dynamics : adaptive management of stochastic natural resources
Springborn, Michael, (2013)
Strategy selection and outcome prediction in sport using dynamic learning for stochastic processes
Percy, David Frank, (2015)
Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Ortner, Ronald, (2013)
Linear dependence of stationary distributions in ergodic Markov decision processes
Ortner, Ronald, (2007)
A new heuristic and an exact approach for a production planning problem
Auer, Peter, (2021)