Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Ortner, Ronald, (2013)
Linear dependence of stationary distributions in ergodic Markov decision processes
Ortner, Ronald, (2007)
A new heuristic and an exact approach for a production planning problem
Auer, Peter, (2021)