Adaptive aggregation for reinforcement learning in average reward Markov decision processes
Ortner, Ronald, (2013)
A new heuristic and an exact approach for a production planning problem
Auer, Peter, (2021)