Reinforcement learning : a tutorial survey and recent advances
Gosavi, Abhijit, (2009)
Reinforcement learning for long-run average cost
Gosavi, Abhijit, (2004)
A Simulation-Based Learning Automata Framework for Solving Semi-Markov Decision Problems Under Long-Run Average Reward