//-->
Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains
Cavazos-Cadena, Rolando, (2002)
Adaptive control of average Markov decision chains under the Lyapunov stability condition
Cavazos-Cadena, Rolando, (2001)