//-->
Markov decision processes with arbitrary reward processes
Yu, Jia Yuan, (2009)
Regret minimization in repeated matrix games with variable stage duration
Mannor, Shie, (2008)