//-->
Markov decision processes with arbitrary reward processes
Yu, Jia Yuan, (2009)
Regret minimization in repeated matrix games with variable stage duration
Mannor, Shie, (2008)
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes
Mannor, Shie, (2003)