//-->
Regret minimization in repeated matrix games with variable stage duration
Mannor, Shie, (2008)
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes
Mannor, Shie, (2003)
Basis Function Adaptation in Temporal Difference Reinforcement Learning
Menache, Ishai, (2005)