Regret minimization in repeated matrix games with variable stage duration
Mannor, Shie (2008)
Basis Function Adaptation in Temporal Difference Reinforcement Learning
Menache, Ishai (2005)