//-->
Generalised weakened fictitious play
Leslie, David S., (2006)
Finite-horizon variance penalised Markov decision processes
Collins, E.J., (1997)
Convergent learning algorithms for potential games with unknown noisy rewards
Chapman, Archie C., (2011)