Type of publication: Article
Notes:
DOI:10.1613/jair.806
Baxter, J. & Bartlett, P. L. (2001) Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research, 15, 319-350.
Faculty of Science and Technology; Mathematical Sciences
Source:
BASE