Infinite-horizon policy-gradient estimation
Year of publication: |
2001
|
---|---|
Authors: | Baxter, J. ; Bartlett, P. L. |
Publisher: |
AI Access Foundation |
Subject: | APPLIED MATHEMATICS | ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING | Algorithms | Computational methods | Markov processes | Multi agent systems | Problem solving | Random processes | Gradient-based approaches | Policy parameters | Value-function methods | Learning systems | OAVJ |
Type of publication: | Article |
---|---|
Notes: | DOI:10.1613/jair.806 Baxter, J. & Bartlett, P. L. (2001) Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research, 15, 319-350 . Faculty of Science and Technology; Mathematical Sciences |
Source: | BASE |
-
Model selection and error estimation
Bartlett, P. L., (2002)
-
Barrera, Javiera, (2020)
-
Genetic algorithms in the design of complex distribution networks
Berry, L.M., (1998)
- More ...
-
Model selection and error estimation
Bartlett, P. L., (2002)
-
Hirsch, R., (2010)
-
Baxter, John, (2023)
- More ...