Similar Search Results

Approximate Policy Iteration with a Policy Language Bias: Learning Control Knowledge in Planning Domains

Fern, Alan; Yoon, Sungwook; Givan, Robert - 2003

We design a novel approximate policy iteration (API) method suited for learning good domain-specific control knowledge in large relational planning domains. The learned knowledge takes the form of a control policy for a single Markov decision process representing all problem instances of the...

Persistent link: https://www.econbiz.de/10009430815