The role of lookahead and approximate policy evaluation in reinforcement learning with linear value function approximation
| Year of publication: | 2025 |
|---|---|
| Authors: | Winnicki, Anna; Lubars, Joseph; Livesay, Michael; Srikant, Rayadurgam |
| Published in: | Operations Research. - Linthicum, Md.: INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 73.2025, 1, p. 139-156 |
| Subject: | Dynamic programming; Machine Learning and Data Science; Markov decision processes; Artificial intelligence; Markov chain; Theory; Learning process; Mathematical programming |
- Global optimality guarantees for policy gradient methods / Bhandari, Jalaj (2024)
- On boundedness of Q-learning iterates for stochastic shortest path problems / Yu, Huizhen (2013)
- Bayesian learning of dose-response parameters from a cohort under response-guided dosing / Kotas, Jakob (2018)
- A policy gradient algorithm for the risk-sensitive exponential cost MDP / Moharrami, Mehrdad (2025)
- The power of slightly more than one sample in randomized load balancing / Ying, Lei (2017)
- Heavy-traffic insensitive bounds for weighted proportionally fair bandwidth sharing policies / Wang, Weina (2022)