On boundedness of Q-learning iterates for stochastic shortest path problems

Huizhen Yu; Dimitri P. Bertsekas

Year of publication:	2013
Authors:	Yu, Huizhen ; Bertsekas, Dimitri P.
Published in:	Mathematics of operations research. - Catonsville, MD : INFORMS, ISSN 0364-765X, ZDB-ID 195683-8. - Vol. 38.2013, 2, p. 209-227
Subject:	Markov decision processes \| Q-learning \| stochastic approximation \| dynamic programming \| reinforcement learning \| Theorie \| Theory \| Markov-Kette \| Markov chain \| Dynamische Optimierung \| Dynamic programming \| Stochastischer Prozess \| Stochastic process \| Lernprozess \| Learning process \| Mathematische Optimierung \| Mathematical programming \| Entscheidung \| Decision

Extent:	graph. Darst.
Type of publication:	Article
Type of publication (narrower categories):	Aufsatz in Zeitschrift ; Article in journal
Language:	English
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10009751534