Corruption-robust exploration in episodic reinforcement learning

Thodoris Lykouris, Max Simchowitz, Aleksandrs Slivkins, Wen Sun

Year of publication:	2025
Authors:	Lykouris, Thodoris ; Simchowitz, Max ; Slivkins, Aleksandrs ; Sun, Wen
Published in:	Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 50.2025, 2, p. 1277-1304
Subject:	reinforcement learning | exploration | regret | robustness | bandit feedback | Lernen | Learning | Lernprozess | Learning process | Entscheidung unter Unsicherheit | Decision under uncertainty | Spieltheorie | Game theory

Type of publication:	Article
Type of publication (narrower categories):	Aufsatz in Zeitschrift ; Article in journal
Language:	English
Other identifiers:	10.1287/moor.2021.0202 [DOI]
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10015444113