Provably efficient reinforcement learning with linear function approximation
Year of publication: | 2023 |
---|---|
Authors: | Jin, Chi ; Yang, Zhuoran ; Wang, Zhaoran ; Jordan, Michael Irwin |
Published in: | Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 48.2023, 3, p. 1496-1521 |
Subject: | episodic MDP | exploration | linear function approximation | reinforcement learning | Theory | Learning process | Learning | Mathematical programming |
-
A general framework for bandit problems beyond cumulative objectives
Cassel, Asaf, (2023)
-
A hybrid breakout local search and reinforcement learning approach to the vertex separator problem
Benlic, Una, (2017)
-
Efficient reinforcement learning in deterministic systems with value function generalization
Wen, Zheng, (2017)
-
Neural temporal difference and Q learning provably converge to global optima
Cai, Qi, (2024)
-
Xie, Qiaomin, (2023)
-
Monotone inclusions, acceleration, and closed-loop control
Lin, Tianyi, (2023)