Provably efficient reinforcement learning with linear function approximation
| Year of publication: |
2023
|
|---|---|
| Authors: | Jin, Chi ; Yang, Zhuoran ; Wang, Zhaoran ; Jordan, Michael Irwin |
| Published in: |
Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 48.2023, 3, p. 1496-1521
|
| Subject: | episodic MDP | exploration | linear function approximation | reinforcement learning | Theorie | Theory | Lernprozess | Learning process | Lernen | Learning | Mathematische Optimierung | Mathematical programming |
-
Learning to steer nonlinear interior-point methods
Kuhlmann, Renke, (2019)
-
Reinforcement learning for combinatorial optimization : a survey
Mazyavkina, Nina, (2021)
-
A finite time analysis of temporal difference learning with linear function approximation
Bhandari, Jalaj, (2021)
- More ...
-
Xie, Qiaomin, (2023)
-
Neural temporal difference and Q learning provably converge to global optima
Cai, Qi, (2024)
-
Hierarchical Dirichlet processes
Teh, Yee Whye, (2006)
- More ...