Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning Over a Finite-Time Horizon
Year of publication: |
[2021]
|
---|---|
Authors: | Basei, Matteo ; Guo, Xin ; Hu, Anran ; Zhang, Yufei |
Publisher: |
[S.l.] : SSRN |
Subject: | Theorie | Theory | Lernen | Learning | Lernprozess | Learning process |
-
The aggregation-learning trade-off
Piezunka, Henning, (2022)
-
Learning by convex combination
Flores-Szwagrzak, Karol, (2022)
-
Bretschger, Lucas, (2024)
- More ...
-
A general framework for learning mean-field games
Guo, Xin, (2023)
-
Nonzero-sum stochastic games and mean-field games with impulse controls
Basei, Matteo, (2022)
-
First-train timing synchronisation using multi-objective optimisation in urban transit networks
Guo, Xin, (2019)
- More ...