Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning Over a Finite-Time Horizon
Year of publication: |
[2021]
|
---|---|
Authors: | Basei, Matteo ; Guo, Xin ; Hu, Anran ; Zhang, Yufei |
Publisher: |
[S.l.] : SSRN |
Subject: | Theorie | Theory | Lernen | Learning | Lernprozess | Learning process |
Extent: | 1 Online-Ressource (24 p) |
---|---|
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments May 18, 2021 erstellt |
Other identifiers: | 10.2139/ssrn.3848428 [DOI] |
Source: | ECONIS - Online Catalogue of the ZBW |
-
The aggregation-learning trade-off
Piezunka, Henning, (2022)
-
Learning by convex combination
Flores-Szwagrzak, Karol, (2022)
-
Bretschger, Lucas, (2024)
- More ...
-
A general framework for learning mean-field games
Guo, Xin, (2023)
-
Nonzero-sum stochastic games and mean-field games with impulse controls
Basei, Matteo, (2022)
-
First-train timing synchronisation using multi-objective optimisation in urban transit networks
Guo, Xin, (2019)
- More ...