Approximated multi-agent fitted Q iteration
Year of publication: | [2022] |
---|---|
Authors: | Lesage-Landry, Antoine ; Callaway, Duncan S. |
Publisher: | Montréal (Québec), Canada : GERAD, HÉC Montréal |
Subject: | approximate dynamic programming | batch reinforcement learning | Markov decision process | multi-agent reinforcement learning | Agent-based modeling | Theory | Learning process | Markov chain | Dynamic programming | Learning | Mathematical programming |
- Optimising darts strategy using Markov decision processes and reinforcement learning
  Baird, Graham (2020)
- Bayesian exploration for approximate dynamic programming
  Ryzhov, Ilya O. (2019)
- Dynamic programming principles for mean-field controls with learning
  Gu, Haotian (2023)
- Batch reinforcement learning for network-safe demand response in unknown electric grids
  Lesage-Landry, Antoine (2021)
- Batch reinforcement learning for network-safe demand response in unknown electric grids
  Lesage-Landry, Antoine (2022)
- Optimally scheduling public safety power shutoffs
  Lesage-Landry, Antoine (2022)