Relative Q-Learning for Average-Reward Markov Decision Processes with Continuous States
Year of publication: | [2021] |
---|---|
Authors: | Yang, Xiangyu ; Hu, Jiaqiao ; Hu, Jianqiang |
Publisher: | [S.l.] : SSRN |
Subject: | Decision ; Theory ; Markov chain |
Extent: | 1 online resource (32 p.) |
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | According to SSRN, the original version of the document was created on December 25, 2021 |
Other identifiers: | 10.2139/ssrn.3993508 [DOI] |
Classification: | C61 - Optimization Techniques; Programming Models; Dynamic Analysis ; C63 - Computational Techniques |
Source: | ECONIS - Online Catalogue of the ZBW |
- A Partitioning Algorithm for Markov Decision Processes with Applications to Market Microstructure
  Chen, Ningyuan (2020)
- Equilibrium in misspecified Markov decision processes
  Esponda, Ignacio (2021)
- On the Optimality of Regularity in Mixing Markovian Decision Rules for MDP Control
  Van der Laan, Dinard (2010)
- Moment estimators for parameters of Lévy‐driven Ornstein–Uhlenbeck processes
  Wu, Yanfeng (2021)
- Tong, Jun (2017)
- Wang, Zhenguo (2022)