A Reinforcement Learning Method of Solving Markov Decision Processes : An Adaptive Exploration Model Based on Temporal Difference Error
Year of publication: | [2023] |
---|---|
Authors: | Wang, Xianjia ; Yang, Zhipeng ; Chen, Guici ; Liu, Yanli |
Publisher: | [S.l.] : SSRN |
Subject: | Markov chain | Theory | Decision |
- A new approach to decision prioritization : case for healthcare decision-makers
  Singh, Sudhanshu (2019)
- New tests of optimality in Markov decision processes
  Lasserre, Jean B. (1993)
- Average reward optimality equation in Markov decision processes with a general state space
  Tijms, Henk C. (1993)
- Xiao, Haixia (2019)
- Noise suppress exponential growth for hybrid Hopfield neural networks
  Zhu, Song (2012)
- Segmenting the Chinese consumer goods market : a hybrid approach
  Bauer, Erich (2006)