A Reinforcement Learning Method of Solving Markov Decision Processes : An Adaptive Exploration Model Based on Temporal Difference Error
Year of publication: |
[2023]
|
---|---|
Authors: | Wang, Xianjia ; yang, zhipeng ; Chen, Guici ; Liu, Yanli |
Publisher: |
[S.l.] : SSRN |
Subject: | Markov-Kette | Markov chain | Theorie | Theory | Entscheidung | Decision |
-
A new approach to decision prioritization : case for healthcare decision-makers
Singh, Sudhanshu, (2019)
-
New tests of optimality in Markov decision processes
Lasserre, Jean B., (1993)
-
Average reward optimality equation in Markov decision processes with a general state space
Tijms, Henk C., (1993)
- More ...
-
Options for implementing a strategy of market segmentation in Chinese consumer goods markets
Liu, Yanli, (2005)
-
Options for implementing a strategy of market segmentation in chinese consumer goods markets
Liu, Yanli, (2005)
-
Xiao, Haixia, (2019)
- More ...