Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning
| Year of publication: |
1999
|
|---|---|
| Authors: | Das, Tapas K. ; Gosavi, Abhijit ; Mahadevan, Sridhar ; Marchalleck, Nicholas |
| Published in: |
Management Science. - Institute for Operations Research and the Management Sciences - INFORMS, ISSN 0025-1909. - Vol. 45.1999, 4, p. 560-574
|
| Publisher: |
Institute for Operations Research and the Management Sciences - INFORMS |
| Subject: | semi-Markov decision processes (SMDP) | reinforcement learning | average reward | preventive maintenance |
-
Yan, Qi, (2022)
-
Wang, Hongfeng, (2023)
-
Stability estimates in the problem of average optimal switching of a Markov chain
Gordienko, Evgueni, (2003)
- More ...
-
Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning
Das, Tapas K., (1999)
-
Gosavi, Abhijit, (2002)
-
Gosavi, Abhijit, (2004)
- More ...