Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning
Year of publication: |
1999
|
---|---|
Authors: | Das, Tapas K. ; Gosavi, Abhijit ; Mahadevan, Sridhar ; Marchalleck, Nicholas |
Published in: |
Management Science. - Institute for Operations Research and the Management Sciences - INFORMS, ISSN 0025-1909. - Vol. 45.1999, 4, p. 560-574
|
Publisher: |
Institute for Operations Research and the Management Sciences - INFORMS |
Subject: | semi-Markov decision processes (SMDP) | reinforcement learning | average reward | preventive maintenance |
-
Wang, Hongfeng, (2023)
-
Yan, Qi, (2022)
-
Average optimal switching of a Markov chain with a Borel state space
Yushkevich, Alexander, (2002)
- More ...
-
Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning
Das, Tapas K., (1999)
-
Gosavi, Abhijit, (2004)
-
Gosavi, Abhijit, (2002)
- More ...