Transfer reinforcement learning for mixed observability markov decision processes with time-varying interval-valued parameters and its application in pandemic control
Mu Du, Hongtao Yu, Nan Kong
| Year of publication: |
2025
|
|---|---|
| Authors: | Du, Mu ; Yu, Hongtao ; Kong, Nan |
| Published in: |
INFORMS journal on computing : JOC ; charting new directions in operations research and computer science ; a journal of the Institute for Operations Research and the Management Sciences. - Linthicum, Md. : INFORMS, ISSN 1526-5528, ZDB-ID 2004082-9. - Vol. 37.2025, 2, p. 315-337
|
| Subject: | deep reinforcement learning | MOMDP | online learning and optimization | time-varying interval-valued parameters | transfer learning | Lernprozess | Learning process | Theorie | Theory | Lernen | Learning | Markov-Kette | Markov chain | Mathematische Optimierung | Mathematical programming | E-Learning | E-learning |
Saved in:
Saved in favorites
Similar items by subject
-
A nonparametric learning algorithm for a stochastic multi-echelon inventory problem
Yang, Cong, (2024)
-
An asymptotically tight learning algorithm for mobile-promotion platforms
Feng, Zhichao, (2023)
-
Optimal online learning for nonlinear belief models using discrete priors
Han, Weidong, (2020)
- More ...
Similar items by person