Transfer reinforcement learning for mixed observability markov decision processes with time-varying interval-valued parameters and its application in pandemic control
| Year of publication: |
2025
|
|---|---|
| Authors: | Du, Mu ; Yu, Hongtao ; Kong, Nan |
| Published in: |
INFORMS journal on computing : JOC ; charting new directions in operations research and computer science ; a journal of the Institute for Operations Research and the Management Sciences. - Linthicum, Md. : INFORMS, ISSN 1526-5528, ZDB-ID 2004082-9. - Vol. 37.2025, 2, p. 315-337
|
| Subject: | deep reinforcement learning | MOMDP | online learning and optimization | time-varying interval-valued parameters | transfer learning | Lernprozess | Learning process | Theorie | Theory | Lernen | Learning | Markov-Kette | Markov chain | Mathematische Optimierung | Mathematical programming | E-Learning | E-learning |
-
A nonparametric learning algorithm for a stochastic multi-echelon inventory problem
Yang, Cong, (2024)
-
An asymptotically tight learning algorithm for mobile-promotion platforms
Feng, Zhichao, (2023)
-
Optimal learning for structured bandits
Parys, Bart P. G. van, (2024)
- More ...
-
A data-driven optimization approach for multi-period resource allocation in cholera outbreak control
Du, Mu, (2021)
-
A new heuristic scheduling method for the make-pack-route problem in make-to-order supply chains
Du, Mu, (2019)
-
Zhang, Hang, (2022)
- More ...