Transfer reinforcement learning for mixed observability markov decision processes with time-varying interval-valued parameters and its application in pandemic control

Mu Du, Hongtao Yu, Nan Kong

Year of publication:	2025
Authors:	Du, Mu ; Yu, Hongtao ; Kong, Nan
Published in:	INFORMS journal on computing : JOC ; charting new directions in operations research and computer science ; a journal of the Institute for Operations Research and the Management Sciences. - Linthicum, Md. : INFORMS, ISSN 1526-5528, ZDB-ID 2004082-9. - Vol. 37.2025, 2, p. 315-337
Subject:	deep reinforcement learning \| MOMDP \| online learning and optimization \| time-varying interval-valued parameters \| transfer learning \| Lernprozess \| Learning process \| Theorie \| Theory \| Lernen \| Learning \| Markov-Kette \| Markov chain \| Mathematische Optimierung \| Mathematical programming \| E-Learning \| E-learning

Type of publication:	Article
Type of publication (narrower categories):	Aufsatz in Zeitschrift ; Article in journal
Language:	English
Other identifiers:	10.1287/ijoc.2022.0236 [DOI]
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10015434269