Showing 1 - 7 of 7
Persistent link: https://www.econbiz.de/10014338000
Persistent link: https://www.econbiz.de/10015211747
One of the challenges for multi-agent reinforcement learning (MARL) is designing efficient learning algorithms for a large system in which each agent has only limited or partial information of the entire system. In this system, it is desirable to learn policies of a decentralized type. A recent...
Persistent link: https://www.econbiz.de/10013216610
Persistent link: https://www.econbiz.de/10014311405
Persistent link: https://www.econbiz.de/10014314872
We study finite-time horizon continuous-time linear-quadratic reinforcement learning problems in an episodic setting, where both the state and control coefficients are unknown to the controller. We first propose a least-squares algorithm based on continuous-time observations and controls, and...
Persistent link: https://www.econbiz.de/10013226899
Persistent link: https://www.econbiz.de/10011867030