Showing 1 - 8 of 8
In the past decade, hundreds of impact evaluation studies have measured the learning outcomes of education interventions in developing countries. The impact magnitudes are often reported in terms of "standard deviations," making them difficult to communicate to policy makers beyond education...
Persistent link: https://www.econbiz.de/10012007992
Persistent link: https://www.econbiz.de/10014338000
Persistent link: https://www.econbiz.de/10015211747
One of the challenges for multi-agent reinforcement learning (MARL) is designing efficient learning algorithms for a large system in which each agent has only limited or partial information of the entire system. In this system, it is desirable to learn policies of a decentralized type. A recent...
Persistent link: https://www.econbiz.de/10013216610
Persistent link: https://www.econbiz.de/10014311405
Persistent link: https://www.econbiz.de/10014314872
We study finite-time horizon continuous-time linear-quadratic reinforcement learning problems in an episodic setting, where both the state and control coefficients are unknown to the controller. We first propose a least-squares algorithm based on continuous-time observations and controls, and...
Persistent link: https://www.econbiz.de/10013226899
Persistent link: https://www.econbiz.de/10011867030