Castro, Pablo S; Desai, Ajit; Du, Han; Garratt, Rodney; … - 2021
This paper uses reinforcement learning (RL) to approximate the policy rules of banks participating in a high-value payments system. The objective of the agents is to learn a policy function for the choice of amount of liquidity provided to the system at the beginning of the day. Individual...