Vanvuchelen, Nathalie; Boute, Robert N. - 2022
We propose a new policy architecture to apply deep reinforcement learning (DRL) with a continuous action representation. This contrasts with most current DRL implementations, which use a discrete action space and do not scale to large problems. To obtain feasible discrete actions,...