Arruda, E.F.; Fragoso, M.D. - In: European Journal of Operational Research 240 (2015) 3, pp. 697-705
This paper introduces a two-phase approach to solve average cost Markov decision processes, which is based on state space embedding or time aggregation. In the first phase, time aggregation is applied for policy optimization in a prescribed subset of the state space, and a novel result is...