Showing 1 - 1 of 1
In this paper we suggest a new successive approximation method to compute the optimal discounted reward for finite state and action, discrete time, discounted Markov decision chains. The method is based on a block partitioning of the (stochastic) matrices corresponding to the stationary...
Persistent link: https://www.econbiz.de/10008873101