Lembersky, Mark R. - In: Management Science 21 (1974) 3, pp. 348-357
Motivated by a planning horizon result for continuous-time Markov decision chains, we study decision rules, called preferred, which may be used in the initially stationary part of nearly optimal policies. We characterize these rules and then, under conditions involving state recurrence and...