Guo, Xianping; Shi, Peng; Zhu, Weiping - In: Mathematical Methods of Operations Research 52 (2000) 2, pp. 287-306
-canonical policy for nonstationary Markov decision processes, which is an extension of the canonical policies of Hernández-Lerma and …