Horiguchi, Masayuki - In: Mathematical Methods of Operations Research 53 (2001) 2, pp. 279-295
In this paper, the optimization problem for a stopped Markov decision process with finite states and actions is considered over stopping times τ constrained so that ?τ≦α for some fixed α0. The problem is solved through randomization of stopping times and mathematical programming...