Serin, Yasemin; Kulkarni, Vidyadhar - In: Mathematical Methods of Operations Research 61 (2005) 2, pp. 311-328
We develop an algorithm to compute optimal policies for Markov decision processes subject to constraints that result from some observability restrictions on the process. We assume that the state of the Markov process is unobservable. There is an observable process related to the unobservable...