Wang, Xikui; Bickis, Mikelis G. - In: Mathematical Methods of Operations Research 58 (2003) 2, pp. 209-219
One-armed bandit processes with continuous delayed responses are formulated as controlled stochastic processes … bandit processes. Furthermore, there is an optimal stopping solution when all observations on the unknown arm are complete …