Barut, Emre; Powell, Warren - In: Journal of Global Optimization 58 (2014) 3, pp. 517-543
We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel...