Showing 1 - 1 of 1
We generalize the bandit process with a covariate introduced by Woodroofe in several significant directions: a linear regression model characterizing the unknown arm, an unknown variance for regression residuals and general discounting sequence for a non-stationary model. With the Bayesian...
Persistent link: https://www.econbiz.de/10010698322