Persistent link: https://www.econbiz.de/10014393036
We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...
Persistent link: https://www.econbiz.de/10013035332
Persistent link: https://www.econbiz.de/10014393035
Persistent link: https://www.econbiz.de/10011397831
Persistent link: https://www.econbiz.de/10010526550
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...
Persistent link: https://www.econbiz.de/10012856685
Persistent link: https://www.econbiz.de/10015361773
Persistent link: https://www.econbiz.de/10013365859
Persistent link: https://www.econbiz.de/10014505131