Showing 1 - 10 of 11
We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...
Persistent link: https://www.econbiz.de/10013035332
Persistent link: https://www.econbiz.de/10014437823
Persistent link: https://www.econbiz.de/10015211529
Persistent link: https://www.econbiz.de/10010217832
Persistent link: https://www.econbiz.de/10011397831
Persistent link: https://www.econbiz.de/10010526550
Persistent link: https://www.econbiz.de/10011972263
Persistent link: https://www.econbiz.de/10011596500
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...
Persistent link: https://www.econbiz.de/10012856685
Persistent link: https://www.econbiz.de/10009782027