Showing 1 - 9 of 9
We study a robust model of the multi-armed bandit (MAB) problem in which the transition probabilities are ambiguous and belong to subsets of the probability simplex. We characterize the optimal policy as a project-by-project retirement policy but we show that arms become dependent so the Gittins...
Persistent link: https://www.econbiz.de/10013062893
Persistent link: https://www.econbiz.de/10002024503
Persistent link: https://www.econbiz.de/10009784136
Persistent link: https://www.econbiz.de/10009745626
Persistent link: https://www.econbiz.de/10010381854
Persistent link: https://www.econbiz.de/10010355688
Persistent link: https://www.econbiz.de/10009760456
Multi-armed bandit has been well-known for its efficiency in online decision-making in terms of minimizing the loss of the participants' welfare during experiments (i.e., the regret). In clinical trials and many other scenarios, the statistical power of inferring the treatment effects (i.e., the...
Persistent link: https://www.econbiz.de/10014076786
The Handbook is a comprehensive research reference that is essential for anyone interested in conducting research in supply chain. Unique features include: -A focus on the intersection of quantitative supply chain analysis and E-Business, -Unlike other edited volumes in the supply chain area,...
Persistent link: https://www.econbiz.de/10013521441