Showing 1 - 10 of 158
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...
Persistent link: https://www.econbiz.de/10012856685
We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...
Persistent link: https://www.econbiz.de/10013035332
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize his cumulative expected earnings...
Persistent link: https://www.econbiz.de/10011183969
A new class of online services allows publishers to direct readers from articles they are currently reading to other web-based content they may be interested in. A key feature of such a dynamic recommendation service is that users interact with the provider along their browsing path. While the...
Persistent link: https://www.econbiz.de/10011183988
Persistent link: https://www.econbiz.de/10011397831
Persistent link: https://www.econbiz.de/10011437902
We consider a pricing problem in an environment where the customers' willingness-to-pay (WtP) distribution may change at some point over the selling horizon. Customers arrive sequentially and make purchase decisions based on a quoted price and their private reservation price. The seller knows...
Persistent link: https://www.econbiz.de/10013112585
We consider a single product revenue management problem where, given an initial inventory, the objective is to dynamically adjust prices over a finite sales horizon to maximize expected revenues. Realized demand is observed over time, but the underlying functional relationship between price and...
Persistent link: https://www.econbiz.de/10013119422
We consider a multi-period single product pricing problem with an unknown demand curve. The seller's objective is to adjust prices in each period so as to maximize cumulative expected revenues over a given finite time horizon; in doing so, the seller needs to resolve the tension between learning...
Persistent link: https://www.econbiz.de/10013066868
The fields of statistics and econometrics have developed powerful methods for testing the validity (specification) of a model based on its fit to underlying data. Unlike statisticians, managers are typically more interested in the performance of a decision rather than the statistical validity of...
Persistent link: https://www.econbiz.de/10014042407