Showing 1 - 10 of 60
Persistent link: https://www.econbiz.de/10014302689
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize his cumulative expected earnings...
Persistent link: https://www.econbiz.de/10011183969
A new class of online services allows publishers to direct readers from articles they are currently reading to other web-based content they may be interested in. A key feature of such a dynamic recommendation service is that users interact with the provider along their browsing path. While the...
Persistent link: https://www.econbiz.de/10011183988
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...
Persistent link: https://www.econbiz.de/10012856685
We study a multi-server queueing model of a revenue-maximizing firm providing a service to a market of heterogeneous price- and delay-sensitive customers with private individual preferences. The firm may offer a selection of service classes that are differentiated in prices and delays. Using a...
Persistent link: https://www.econbiz.de/10013078749
We consider the one-armed bandit problem of Woodroofe [J. Amer. Statist. Assoc. 74 (1979) 799-806], which involves sequential sampling from two populations: one whose characteristics are known, and one which depends on an unknown parameter and incorporates a covariate. The goal is to maximize...
Persistent link: https://www.econbiz.de/10013119402
We consider a call center model with multiple customer classes and multiple server pools. Calls arrive randomly over time, and the instantaneous arrival rates are allowed to vary both temporally and stochastically in an arbitrary manner. The objective is to minimize the sum of personnel costs...
Persistent link: https://www.econbiz.de/10013119405
We consider a single product revenue management problem where, given an initial inventory, the objective is to dynamically adjust prices over a finite sales horizon to maximize expected revenues. Realized demand is observed over time, but the underlying functional relationship between price and...
Persistent link: https://www.econbiz.de/10013119422
We consider a dynamic learning problem where a decision maker sequentially selects a control and observes a response variable that depends on chosen control and an unknown sensitivity parameter. After every observation, the decision maker updates her/his estimate of the unknown parameter and...
Persistent link: https://www.econbiz.de/10012933782
Motivated by applications in financial services, we consider a seller who offers prices sequentially to a stream of potential customers, observing either success or failure in each sales attempt. The parameters of the underlying demand model are initially unknown, so each price decision involves...
Persistent link: https://www.econbiz.de/10012938107