Similar Search Results

Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-Stationary Rewards

Besbes, Omar - 2020

In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...

Persistent link: https://www.econbiz.de/10012856685

Non-Stationary Stochastic Optimization

Besbes, Omar - 2015

We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...

Persistent link: https://www.econbiz.de/10013035332

Optimal Exploration-Exploitation in a Multi-armed-Bandit Problem with Non-stationary Rewards

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - Graduate School of Business, Stanford University - 2014

Persistent link: https://www.econbiz.de/10011183969

Optimization in Online Content Recommendation Services: Beyond Click-Through-Rates

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - Graduate School of Business, Stanford University - 2014

A new class of online services allows publishers to direct readers from articles they are currently reading to other web-based content they may be interested in. A key feature of such a dynamic recommendation service is that users interact with the provider along their browsing path. While the...

Persistent link: https://www.econbiz.de/10011183988

Non-stationary stochastic optimization

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - In: Operations research 63 (2015) 5, pp. 1227-1244

Persistent link: https://www.econbiz.de/10011397831

Optimization in online content recommendation services : beyond click-through rates

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - In: Manufacturing & service operations management : M & SOM 18 (2016) 1, pp. 15-33

Persistent link: https://www.econbiz.de/10011437902

On the Minimax Complexity of Pricing in a Changing Environment

Besbes, Omar - 2012

We consider a pricing problem in an environment where the customers' willingness-to-pay (WtP) distribution may change at some point over the selling horizon. Customers arrive sequentially and make purchase decisions based on a quoted price and their private reservation price. The seller knows...

Persistent link: https://www.econbiz.de/10013112585

Dynamic Pricing Without Knowing the Demand Function : Risk Bounds and Near-Optimal Algorithms

Besbes, Omar - 2011

We consider a single product revenue management problem where, given an initial inventory, the objective is to dynamically adjust prices over a finite sales horizon to maximize expected revenues. Realized demand is observed over time, but the underlying functional relationship between price and...

Persistent link: https://www.econbiz.de/10013119422

On the (Surprising) Sufficiency of Linear Models for Dynamic Pricing with Demand Learning

Besbes, Omar - 2014

We consider a multi-period single product pricing problem with an unknown demand curve. The seller's objective is to adjust prices in each period so as to maximize cumulative expected revenues over a given finite time horizon; in doing so, the seller needs to resolve the tension between learning...

Persistent link: https://www.econbiz.de/10013066868

Testing the Validity of a Demand Model : An Operations Perspective

Besbes, Omar; Phillips, Robert L.; Zeevi, Assaf - 2012

The fields of statistics and econometrics have developed powerful methods for testing the validity (specification) of a model based on its fit to underlying data. Unlike statisticians, managers are typically more interested in the performance of a decision rather than the statistical validity of...

Persistent link: https://www.econbiz.de/10014042407