Similar Search Results

Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-Stationary Rewards

Besbes, Omar - 2020

In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...

Persistent link: https://www.econbiz.de/10012856685

Non-Stationary Stochastic Optimization

Besbes, Omar - 2015

We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...

Persistent link: https://www.econbiz.de/10013035332

Non-stationary stochastic optimization

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - In: Operations research 63 (2015) 5, pp. 1227-1244

Persistent link: https://www.econbiz.de/10011397831

Dynamic Pricing Without Knowing the Demand Function : Risk Bounds and Near-Optimal Algorithms

Besbes, Omar - 2011

We consider a single product revenue management problem where, given an initial inventory, the objective is to dynamically adjust prices over a finite sales horizon to maximize expected revenues. Realized demand is observed over time, but the underlying functional relationship between price and...

Persistent link: https://www.econbiz.de/10013119422

On the Disclosure of Promotion Value in Platforms with Learning Sellers

Gur, Yonatan - 2020

We consider a platform facilitating trade between sellers and buyers with the objective of maximizing consumer surplus. Even though in many such marketplaces prices are set by revenue-maximizing sellers, platforms can influence prices through (i) price-dependent promotion policies that can...

Persistent link: https://www.econbiz.de/10012847343

Value loss in allocation systems with provider guarantees

Gur, Yonatan; Iancu, Dan; Warnes, Xavier - In: Management science : journal of the Institute for … 67 (2021) 6, pp. 3757-3784

Persistent link: https://www.econbiz.de/10012607132

Adaptive Sequential Experiments with Unknown Information Arrival Processes

Gur, Yonatan; Momeni, Ahmadreza - 2021

Sequential experiments are deployed in a variety of practices, including for optimizing product recommendations and pricing in online platforms. Such experiments are often characterized by an exploration-exploitation tradeoff that is well-understood when at each time period feedback is received...

Persistent link: https://www.econbiz.de/10013218225

Information disclosure and promotion policy design for platforms

Gur, Yonatan; Macnamara, Gregory; Morgenstern, Ilan; … - In: Management science : journal of the Institute for … 69 (2023) 10, pp. 5883-5903

Persistent link: https://www.econbiz.de/10014393037

Regret Minimization with Dynamic Benchmarks in Repeated Games

Crippa, Ludovico; Gur, Yonatan; Light, Bar - 2023

In repeated games, strategies are often evaluated by their ability to guarantee the performance of the single best action that is selected in hindsight (a property referred to as Hannan consistency, or no-regret). However, the effectiveness of the single best action as a yardstick to evaluate...

Persistent link: https://www.econbiz.de/10014264316

Value loss in allocation systems with provider guarantees

Gur, Yonatan; Iancu, Dan; Warnes, Xavier - 2019

Many operational settings share the following three features: (i) a centralized planning system allocates tasks to workers or service providers, (ii) the providers generate value by completing the tasks, and (iii) the completion of tasks influences the providers' welfare. In such cases, the...

Persistent link: https://www.econbiz.de/10012065219