Similar Search Results

Besbes, Omar - 2015

We consider a non-stationary variant of a sequential stochastic optimization problem, where the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact...

Persistent link: https://www.econbiz.de/10013035332

A general framework for bandit problems beyond cumulative objectives

Cassel, Asaf; Mannor, Shie; Zeevi, Assaf - In: Mathematics of operations research 48 (2023) 4, pp. 2196-2232

Persistent link: https://www.econbiz.de/10014437823

Towards optimal problem dependent generalization error bounds in statistical learning theory

Xu, Yunbei; Zeevi, Assaf - In: Mathematics of operations research 50 (2025) 1, pp. 40-67

Persistent link: https://www.econbiz.de/10015211529

General bounds and finite-time improvement for the Kiefer-Wolfowitz stochastic approximation algorithm

Broadie, Mark; Cicek, Deniz; Zeevi, Assaf - In: Operations research 59 (2011) 5, pp. 1211-1224

Persistent link: https://www.econbiz.de/10010217832

Non-stationary stochastic optimization

Besbes, Omar; Gur, Yonatan; Zeevi, Assaf - In: Operations research 63 (2015) 5, pp. 1227-1244

Persistent link: https://www.econbiz.de/10011397831

On the (surprising) sufficiency of linear models for dynamic pricing with demand learning

Besbes, Omar; Zeevi, Assaf - In: Management science : journal of the Institute for … 61 (2015) 4, pp. 723-739

Persistent link: https://www.econbiz.de/10010526550

Tractable sampling strategies for ordinal optimization

Shin, Dongwook; Broadie, Mark; Zeevi, Assaf - In: Operations research 66 (2018) 6, pp. 1693-1712

Persistent link: https://www.econbiz.de/10011972263

On the tightness of an LP relaxation for rational optimization and its applications

Avadhanula, Vashist; Bhandari, Jalaj; Goyal, Vineet; … - In: Operations research letters 44 (2016) 5, pp. 612-617

Persistent link: https://www.econbiz.de/10011596500

Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-Stationary Rewards

Besbes, Omar - 2020

In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize cumulative expected earnings over...

Persistent link: https://www.econbiz.de/10012856685

Optimal dynamic assortment planning with demand learning

Sauré, Denis; Zeevi, Assaf - In: Manufacturing & service operations management : M & SOM 15 (2013) 3, pp. 387-404

Persistent link: https://www.econbiz.de/10009782027