Similar Search Results

Approachability in repeated games: Computational aspects and a Stackelberg variant

Mannor, Shie; Tsitsiklis, John N. - In: Games and Economic Behavior 66 (2009) 1, pp. 315-325

We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is "approachable," that is, whether Player 1 (the "decision maker") can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the...

Persistent link: https://www.econbiz.de/10005066714

A contract-based model for directed network formation

Johari, Ramesh; Mannor, Shie; Tsitsiklis, John N. - In: Games and Economic Behavior 56 (2006) 2, pp. 201-224

Persistent link: https://www.econbiz.de/10005413666

Bias and Variance Approximation in Value Function Estimates

Mannor, Shie; Simester, Duncan; Sun, Peng; Tsitsiklis, … - In: Management Science 53 (2007) 2, pp. 308-322

We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance,...

Persistent link: https://www.econbiz.de/10009209247

Regret minimization in repeated matrix games with variable stage duration

Mannor, Shie; Shimkin, Nahum - In: Games and Economic Behavior 63 (2008) 1, pp. 227-258

Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol....

Persistent link: https://www.econbiz.de/10005413696

A general framework for bandit problems beyond cumulative objectives

Cassel, Asaf; Mannor, Shie; Zeevi, Assaf - In: Mathematics of operations research 48 (2023) 4, pp. 2196-2232

Persistent link: https://www.econbiz.de/10014437823

Statistical optimization in high dimensions

Xu, Huan; Caramanis, Constantine; Mannor, Shie - In: Operations research 64 (2016) 4, pp. 958-979

Persistent link: https://www.econbiz.de/10011538580

Reinforcement learning in robust Markov decision processes

Lim, Shiau Hong; Xu, Huan; Mannor, Shie - In: Mathematics of operations research 41 (2016) 4, pp. 1325-1353

Persistent link: https://www.econbiz.de/10011595063

Robust MDPs with k-rectangular uncertainty

Mannor, Shie; Mebel, Ofir; Xu, Huan - In: Mathematics of operations research 41 (2016) 4, pp. 1484-1509

Persistent link: https://www.econbiz.de/10011595106

When is it important to know you've been rejected? A search problem with probabilistic appearance of offers

Das, Sanmay; Tsitsiklis, John N. - In: Journal of Economic Behavior & Organization 74 (2010) 1-2, pp. 104-122

A problem that often arises in the process of searching for a job or for a candidate to fill a position is that applicants do not know if they will receive an offer from any given firm with which they interview, and, conversely, firms do not know whether applicants will definitely take positions...

Persistent link: https://www.econbiz.de/10008507088

Private sequential learning

Tsitsiklis, John N.; Xu, Kuang; Xu, Zhi - In: Operations research 69 (2021) 5, pp. 1575-1590

Persistent link: https://www.econbiz.de/10012661070