We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is "approachable," that is, whether Player 1 (the "decision maker") can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the...
Persistent link: https://www.econbiz.de/10005066714
Persistent link: https://www.econbiz.de/10005413666
We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance,...
Persistent link: https://www.econbiz.de/10009209247
Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol....
Persistent link: https://www.econbiz.de/10005413696
Persistent link: https://www.econbiz.de/10014437823
Persistent link: https://www.econbiz.de/10011538580
Persistent link: https://www.econbiz.de/10011595063
Persistent link: https://www.econbiz.de/10011595106
A problem that often arises in the process of searching for a job or for a candidate to fill a position is that applicants do not know if they will receive an offer from any given firm with which they interview, and, conversely, firms do not know whether applicants will definitely take positions...
Persistent link: https://www.econbiz.de/10008507088
Persistent link: https://www.econbiz.de/10012661070