Robustness of stochastic bandit policies
Year of publication: | 2014 |
---|---|
Authors: | Salomon, Antoine ; Audibert, Jean-Yves |
Institutions: | Université Paris-Dauphine (Paris IX) |
Subject: | Exploration–exploitation tradeoff | Multi-armed stochastic bandit | Regret deviations/risk |
Type of publication: | Book / Working Paper |
Notes: | Published in Theoretical Computer Science, 2014, Vol. 519, pp. 46-67. Length: 21 pages |
Classification: | C73 - Stochastic and Dynamic Games ; D81 - Criteria for Decision-Making under Risk and Uncertainty |
-
Evolutionary Stability of Prospect Theory Preferences
Rieger, Marc Oliver, (2009)
-
Equilibria in Games with Prospect Theory Preferences
Metzger, Lars P., (2009)
-
Stringhi, Alessandro, (2025)
-
Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem.
El Alaoui, Issam, (2013)
-
On games of strategic experimentation
Salomon, Antoine, (2013)
-
Regret in online combinatorial optimization
Audibert, Jean-Yves, (2014)