Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem.
Year of publication: | 2013-01 |
---|---|
Authors: | El Alaoui, Issam ; Audibert, Jean-Yves ; Salomon, Antoine |
Institutions: | Université Paris-Dauphine (Paris IX) |
Subject: | Consistency | regret lower bounds | selectivity | stochastic bandits | UCB policies |
Extent: | application/pdf |
Series: | |
Type of publication: | Book / Working Paper |
Notes: | Published in Journal of Machine Learning Research, 2013, Vol. 14, no. 1, pp. 187-207. Length: 20 pages |
Classification: | D81 - Criteria for Decision-Making under Risk and Uncertainty ; C73 - Stochastic and Dynamic Games |
Source: |
-
Evolutionary Stability of Prospect Theory Preferences
Rieger, Marc Oliver, (2009)
-
Equilibria in Games with Prospect Theory Preferences
Metzger, Lars P., (2009)
-
Stringhi, Alessandro, (2025)
-
Robustness of stochastic bandit policies
Salomon, Antoine, (2014)
-
On games of strategic experimentation
Salomon, Antoine, (2013)
-
Regret in online combinatorial optimization
Audibert, Jean-Yves, (2014)