Regret bound for Narendra-Shapiro bandit algorithms
Year of publication: |
2015-02
|
---|---|
Authors: | Gadat, Sébastien ; Panloup, F. ; Saadane, Sofiane |
Institutions: | Toulouse School of Economics (TSE) |
Subject: | Regret | Stochastic Bandit Algorithms | Piecewise Deterministic Markov Processes |
-
Optimal stopping for partially observed piecewise-deterministic Markov processes
Brandejsky, Adrien, (2013)
-
MDP algorithms for portfolio optimization problems in pure jump markets
Bäuerle, Nicole, (2009)
-
Ruin probabilities in multivariate risk models with periodic common shock
Cojocaru, Ionica, (2017)
- More ...
-
Gadat, Sébastien, (2016)
-
Classification with the nearest neighbor rule in general finite dimensional spaces
Gadat, Sébastien, (2014)
-
Individual human cytotoxic T Lymphocytes exhibit intraclonal heterogeneity in cumulative killing
Vasconcelos, Z., (2014)
- More ...