Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model
Year of publication: |
October-December 2017
|
---|---|
Authors: | Vázquez-Guevara, Víctor Hugo ; Cruz-Suárez, Hugo ; Velasco-Luna, Fernando |
Published in: |
RAIRO / Operations research. - Les Ulis : EDP Sciences, ISSN 0399-0559, ZDB-ID 1481534-5. - Vol. 51.2017, 4, p. 1118-1132
|
Subject: | Armed bandit model | dynamic programming | assignment of personal | random horizon | markov decision processes | Theorie | Theory | Markov-Kette | Markov chain | Dynamische Optimierung | Dynamic programming | Entscheidung | Decision | Mathematische Optimierung | Mathematical programming |
-
On boundedness of Q-learning iterates for stochastic shortest path problems
Yu, Huizhen, (2013)
-
Finitely additive dynamic programming
Sudderth, William D., (2016)
-
Improved and generalized upper bounds on the complexity of policy iteration
Scherrer, Bruno, (2016)
- More ...
-
An envelope theorem and some applications to discounted Markov decision processes
Cruz-Suárez, Hugo, (2008)
-
An envelope theorem and some applications to discounted Markov decision processes
Cruz-Suárez, Hugo, (2008)
-
An envelope theorem and some applications to discounted Markov decision processes
Cruz-Suárez, Hugo, (2008)
- More ...