A Structured Multiarmed Bandit Problem and the Greedy Policy
| Year of publication: | 2009-03 |
|---|---|
| Authors: | Rusmevichientong, Paat ; Mersereau, Adam J. ; Tsitsiklis, John N. |
| Publisher: | Institute of Electrical and Electronics Engineers |
| Subject: | Markov decision process (MDP) |
-
Neighbourhood Search for constructing Pareto sets
Dorini, G., (2007)
-
Optimal patient assignment for W queueing network in a diagnostic facility setting
Geng, Na, (2017)
-
Linearly parameterized bandits
Rusmevichientong, Paat, (2010)
-
The value of field experiments
Li, Jimmy Q., (2015)
-
Linearly Parameterized Bandits
Rusmevichientong, Paat, (2010)