Persistent link: https://www.econbiz.de/10009772772
Persistent link: https://www.econbiz.de/10011297146
Persistent link: https://www.econbiz.de/10011341662
Persistent link: https://www.econbiz.de/10011946649
Persistent link: https://www.econbiz.de/10011628224
Persistent link: https://www.econbiz.de/10003962612
We consider a multiarmed bandit problem where the expected reward of each arm is a linear function of an unknown scalar with a prior distribution. The objective is to choose a sequence of arms that maximizes the expected total (or discounted total) reward. We demonstrate the effectiveness of a...
Persistent link: https://www.econbiz.de/10009432173
Persistent link: https://www.econbiz.de/10001053548
Persistent link: https://www.econbiz.de/10003985721
Persistent link: https://www.econbiz.de/10009787362