Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. Our results show that a very small number of batches gives close to minimax optimal...
Persistent link: https://www.econbiz.de/10013012898
Persistent link: https://www.econbiz.de/10012872401
Persistent link: https://www.econbiz.de/10015184961
Persistent link: https://www.econbiz.de/10015097275
Persistent link: https://www.econbiz.de/10015151630
Persistent link: https://www.econbiz.de/10015076452
Persistent link: https://www.econbiz.de/10015076700
Persistent link: https://www.econbiz.de/10015078773
Persistent link: https://www.econbiz.de/10015078775
Persistent link: https://www.econbiz.de/10013288205