Delay-adaptive learning in generalized linear contextual bandits
Year of publication: | 2024 |
---|---|
Authors: | Blanchet, Jose ; Xu, Renyuan ; Zhou, Zhengyuan |
Published in: | Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 49.2024, 1, p. 326-345 |
Subject: | contextual bandits | delayed feedback | generalized linear model | MLE | Schätztheorie | Estimation theory | Lernprozess | Learning process | Lernen | Learning | Mathematische Optimierung | Mathematical programming |
- Artificial intelligence as structural estimation : Deep Blue, Bonanza, and AlphaGo
  Igami, Mitsuru (2020)
- LocalGLMnet : interpretable deep learning for tabular data
  Richman, Ronald (2023)
- Dynamic batch learning in high-dimensional sparse linear contextual bandits
  Ren, Zhimei (2024)
- Distributionally robust batch contextual bandits
  Si, Nian (2023)
- Learning to order for inventory systems with lost sales and uncertain supplies
  Chen, Boxiao (2024)
- Deterministic and stochastic wireless network games : equilibrium, dynamics, and price of anarchy
  Zhou, Zhengyuan (2018)