Sublinear regret for learning POMDPs
Year of publication: |
2022
|
---|---|
Authors: | Xiong, Yi ; Chen, Ningyuan ; Gao, Xuefeng ; Zhou, Sean Xiang |
Published in: |
Production and operations management : the flagship research journal of the Production and Operations Management Society. - London : Sage Publications, ISSN 1937-5956, ZDB-ID 2151364-8. - Vol. 31.2022, 9, p. 3491-3504
|
Subject: | exploration-exploitation | online learning | partially observable MDP | spectral estimator | E-Learning | E-learning | Schätztheorie | Estimation theory | Lernprozess | Learning process | Lernen | Learning |
-
Chang, Victor, (2022)
-
A report on the online learning experience of students in accounting course
Lam, Jeanne Y. C., (2015)
-
Learning in random utility models via online decision problems
Melo, Emerson, (2021)
- More ...
-
Add‐On Pricing in a Distribution Channel
Yin, Qianbo, (2021)
-
Trade‐in for Cash or for Upgrade? Dynamic Pricing with Customer Choice
Xiao, Yongbo, (2019)
-
Customer Satisfaction, Advertising Competition, and Platform Performance
Yang, Chaolin, (2021)
- More ...