Nonasymptotic analysis of Monte Carlo tree search
| Year of publication: |
2022
|
|---|---|
| Authors: | Shah, Devavrat ; Xie, Qiaomin ; Xu, Zhi |
| Published in: |
Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 70.2022, 6, p. 3234-3260
|
| Subject: | Machine Learning and Data Science | Monte Carlo tree search | Nonstationary multi-armed bandit | reinforcement learning | Theorie | Theory | Monte-Carlo-Simulation | Monte Carlo simulation | Künstliche Intelligenz | Artificial intelligence | Lernprozess | Learning process | Lernen | Learning | Operations Research | Operations research |
-
Mo, Zhaobin, (2023)
-
Online model-based reinforcement learning for decision-making in long distance routes
Alcaraz, Juan J., (2022)
-
Learning generalized strong branching for set covering, set packing, and 0-1 knapsack problems
Yang, Yu, (2022)
- More ...
-
Greed works : online algorithms for unrelated machine stochastic scheduling
Gupta, Varun, (2020)
-
Tsitsiklis, John N., (2021)
-
Zhang, Feng, (2023)
- More ...