Nonasymptotic analysis of Monte Carlo tree search
Year of publication: |
2022
|
---|---|
Authors: | Shah, Devavrat ; Xie, Qiaomin ; Xu, Zhi |
Published in: |
Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 70.2022, 6, p. 3234-3260
|
Subject: | Machine Learning and Data Science | Monte Carlo tree search | Nonstationary multi-armed bandit | reinforcement learning | Theorie | Theory | Monte-Carlo-Simulation | Monte Carlo simulation | Künstliche Intelligenz | Artificial intelligence | Lernprozess | Learning process | Lernen | Learning | Operations Research | Operations research |
-
Mo, Zhaobin, (2023)
-
Online model-based reinforcement learning for decision-making in long distance routes
Alcaraz, Juan J., (2022)
-
Single-machine scheduling with times-based and job-dependent learning effect
Jiang, Zhongyi, (2017)
- More ...
-
Greed works : online algorithms for unrelated machine stochastic scheduling
Gupta, Varun, (2020)
-
Xie, Qiaomin, (2023)
-
Does attention affect individual investors' investment return?
Shi, Rongsheng, (2012)
- More ...