//--> //--> //-->
Toggle navigation
Logout
Change account settings
EN
DE
ES
FR
A-Z
Beta
About EconBiz
News
Thesaurus (STW)
Research Skills
Help
EN
DE
ES
FR
My account
Logout
Change account settings
Login
Publications
Events
Your search terms
Search
Retain my current filters
~person:"Yang, Zhuoran"
~subject:"Restless bandits"
~subject:"Theory"
Search options
All Fields
Title
Exact title
Subject
Author
Institution
ISBN/ISSN
Published in...
Publisher
Open Access only
Advanced
Search history
My EconBiz
Favorites
Loans
Reservations
Fines
You are here:
Home
Modeling Human Performance in...
Similar by subject
Narrow search
Delete all filters
| 3 applied filters
Year of publication
From:
To:
Subject
All
Restless bandits
Theory
Learning process
3
Lernprozess
3
reinforcement learning
3
Learning
2
Lernen
2
Theorie
2
Correlation
1
Experiment
1
Game theory
1
Korrelation
1
Learning organization
1
Lernende Organisation
1
Markov chain
1
Markov games
1
Markov-Kette
1
Mathematical programming
1
Mathematische Optimierung
1
Nash equilibrium
1
Nash-Gleichgewicht
1
Neural networks
1
Neuronale Netze
1
Spieltheorie
1
correlated equilibrium
1
episodic MDP
1
exploration
1
function approximation
1
linear function approximation
1
overparameterized neural network
1
temporal difference learning
1
more ...
less ...
Online availability
All
Undetermined
2
Type of publication
All
Article
2
Type of publication (narrower categories)
All
Article in journal
2
Aufsatz in Zeitschrift
2
Language
All
English
2
Author
All
Yang, Zhuoran
Jaimungal, Sebastian
4
Calvano, Emilio
3
Calzolari, Giacomo
3
Denicolò, Vincenzo
3
Dimitrakopoulos, Roussos
3
Pastorello, Sergio
3
Powell, Warren B.
3
Qin, Zhiwei
3
Ulmer, Marlin Wolf
3
Agrawal, Shipra
2
Ayesta, Urtzi
2
Bertasiute, Akvile
2
Brammer, Janis
2
Hao, Jin-Kao
2
Hildebrandt, Florentin D.
2
Jacko, Peter
2
Jia, Randy
2
Jiao, Yan
2
Lutz, Bernhard
2
Mannor, Shie
2
Massaro, Domenico
2
Mattfeld, Dirk C.
2
Neumann, Dirk
2
Ning, Brian
2
Niño-Mora, José
2
Ponomarenko, Alexey
2
Russo, Daniel
2
Tang, Xiaocheng
2
Van Roy, Benjamin
2
Wang, Zhaoran
2
Weber, Matthias
2
Xu, Zhe
2
Yaakoubi, Yassine
2
Ye, Jieping
2
Zhang, Fan
2
Zhu, Hongtu
2
Aboussalah, Amine Mohamed
1
Agasucci, Valerio
1
Agrawal, Priyank
1
more ...
less ...
Published in...
All
Mathematics of operations research
2
Source
All
ECONIS (ZBW)
2
Showing
1
-
2
of
2
Sort
relevance
articles prioritized
date (newest first)
date (oldest first)
1
Provably efficient reinforcement learning with linear function approximation
Jin, Chi
;
Yang, Zhuoran
;
Wang, Zhaoran
;
Jordan, Michael …
- In:
Mathematics of operations research
48
(
2023
)
3
,
pp. 1496-1521
Persistent link: https://www.econbiz.de/10014329343
Saved in:
2
Neural temporal difference and Q learning provably converge to global optima
Cai, Qi
;
Yang, Zhuoran
;
Lee, Jason D.
;
Wang, Zhaoran
- In:
Mathematics of operations research
49
(
2024
)
1
,
pp. 619-651
Persistent link: https://www.econbiz.de/10014527959
Saved in:
Results per page
10
25
50
100
250
A service of the
zbw
×
Loading...
//-->