//--> //--> //-->
Toggle navigation
Logout
Change account settings
EN
DE
ES
FR
A-Z
Beta
About EconBiz
News
Thesaurus (STW)
Academic Skills
Help
EN
DE
ES
FR
My account
Logout
Change account settings
Login
Publications
Events
Your search terms
Search
Search options
All Fields
Title
Exact title
Subject
Author
Institution
ISBN/ISSN
Published in...
Publisher
Open Access only
Advanced
Search history
My EconBiz
Favorites
Loans
Reservations
Fines
You are here:
Home
Search: subject:"offline reinforcement learning"
Narrow search
Narrow search
Year of publication
From:
To:
Subject
All
Learning
3
Learning process
3
Lernen
3
Lernprozess
3
Theorie
3
Theory
3
offline reinforcement learning
3
Artificial intelligence
2
Künstliche Intelligenz
2
Advertising
1
Bellman residual minimization
1
Consumer behaviour
1
Dynamic programming
1
Dynamische Optimierung
1
Konsumentenverhalten
1
Machine Learning and Data Science
1
Markov chain
1
Markov-Kette
1
Scheduling problem
1
Scheduling-Verfahren
1
Werbung
1
adaptive interventions
1
advertising
1
dynamic programming
1
fast rates
1
fitted Q-iteration
1
machine learning
1
margin condition
1
personalization
1
policy evaluation
1
semiparametric efficiency
1
unmeasured confounding
1
more ...
less ...
Online availability
All
Undetermined
3
Type of publication
All
Article
3
Type of publication (narrower categories)
All
Article in journal
3
Aufsatz in Zeitschrift
3
Language
All
English
3
Author
All
Kallus, Nathan
2
Bennett, Andrew
1
Hu, Yichun
1
Rafieian, Omid
1
Uehara, Masatoshi
1
Published in...
All
Marketing science
1
Mathematics of operations research
1
Operations research
1
Source
All
ECONIS (ZBW)
3
Showing
1
-
3
of
3
Sort
relevance
articles prioritized
date (newest first)
date (oldest first)
1
Fast rates for the regret of
offline
reinforcement
learning
Hu, Yichun
;
Kallus, Nathan
;
Uehara, Masatoshi
- In:
Mathematics of operations research
50
(
2025
)
1
,
pp. 633-655
Persistent link: https://www.econbiz.de/10015211758
Saved in:
2
Proximal reinforcement learning : efficient off-policy evaluation in partially observed markov decision processes
Bennett, Andrew
;
Kallus, Nathan
- In:
Operations research
72
(
2024
)
3
,
pp. 1071-1086
Persistent link: https://www.econbiz.de/10014557447
Saved in:
3
Optimizing user engagement through adaptive ad sequencing
Rafieian, Omid
- In:
Marketing science
42
(
2023
)
5
,
pp. 910-933
Persistent link: https://www.econbiz.de/10014393384
Saved in:
Results per page
10
25
50
100
250
A service of the
zbw
×
Loading...
//-->