Search: subject:"Policy gradient"
Showing 1-10 of 15
1. Deep Reinforcement Learning in agent-based model AgriPoliS to simulate strategic land market interactions
Dong, Changxing; Njiru, Ruth Dionisia Gicuku; Appel, … - In: Electronic Communications of the EASST 83 (2025), pp. 1-18
… stable and sustainable development. Utilizing a policy gradient algorithm, we update the RL agent's policy network to …
Persistent link: https://www.econbiz.de/10015210972

2. Global optimality guarantees for policy gradient methods
Bhandari, Jalaj; Russo, Daniel - In: Operations research 72 (2024) 5, pp. 1906-1927
Persistent link: https://www.econbiz.de/10015361758

3. Computational performance of deep reinforcement learning to find Nash equilibria
Graf, Christoph; Zobernig, Viktor; Schmidt, Johannes; … - In: Computational economics 63 (2024) 2, pp. 529-576
Persistent link: https://www.econbiz.de/10014472392

4. A policy gradient algorithm for the risk-sensitive exponential cost MDP
Moharrami, Mehrdad; Murthy, Yashaswini; Roy, Arghyadip; … - In: Mathematics of operations research 50 (2025) 1, pp. 431-458
Persistent link: https://www.econbiz.de/10015211728

5. Reusing historical trajectories in natural policy gradient via importance sampling : convergence and convergence rate
Lin, Yifan; Wang, Yuhao; Zhou, Enlu - In: Operations research 73 (2025) 6, pp. 3010-3026
Persistent link: https://www.econbiz.de/10015550999

6. Fast global convergence of natural policy gradient methods with entropy regularization
Cen, Shicong; Cheng, Chen; Chen, Yuxin; Wei, Yuting; … - In: Operations research 70 (2022) 4, pp. 2563-2578
Persistent link: https://www.econbiz.de/10013366515

7. Reinforcement learning with dynamic convex risk measures
Coache, Anthony; Jaimungal, Sebastian - In: Mathematical finance : an international journal of … 34 (2024) 2, pp. 557-587
Persistent link: https://www.econbiz.de/10014514792

8. Approximation benefits of policy gradient methods with aggregated states
Russo, Daniel - In: Management science : journal of the Institute for … 69 (2023) 11, pp. 6898-6911
Persistent link: https://www.econbiz.de/10014435438

9. Regret analysis of a Markov policy gradient algorithm for multiarm bandits
Walton, Neil; Denisov, Denis - In: Mathematics of operations research 48 (2023) 3, pp. 1553-1588
Persistent link: https://www.econbiz.de/10014329345

10. A policy gradient approach to solving dynamic assignment problem for on-site service delivery
Yan, Yimo; Deng, Yang; Cui, Songyi; Kuo, Yong-Hong; … - In: Transportation research / E : an international journal 178 (2023), pp. 1-26
Persistent link: https://www.econbiz.de/10014437658

A service of the ZBW