EconBiz - Find Economic Literature
    • Logout
    • Change account settings
  • A-Z
  • Beta
  • About EconBiz
  • News
  • Thesaurus (STW)
  • Academic Skills
  • Help
  •  My account 
    • Logout
    • Change account settings
  • Login
EconBiz - Find Economic Literature
Publications Events
Search options
Advanced Search history
My EconBiz
Favorites Loans Reservations Fines
    You are here:
  • Home
  • Search: subject:"Policy gradient"
Narrow search

Narrow search

Year of publication
Subject
All
Theorie 9 Theory 9 reinforcement learning 8 Learning process 5 Lernprozess 5 Algorithm 4 Algorithmus 4 Artificial intelligence 4 Künstliche Intelligenz 4 Mathematical programming 4 Mathematische Optimierung 4 policy gradient 4 Machine Learning and Data Science 3 Deep deterministic policy gradient 2 Dynamic programming 2 Dynamische Optimierung 2 Learning 2 Lernen 2 Markov chain 2 Markov-Kette 2 Policy gradient 2 policy gradient methods 2 60J05 1 AGV path planning 1 Agent-based modeling 1 Agentenbasierte Modellierung 1 AgriPoliS 1 Aktienrückkauf 1 Allocation 1 Allokation 1 Anti-conflict 1 Auction theory 1 Auktionstheorie 1 Automated container terminal 1 Automation 1 Automatisierung 1 Bertrand equilibrium 1 Central pattern generators 1 Competition in uniform price auctions 1 Container terminal 1
more ... less ...
Online availability
All
Undetermined 11 Free 4
Type of publication
All
Article 15
Type of publication (narrower categories)
All
Article in journal 12 Aufsatz in Zeitschrift 12 Article 1 research-article 1 viewpoint 1
Language
All
English 15
Author
All
Russo, Daniel 2 Appel, Franziska 1 Bhandari, Jalaj 1 Cen, Shicong 1 Chen, Yuxin 1 Cheng, Chen 1 Chi, Yuejie 1 Chow, Andy H. F. 1 Coache, Anthony 1 Cui, Songyi 1 Deng, Yang 1 Denisov, Denis 1 Dong, Changxing 1 Graf, Christoph 1 Hamdouche, Mohamed 1 Hasan Monadjemi, Amir 1 Henry-Labordere, Pierre 1 Hu, Hongtao 1 Jaimungal, Sebastian 1 Jamshidi, Kamal 1 Klöckl, Claude 1 Kuo, Yong-Hong 1 Li, Shaodong 1 Lin, Xudong 1 Lin, Yifan 1 Luo, Suyuan 1 Moharrami, Mehrdad 1 Murthy, Yashaswini 1 Njiru, Ruth Dionisia Gicuku 1 Pham, Huyên 1 Roy, Arghyadip 1 Schmidt, Johannes 1 Shahbazi, Hamed 1 Srikant, Rayadurgam 1 Walton, Neil 1 Wang, Feiyang 1 Wang, Yuhao 1 Wei, Yuting 1 Xiao, Shichang 1 Yan, Yimo 1
more ... less ...
Published in...
All
Operations research 3 Mathematics of operations research 2 Transportation research / E : an international journal 2 Applied mathematical finance 1 Computational economics 1 Electronic Communications of the EASST 1 Industrial Robot: An International Journal 1 Industrial Robot: the international journal of robotics research and application 1 International journal of production research 1 Management science : journal of the Institute for Operations Research and the Management Sciences 1 Mathematical finance : an international journal of mathematics, statistics and financial economics 1
more ... less ...
Source
All
ECONIS (ZBW) 12 Other ZBW resources 2 EconStor 1
Showing 1 - 10 of 15
Cover Image
Deep Reinforcement Learning in agent-based model AgriPoliS to simulate strategic land market interactions
Dong, Changxing; Njiru, Ruth Dionisia Gicuku; Appel, … - In: Electronic Communications of the EASST 83 (2025), pp. 1-18
stable and sustainable development. Utilizing a policy gradient algorithm, we update the RL agent's policy network to …
Persistent link: https://www.econbiz.de/10015210972
Saved in:
Cover Image
Global optimality guarantees for policy gradient methods
Bhandari, Jalaj; Russo, Daniel - In: Operations research 72 (2024) 5, pp. 1906-1927
Persistent link: https://www.econbiz.de/10015361758
Saved in:
Cover Image
Computational performance of deep reinforcement learning to find Nash equilibria
Graf, Christoph; Zobernig, Viktor; Schmidt, Johannes; … - In: Computational economics 63 (2024) 2, pp. 529-576
Persistent link: https://www.econbiz.de/10014472392
Saved in:
Cover Image
A policy gradient algorithm for the risk-sensitive exponential cost MDP
Moharrami, Mehrdad; Murthy, Yashaswini; Roy, Arghyadip; … - In: Mathematics of operations research 50 (2025) 1, pp. 431-458
Persistent link: https://www.econbiz.de/10015211728
Saved in:
Cover Image
Reusing historical trajectories in natural policy gradient via importance sampling : convergence and convergence rate
Lin, Yifan; Wang, Yuhao; Zhou, Enlu - In: Operations research 73 (2025) 6, pp. 3010-3026
Persistent link: https://www.econbiz.de/10015550999
Saved in:
Cover Image
Fast global convergence of natural policy gradient methods with entropy regularization
Cen, Shicong; Cheng, Chen; Chen, Yuxin; Wei, Yuting; … - In: Operations research 70 (2022) 4, pp. 2563-2578
Persistent link: https://www.econbiz.de/10013366515
Saved in:
Cover Image
Reinforcement learning with dynamic convex risk measures
Coache, Anthony; Jaimungal, Sebastian - In: Mathematical finance : an international journal of … 34 (2024) 2, pp. 557-587
Persistent link: https://www.econbiz.de/10014514792
Saved in:
Cover Image
Approximation benefits of policy gradient methods with aggregated states
Russo, Daniel - In: Management science : journal of the Institute for … 69 (2023) 11, pp. 6898-6911
Persistent link: https://www.econbiz.de/10014435438
Saved in:
Cover Image
Regret analysis of a Markov policy gradient algorithm for multiarm bandits
Walton, Neil; Denisov, Denis - In: Mathematics of operations research 48 (2023) 3, pp. 1553-1588
Persistent link: https://www.econbiz.de/10014329345
Saved in:
Cover Image
A policy gradient approach to solving dynamic assignment problem for on-site service delivery
Yan, Yimo; Deng, Yang; Cui, Songyi; Kuo, Yong-Hong; … - In: Transportation research / E : an international journal 178 (2023), pp. 1-26
Persistent link: https://www.econbiz.de/10014437658
Saved in:
  • 1
  • 2
  • Next
  • Last
A service of the
zbw
FAQ-Assistent (beta)
  • Sitemap
  • Plain language
  • Accessibility
  • Contact us
  • Imprint
  • Privacy

Loading...