Similar Search Results

Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon

Hambly, Ben M.; Xu, Renyuan; Yang, Huining - 2021

We explore reinforcement learning methods for finding the optimal policy in the linear quadratic regulator (LQR) problem. In particular we consider the convergence of policy gradient methods in the setting of known and unknown parameters. We are able to produce a global linear convergence...

Persistent link: https://www.econbiz.de/10013251559

An Optimal Information Gathering Algorithm

Di Caprio, Debora; Santos‐Arteaga, Francisco - 2012

are illustrated numerically for a variety of utility functions commonly used in decision theory …

Persistent link: https://www.econbiz.de/10014166100

Optimal information disclosure : a linear programming approach

Kolotilin, Anton - In: Theoretical economics : TE ; an open access journal in … 13 (2018) 2, pp. 607-635

An uninformed sender designs a mechanism that discloses information about her type to a privately informed receiver, who then decides whether to act. I impose a single-crossing assumption, so that the receiver with a higher type is more willing to act. Using a linear programming approach, I...

Persistent link: https://www.econbiz.de/10011856702

Fast Fourier Transform and its Applications to Integer Knapsack Problems

Nesterov, Yurii - 2005

In this paper we suggest a new efficient technique for solving integer knapsack problems. Our algorithms can be seen as application of Fast Fourier Transform to generating functions of integer polytopes. Using this approach, it is possible to count the number of boolean solutions of a single...

Persistent link: https://www.econbiz.de/10014066592

Linear quadratic approximation of rationally inattentive control problems

Miao, Jianjun; Zhang, Bo - In: Macroeconomic dynamics 28 (2024) 6, pp. 1371-1393

Persistent link: https://www.econbiz.de/10015154352

Multivariate rational inattention

Miao, Jianjun; Wu, Jieran; Young, Eric R. - In: Econometrica : journal of the Econometric Society, an … 90 (2022) 2, pp. 907-945

Persistent link: https://www.econbiz.de/10013190108

Optimal Information Disclosure : A Linear Programming Approach

Kolotilin, Anton - 2016

Persistent link: https://www.econbiz.de/10012979703

A Tenure-Clock Problem

Chen, Chia-hui - 2015

We consider a “tenure-clock problem” in which a principal may set a deadline by which she needs to evaluate an agent's ability and decides whether to promote him or not. We embed this problem in a continuous-time model with both hidden action and hidden information, where the principal must...

Persistent link: https://www.econbiz.de/10013030521

Optimism and Communication

Chen, Ying - 2010

I examine how the communication incentive of an agent (sender) changes when the prior of the principal (receiver) about the agent's private information becomes more optimistic (in the sense of monotone likelihood ratio dominance). I use the canonical model of strategic communication (Crawford...

Persistent link: https://www.econbiz.de/10014190361

An approximate dynamic programming approach for sequential pig marketing decisions at herd level

Pourmoayed, Reza; Nielsen, Lars Relund - In: European journal of operational research : EJOR 276 (2019) 3, pp. 1056-1070

Persistent link: https://www.econbiz.de/10012003710