Valuing programs with deterministic and stochastic cycles

In many dynamic programming problems, a mix of state variables exists - some exhibiting stochastic cycles and others having deterministic cycles. We derive a formula for the value function in infinite-horizon, stationary, Markovian decision problems by exploiting a special partitioned-circulant structure of the transition matrix [Pi]. Our strategy for computing the left-inverse of the matrix [I-[beta][Pi]], which is central to implementing Howard's policy iteration algorithm, yields significant improvements in computation time and major reductions in memory required. When the deterministic cycle is of order n, our cyclic inversion algorithm yields an O(n2) speed-up relative to the usual policy iteration algorithm.

MoreLess

Year of publication:	2009
Authors:	Paarsch, Harry J. ; Rust, John
Published in:	Journal of Economic Dynamics and Control. - Elsevier, ISSN 0165-1889. - Vol. 33.2009, 3, p. 614-623
Publisher:	Elsevier
Keywords:	Dynamic programming Policy iteration Deterministic cycles Stochastic cycles Circulant matrix Cyclic inversion algorithm

More details

Type of publication:	Article
Source:	RePEc - Research Papers in Economics

Persistent link: https://www.econbiz.de/10005160877