Valuing programs with deterministic and stochastic cycles
In many dynamic programming problems, a mix of state variables exists - some exhibiting stochastic cycles and others having deterministic cycles. We derive a formula for the value function in infinite-horizon, stationary, Markovian decision problems by exploiting a special partitioned-circulant structure of the transition matrix [Pi]. Our strategy for computing the left-inverse of the matrix [I-[beta][Pi]], which is central to implementing Howard's policy iteration algorithm, yields significant improvements in computation time and major reductions in memory required. When the deterministic cycle is of order n, our cyclic inversion algorithm yields an O(n2) speed-up relative to the usual policy iteration algorithm.
Year of publication: |
2009
|
---|---|
Authors: | Paarsch, Harry J. ; Rust, John |
Published in: |
Journal of Economic Dynamics and Control. - Elsevier, ISSN 0165-1889. - Vol. 33.2009, 3, p. 614-623
|
Publisher: |
Elsevier |
Keywords: | Dynamic programming Policy iteration Deterministic cycles Stochastic cycles Circulant matrix Cyclic inversion algorithm |
Saved in:
Saved in favorites
Similar items by person
-
Is the ‘Linkage Principle’ Valid? Evidence from the Field
Cho, Sung-Jin, (2014)
-
Stochastic Dynamic Programming in Space: An Application to British Columbia Forestry
Rust, John, (2005)
-
Intertemporal Productivity under Fixed Wages and Piece Rates
Rust, John, (2007)
- More ...