Showing 1 - 2 of 2
Persistent link: https://www.econbiz.de/10005277514
This paper deals with approximate value iteration (AVI) algorithms applied to discounted dynamic programming (DP) problems. For a fixed control policy, the span semi-norm of the so-called Bellman residual is shown to be convex in the Banach space of candidate solutions to the DP problem. This...
Persistent link: https://www.econbiz.de/10008865135