Showing 1 - 1 of 1
We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance,...
Persistent link: https://www.econbiz.de/10009209247