Showing 1 - 1 of 1
We consider off-policy evaluation and optimization with continuous action spaces. We focus on observational data where the data collection policy is unknown and needs to be estimated. We take a semi-parametric approach where the value function takes a known parametric form in the treatment, but...
Persistent link: https://www.econbiz.de/10012014174