The authors propose models for the solution of the fundamental problem of option replication subject to discrete trading, round lotting and nonlinear transaction costs using state-of-the-art methods in deep reinforcement learning (DRL), including deep Q-learning, deep Q-learning with Pop-Art and...