Gosavi, Abhijit - 2015 - 2nd ed. 2015
Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning introduces the evolving … optimization via temporal differences and Reinforcement Learning: Q-Learning, SARSA, and R-SMART algorithms, and policy search, via … API, Q-P-Learning, actor-critics, and learning automata · A special examination of neural-network-based function …