Wang, Qiaochu; Huang, Yan; Singh, Param Vir - 2022
… Q-learning algorithm (a specific type of RL algorithm) is particularly appealing for pricing because it autonomously learns an optimal … that the Q-learning algorithm has a significant advantage over simple rule-based pricing algorithms; therefore, in a …. We find that when a Q-learning algorithm competes against a rule-based pricing algorithm, higher prices are sustained in …