Johnson, Kris; Simchi-Levi, David; Wang, He - 2015
("exploitation" objective). We propose a class of dynamic pricing algorithms that builds upon the simple yet powerful machine … learning technique known as Thompson sampling to address the challenge of balancing the exploration-exploitation tradeoff under …