Showing 1 - 10 of 77
We examine collaboration in a one-arm bandit problem in which the players' actions affect the distribution over future payoffs. The players need to exert costly effort both to enhance the value of a risky technology and to learn about its current state. Both product value and learning are public...
Persistent link: https://www.econbiz.de/10011557309
Persistent link: https://www.econbiz.de/10014330454
Persistent link: https://www.econbiz.de/10012431697
The advent of reinforcement learning (RL) in financial markets is driven by several advantages inherent to this field of artificial intelligence. In particular, RL allows to combine the "prediction" and the "portfolio construction" task in one integrated step, thereby closely aligning the...
Persistent link: https://www.econbiz.de/10011911059
A novel debate within competition policy and regulation circles is whether autonomous machine learning algorithms may learn to collude on prices. We show that when firms face short-run price commitments, independent Q-learning (a simple but well-established self-learning algorithm) learns to...
Persistent link: https://www.econbiz.de/10011932327
Persistent link: https://www.econbiz.de/10011378587
Persistent link: https://www.econbiz.de/10011975691
Persistent link: https://www.econbiz.de/10012025704
Persistent link: https://www.econbiz.de/10010428062
Persistent link: https://www.econbiz.de/10012111530