Showing 1 - 10 of 79
We examine collaboration in a one-arm bandit problem in which the players' actions affect the distribution over future payoffs. The players need to exert costly effort both to enhance the value of a risky technology and to learn about its current state. Both product value and learning are public...
Persistent link: https://www.econbiz.de/10011557309
Persistent link: https://www.econbiz.de/10014330454
Persistent link: https://www.econbiz.de/10012431697
We introduce a two-player model of reinforcement learning with memory. Past actions of an iterated game are stored in a memory and used to determine player’s next action. To examine the behaviour of the model some approximate methods are used and confronted against numerical simulations and...
Persistent link: https://www.econbiz.de/10010873298
In many social dilemmas, individuals tend to generate a situation with low payoffs instead of a system optimum ("tragedy of the commons"). Is the routing of traffic a similar problem? In order to address this question, we present experimental results on humans playing a route choice game in a...
Persistent link: https://www.econbiz.de/10005080922
The advent of reinforcement learning (RL) in financial markets is driven by several advantages inherent to this field of artificial intelligence. In particular, RL allows to combine the "prediction" and the "portfolio construction" task in one integrated step, thereby closely aligning the...
Persistent link: https://www.econbiz.de/10011911059
A novel debate within competition policy and regulation circles is whether autonomous machine learning algorithms may learn to collude on prices. We show that when firms face short-run price commitments, independent Q-learning (a simple but well-established self-learning algorithm) learns to...
Persistent link: https://www.econbiz.de/10011932327
Persistent link: https://www.econbiz.de/10011378587
Persistent link: https://www.econbiz.de/10011975691
Persistent link: https://www.econbiz.de/10012025704