Similar Search Results

Unraveling of cooperation in dynamic collaboration

Vasama, Suvi - 2016

We examine collaboration in a one-arm bandit problem in which the players' actions affect the distribution over future payoffs. The players need to exert costly effort both to enhance the value of a risky technology and to learn about its current state. Both product value and learning are public...

Persistent link: https://www.econbiz.de/10011557309

Learning optimal solutions via an LSTM-optimization framework

Yilmaz, Dogacan; Büyüktahtakın, İ. Esra - In: Operations research forum 4 (2023) 2, pp. 1-40

Persistent link: https://www.econbiz.de/10014330454

Pooling or fooling? : an experiment on signaling

Feri, Francesco; Meléndez-Jiménez, Miguel A.; Ponti, … - In: Journal of economic behavior & organization : JEBO 176 (2020), pp. 582-596

Persistent link: https://www.econbiz.de/10012431697

Statistical mechanics approach to a reinforcement learning model with memory

Lipowski, Adam; Gontarek, Krzysztof; Ausloos, Marcel - In: Physica A: Statistical Mechanics and its Applications 388 (2009) 9, pp. 1849-1856

We introduce a two-player model of reinforcement learning with memory. Past actions of an iterated game are stored in a memory and used to determine player’s next action. To examine the behaviour of the model some approximate methods are used and confronted against numerical simulations and...

Persistent link: https://www.econbiz.de/10010873298

HOW INDIVIDUALS LEARN TO TAKE TURNS: EMERGENCE OF ALTERNATING COOPERATION IN A CONGESTION GAME AND THE PRISONER'S DILEMMA

HELBING, DIRK; SCHÖNHOF, MARTIN; STARK, HANS-ULRICH; … - In: Advances in Complex Systems (ACS) 08 (2005) 01, pp. 87-116

In many social dilemmas, individuals tend to generate a situation with low payoffs instead of a system optimum ("tragedy of the commons"). Is the routing of traffic a similar problem? In order to address this question, we present experimental results on humans playing a route choice game in a...

Persistent link: https://www.econbiz.de/10005080922

Reinforcement learning in financial markets - a survey

Fischer, Thomas G. - 2018

The advent of reinforcement learning (RL) in financial markets is driven by several advantages inherent to this field of artificial intelligence. In particular, RL allows to combine the "prediction" and the "portfolio construction" task in one integrated step, thereby closely aligning the...

Persistent link: https://www.econbiz.de/10011911059

Assessing Autonomous Algorithmic Collusion: Q-Learning Under Short-Run Price Commitments

Klein, Timo - 2018

A novel debate within competition policy and regulation circles is whether autonomous machine learning algorithms may learn to collude on prices. We show that when firms face short-run price commitments, independent Q-learning (a simple but well-established self-learning algorithm) learns to...

Persistent link: https://www.econbiz.de/10011932327

Nonconvergence to saddle boundary points under perturbed reinforcement learning

Chasparis, Georgios C.; Shamma, Jeff S.; Rantzer, Anders - In: International journal of game theory : official journal … 44 (2015) 3, pp. 667-699

Persistent link: https://www.econbiz.de/10011378587

Observational and reinforcement pattern-learning : an exploratory study

Hanaki, Nobuyuki; Kirman, Alan P.; Pezanis-Christou, Paul - In: European economic review : EER 104 (2018), pp. 1-21

Persistent link: https://www.econbiz.de/10011975691

Riemannian game dynamics

Mertikopoulos, Panayotis; Sandholm, William H. - In: Journal of economic theory 177 (2018), pp. 315-364

Persistent link: https://www.econbiz.de/10012025704

1
2
3
4
5
6
7
8
Next
Last