Showing 1 - 4 of 4
Persistent link: https://www.econbiz.de/10011932621
Persistent link: https://www.econbiz.de/10013461447
Persistent link: https://www.econbiz.de/10015411202
Weakly coupled Markov decision processes (WDPs) arise in dynamic decision-making and reinforcement learning. These models are often high dimensional but decompose into smaller component MDPs when coupling constraints are relaxed. Lagrangian relaxations of WDPs that dualize linking constraints...
Persistent link: https://www.econbiz.de/10013291518