Optimal assignment of sellers in a store with a random number of clients via the Armed Bandit model
Vázquez-Guevara, Víctor Hugo, (2017)
Optimising darts strategy using Markov decision processes and reinforcement learning
Baird, Graham, (2020)
Finitely additive dynamic programming
Sudderth, William D., (2016)
Solving Markov decision processes via state space decomposition and time aggregation
Alexandre, Rodrigo e Alvim, (2025)
Learning-agent-based simulation for queue network systems
Fuller, Daniel Barry, (2020)
Long-term integrated surgery room optimization and recovery ward planning, with a case study in the Brazilian National Institute of Traumatology and Orthopedics (INTO)
Siqueira, Cecília L., (2018)