//-->
Analysis of Single Buffer Random Polling System With State-Dependent Input Process and Server/Station Breakdowns
Lee, Thomas Y.S., (2018)
Computing a bias-optimal policy in a discretetime Markov decision problem
Denardo, Eric V., (1970)
The multi-armed bandit, with constraints
Denardo, Eric V., (2013)