Showing 1 - 10 of 33
In this paper discounted and average Markov decision processes with finite state space and countable action set (semi-infinite MDP for short) are discussed. Without ordinary continuity and compactness conditions, for discounted semi-infinite MDP we have shown that by exploiting the results on...
Persistent link: https://www.econbiz.de/10010999567
In this paper weighted singularly perturbed hybrid stochastic systems are discussed. Under some reasonable assumptions, it is shown that there exists a uniformly δ-optimal policy when the perturbation is sufficiently small. Copyright Springer-Verlag Berlin Heidelberg 2005
Persistent link: https://www.econbiz.de/10010847546
In this paper we consider the weighted reward Markov decision process, with perturbation. The “weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward criteria. This criterion allows the controller to trade-off short-term costs...
Persistent link: https://www.econbiz.de/10010847712
In this paper weighted singularly perturbed hybrid stochastic systems are discussed. Under some reasonable assumptions, it is shown that there exists a uniformly δ-optimal policy when the perturbation is sufficiently small. Copyright Springer-Verlag Berlin Heidelberg 2005
Persistent link: https://www.econbiz.de/10010999595
In this paper we consider the weighted reward Markov decision process, with perturbation. The “weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward criteria. This criterion allows the controller to trade-off short-term costs...
Persistent link: https://www.econbiz.de/10010999741
Persistent link: https://www.econbiz.de/10005461516
In this paper we consider a singularly perturbed Markov decision process with finitely many states and actions and the limiting expected average reward criterion. We make no assumptions about the underlying ergodic structure. We present algorithms for the computation of a uniformly optimal...
Persistent link: https://www.econbiz.de/10010949977
In this paper we carry out a preliminary exploration of a time scales' conjecture, which postulates that "reasonable" notions of sustainability must include a suitable synchronisation of time scales of both the processes of human development and those of the natural environment. We perform our...
Persistent link: https://www.econbiz.de/10008494370
Persistent link: https://www.econbiz.de/10005205092
In this paper we consider a singularly perturbed Markov decision process with finitely many states and actions and the limiting expected average reward criterion. We make no assumptions about the underlying ergodic structure. We present algorithms for the computation of a uniformly optimal...
Persistent link: https://www.econbiz.de/10010759191