Fuzzyfying Markov Decision Process
Markov decision processes have become an indispensable tool in applications as diverse as equipment maintenance, manufacturing systems, inventory control, queuing networks and investment analysis. Typically we have a controlled Markov chain on a suitable state space in which transitional probabilities depend on the policy (or decision maker) which comes from a set of possible actions. The main problem of interest would be to find an optimal policy that minimizes the associated cost.Linear Programming has been widely used to find the optimal Markov decision policy. It requires solutions of large systems of simultaneous linear equations. By the fact that the complexity in linear programming increases much faster with the increase in the number of states which is often called curse of dimensionality, the linear programming method can handle only small models.This thesis presents a new method to lessen the curse of dimensionality. By assuming certain monotonicity property for the transition probability, it is shown that a fuzzy membership function can be used to reduce the number of states. The use of membership functions help to reduce the number of the states. However all the states remain intact through the use of the membership value. That is, those states eliminated can be recovered through interpolation with the aid of membership functions. This new proposed method is shown to be effective in coping with the curse of dimensionality.
|Year of publication:||
|Authors:||Irshad, Ahmed Syed|
|Other Persons:||Subhash C Kak (contributor) ; Steve C. S. Cai (contributor) ; Guoxiang Gu (contributor)|
|Subject:||Electrical & Computer Engineering|
|Type of publication:||Other|
Persistent link: https://www.econbiz.de/10009439338
Saved in favorites
Similar items by subject
Verma, Santhosh, (2011)
Nanduri, Kishori, (2005)
- More ...