View Related Documents

Abstract

This paper presents three conditions. Each of them guarantees the uniqueness of optimal policies of discounted Markov decision processes. The conditions presented here impose hypotheses specifically on the state space X, the action space A, the admissible action sets A(x),xisinX, the transition probability Q, and on the cost function c. Two of these conditions require mainly convexity assumptions, but the third one does not need this kind of assumptions. However, it needs certain stochastic order relations in Q, and the cost function c to reach its minimum with respect to the actions, just in one action. We illustrate the conditions with several examples including, in particular, discrete models, the linear regulator problem, and also a model of an inventory control system.

Keywords  Discounted Markov decision processes - Uniqueness of optimal policies - Convexity - Stochastic order

Mathematics Subject Classification 2000:  90C40 - 93E20

Manuscript received: May 2003 / Final version received: January 2004

Fulltext Preview

Image of the first page of the fulltext document