AIJul 4, 2012

Existence and Finiteness Conditions for Risk-Sensitive Planning: Results and Conjectures

arXiv:1207.1391v115 citations

Originality Incremental advance

AI Analysis

This addresses a foundational gap in decision-theoretic planning for autonomous agents and decision-support systems, though it is incremental as it builds on existing MDP frameworks.

The paper tackles the problem of ensuring optimal expected utilities exist and are finite in risk-sensitive planning for fully observable Markov decision processes with non-linear utility functions, deriving conditions that guarantee this for stationary policies and conjecturing extensions to non-stationary ones.

Decision-theoretic planning with risk-sensitive planning objectives is important for building autonomous agents or decision-support systems for real-world applications. However, this line of research has been largely ignored in the artificial intelligence and operations research communities since planning with risk-sensitive planning objectives is more complicated than planning with risk-neutral planning objectives. To remedy this situation, we derive conditions that guarantee that the optimal expected utilities of the total plan-execution reward exist and are finite for fully observable Markov decision process models with non-linear utility functions. In case of Markov decision process models with both positive and negative rewards, most of our results hold for stationary policies only, but we conjecture that they can be generalized to non stationary policies.

View on arXiv PDF

Similar