Solving Limited Memory Influence Diagrams

نویسندگان

  • Denis Deratani Mauá
  • Cassio Polpo de Campos
  • Marco Zaffalon
چکیده

We present a new algorithm for exactly solving decision making problems represented as influence diagrams. We do not require the usual assumptions of no forgetting and regularity; this allows us to solve problems with simultaneous decisions and limited information. The algorithm is empirically shown to outperform a state-of-the-art algorithm on randomly generated problems of up to 150 variables and 10 solutions. We show that these problems are NP-hard even if the underlying graph structure of the problem has low treewidth and the variables take on a bounded number of states, and that they admit no provably good approximation if variables can take on an arbitrary number of states.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solving influence diagrams using heuristic search

Existing methods for solving influence diagrams are mostly based on the bottom-up dynamic programming technique. These methods may waste computation in solving decision scenarios that have zero probabilities or are unreachable from any initial state by following an optimal decision policy. Heuristic search was applied in (Qi & Poole 1995) to address these limitations, but their algorithm uses a...

متن کامل

On the complexity of solving polytree-shaped limited memory influence diagrams with binary variables

Influence diagrams are intuitive and concise representations of structured decision problems. When the problem is non-Markovian, an optimal strategy can be exponentially large in the size of the diagram. We can avoid the inherent intractability by constraining the size of admissible strategies, giving rise to limited memory influence diagrams. A valuable question is then how small do strategies...

متن کامل

Solving Limited Memory Influence Diagrams Using Branch-and-Bound Search

A limited-memory influence diagram (LIMID) is a generalization of a traditional influence diagram in which the assumptions of regularity and no-forgetting are relaxed. Lauritzen and Nilsson (2001) introduced this model and proposed an iterative algorithm, called single policy updating (SPU), that converges to a locally optimal solution. In this paper, we describe a branch-andbound algorithm tha...

متن کامل

Markov Limid processes for representing and solving renewal problems

In this paper a new tool for simultaneous optimization of decisions on multiple time scales is discussed. It combines the dynamic properties of Markov decision processes with the flexible and compact state space representation of Limited Memory Influence Diagrams (LIMIDs).

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Artif. Intell. Res.

دوره 44  شماره 

صفحات  -

تاریخ انتشار 2012