partially fluid

It is known that determining whether a DEC-POMDP, namely, a cooperative partially observable stochastic game (POSG), has a cooperative strategy with positive expected reward is complete for NEXP. It was not known until now how cooperation affected that complexity. We show that, for competitive POSGs, the complexity of determining whether one team has a positive-expected-reward strategy is com...

Decision-Theoretic Meta-reasoning in Partially Observable and Decentralized Settings

2014

Alan Scott Carlin Roderic A. Grupen

Dec-POMDPs as Non-Observable MDPs

2014

Frans A. Oliehoek Christopher Amato

A recent insight in the field of decentralized partially observable Markov decision processes (Dec-POMDPs) is that it is possible to convert a Dec-POMDP to a non-observable MDP, which is a special case of POMDP. This technical report provides an overview of this reduction and pointers to related literature.

On Partially Observable MDPs and BDI Models

2002

Martijn C. Schut Michael Wooldridge Simon Parsons

Decision-theoretic planning in AI by means of solving partially observable Markov decision processes (POMDPs) has been shown to be both powerful and versatile. However, such approaches are computationally hard and, from a design stance, are not necessarily intuitive for conceptualising many problems. We propose a novel method for solving POMDPs, which provides a designer with a more intuitive mea...

Learning to Explore and Exploit in POMDPs

2009

Chenghui Cai Xuejun Liao Lawrence Carin

A fundamental objective in reinforcement learning is the maintenance of a proper balance between exploration and exploitation. This problem becomes more challenging when the agent can only partially observe the states of its environment. In this paper we propose a dual-policy method for jointly learning the agent behavior and the balance between exploration and exploitation, in partially observable...

Learning Partially Observable Action Models: Efficient Algorithms

2006

Dafna Shahaf Allen Chang Eyal Amir

We present tractable, exact algorithms for learning actions’ effects and preconditions in partially observable domains. Our algorithms maintain a propositional logical representation of the set of possible action models after each observation and action execution. The algorithms perform exact learning of preconditions and effects in any deterministic action domain. This includes STRIPS actions ...

Robust Person Guidance by Using Online POMDPs

2013

Luis Merino Joaquín Ballesteros Noé Pérez-Higueras Rafael Ramón Vigo Javier Pérez-Lara Fernando Caballero

The paper considers a guiding task in which a robot has to guide a person towards a destination. Robust operation requires considering uncertain models of the person's motion and intentions, as well as noise and occlusions in the sensors employed for the task. Partially Observable Markov Decision Processes (POMDPs) are used to model the task. The paper describes an enhancement on online POMDP s...
