partially fluid

نتایج جستجو برای: partially fluid

تعداد نتایج: 347414 فیلتر نتایج به سال:

MDPs Semi - Markov decision processes Hidden Markov models Partially observable SMDPs Hierarchical HMMs

2007

Sridhar Mahadevan

متن کامل

How Prior Probability Influences Decision Making: A Unifying Probabilistic Model

2012

Yanping Huang Abram L. Friesen Timothy D. Hanks Michael N. Shadlen Rajesh P. N. Rao

How does the brain combine prior knowledge with sensory evidence when making decisions under uncertainty? Two competing descriptive models have been proposed based on experimental data. The first posits an additive offset to a decision variable, implying a static effect of the prior. However, this model is inconsistent with recent data from a motion discrimination task involving temporal integr...

متن کامل

Goal Achievement in Partially Known, Partially Observable Domains

2006

Allen Chang Eyal Amir

We present a decision making algorithm for agents that act in partially observable domains which they do not know fully. Making intelligent choices in such domains is very difficult because actions’ effects may not be known a priori (partially known domain), and features may not always be visible (partially observable domain). Nonetheless, we show that an efficient solution is achievable in STR...

متن کامل

Discovering Abstract Concepts to Aid Cross-Map Transfer for a Learning Agent

2009

Cédric Herpson Vincent Corruble

The capacity to apply knowledge in a context different than the one in which it was learned has become crucial within the area of autonomous agents. This paper specifically addresses the issue of transfer of knowledge acquired through online learning in partially observable environments. We investigate the discovery of relevant abstract concepts which help the transfer of knowledge in the conte...

متن کامل

Subjective Equilibria in Interactive POMDPs: Theory and Computational Limitations

2005

Prashant Doshi

We analyze the asymptotic behavior of agents engaged in a infinite horizon partially observable stochastic game formalized by the interactive POMDP framework. We show that when agents’ initial beliefs satisfy a truth compatibility condition, their behavior converges to a subjective 2-equilibrium in a finite time, and subjective equilibrium in the limit. Imposing an additional assumption of mutu...

متن کامل

Goal Recognition over POMDPs: Inferring the Intention of a POMDP Agent

2011

Miquel Ramírez Hector Geffner

Plan recognition is the problem of inferring the goals and plans of an agent from partial observations of her behavior. Recently, it has been shown that the problem can be formulated and solved using planners, reducing plan recognition to plan generation. In this work, we extend this model-based approach to plan recognition to the POMDP setting, where actions are stochastic and states are parti...

متن کامل

Maintaining Predictions over Time without a Model

2009

Erik Talvitie Satinder P. Singh

A common approach to the control problem in partially observable environments is to perform a direct search in policy space, as defined over some set of features of history. In this paper we consider predictive features, whose values are conditional probabilities of future events, given history. Since predictive features provide direct information about the agent’s future, they have a number of...

متن کامل

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

2010

Toyomi Meguro Ryuichiro Higashinaka Yasuhiro Minami Kohji Dohsaka

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We collected a large number of listening-oriented dialogues with their user satisfaction ratings and used them to create a dialogue control component using partially observable Markov decision processes (POMDPs), which can le...

متن کامل

A Four-Participant Group Facilitation Framework for Conversational Robots

2013

Yoichi Matsuyama Iwao Akiba Akihiro Saito Tetsunori Kobayashi

In this paper, we propose a framework for conversational robots that facilitates fourparticipant groups. In three-participant conversations, the minimum unit for multiparty conversations, social imbalance, in which a participant is left behind in the current conversation, sometimes occurs. In such scenarios, a conversational robot has the potential to facilitate situations as the fourth partici...

متن کامل

Market-Based Reinforcement Learning in Partially Observable Worlds

2001

Ivo Kwee Marcus Hutter Jürgen Schmidhuber

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an agent needs to learn short-term memories of relevant previous events in order to execute optimal actions. Most previous work, however, has focused on reactive settings (MDPs) instead of POMDPs. Here we reimplement a r...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید