partially non

نتایج جستجو برای: partially non

تعداد نتایج: 1430292 فیلتر نتایج به سال:

Decision-Theoretic Meta-reasoning in Partially Observable and Decentralized Settings

2014

Alan Scott Carlin ALAN CARLIN Roderic A. Grupen

DECISION-THEORETIC META-REASONING IN PARTIALLY OBSERVABLE AND DECENTRALIZED SETTINGS

متن کامل

Dec-POMDPs as Non-Observable MDPs

2014

Frans A. Oliehoek Christopher Amato

A recent insight in the field of decentralized partially observable Markov decision processes (Dec-POMDPs) is that it is possible to convert a Dec-POMDP to a non-observable MDP, which is a special case of POMDP. This technical report provides an overview of this reduction and pointers to related literature.

متن کامل

On Partially Observable MDPs and BDI Models

2002

Martijn C. Schut Michael Wooldridge Simon Parsons

Decision theoretic planning in ai bymeans of solving Partially ObservableMarkov decision processes (pomdps) has been shown to be both powerful and versatile. However, such approaches are computationally hard and, from a design stance, are not necessarily intuitive for conceptualising many problems. We propose a novel method for solving pomdps, which provides a designer with a more intuitive mea...

متن کامل

Learning to Explore and Exploit in POMDPs

2009

Chenghui Cai Xuejun Liao Lawrence Carin

A fundamental objective in reinforcement learning is the maintenance of a proper balance between exploration and exploitation. This problem becomes more challenging when the agent can only partially observe the states of its environment. In this paper we propose a dual-policy method for jointly learning the agent behavior and the balance between exploration exploitation, in partially observable...

متن کامل

Learning Partially Observable Action Models: Efficient Algorithms

2006

Dafna Shahaf Allen Chang Eyal Amir

We present tractable, exact algorithms for learning actions’ effects and preconditions in partially observable domains. Our algorithms maintain a propositional logical representation of the set of possible action models after each observation and action execution. The algorithms perform exact learning of preconditions and effects in any deterministic action domain. This includes STRIPS actions ...

متن کامل

Robust Person Guidance by Using Online POMDPs

2013

Luis Merino Joaquín Ballesteros Noé Pérez-Higueras Rafael Ramón Vigo Javier Pérez-Lara Fernando Caballero

The paper considers a guiding task in which a robot has to guide a person towards a destination. A robust operation requires to consider uncertain models on the person motion and intentions, as well as noise and occlusions in the sensors employed for the task. Partially Observable Markov Decision Processes (POMDPs) are used to model the task. The paper describes an enhancement on online POMDP s...

متن کامل

MDPs Semi - Markov decision processes Hidden Markov models Partially observable SMDPs Hierarchical HMMs

2007

Sridhar Mahadevan

متن کامل

How Prior Probability Influences Decision Making: A Unifying Probabilistic Model

2012

Yanping Huang Abram L. Friesen Timothy D. Hanks Michael N. Shadlen Rajesh P. N. Rao

How does the brain combine prior knowledge with sensory evidence when making decisions under uncertainty? Two competing descriptive models have been proposed based on experimental data. The first posits an additive offset to a decision variable, implying a static effect of the prior. However, this model is inconsistent with recent data from a motion discrimination task involving temporal integr...

متن کامل

patterns and variations in native and non-native interlanguage pragmatic rating: effects of rater training, intercultural proficiency, and self-assessment

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه علامه طباطبایی - دانشکده ادبیات و زبانهای خارجی 1391

مینو عالمی, ضیا تاج الدین, محمد خطیب,

although there are studies on pragmatic assessment, to date, literature has been almost silent about native and non-native english raters’ criteria for the assessment of efl learners’ pragmatic performance. focusing on this topic, this study pursued four purposes. the first one was to find criteria for rating the speech acts of apology and refusal in l2 by native and non-native english teachers...

15 صفحه اول

Goal Achievement in Partially Known, Partially Observable Domains

2006

Allen Chang Eyal Amir

We present a decision making algorithm for agents that act in partially observable domains which they do not know fully. Making intelligent choices in such domains is very difficult because actions’ effects may not be known a priori (partially known domain), and features may not always be visible (partially observable domain). Nonetheless, we show that an efficient solution is achievable in STR...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید