The Thing that we Tried Didn't Work very Well: Deictic Representation in Reinforcement Learning

نویسندگان

  • Sarah Finney
  • Natalia Hernandez-Gardiol
  • Leslie Pack Kaelbling
  • Tim Oates
چکیده

Most reinforcement learning methods oper­ ate on propositional representations of the world state. Such representations are of­ ten intractably large and generalize poorly. Using a deictic representation is believed to be a viable alternative: they promise gener­ alization while allowing the use of existing reinforcement-learning methods. Yet, there are few experiments on learning with deic­ tic representations reported in the literature. In this paper we explore the effectiveness of two forms of deictic representation and a naive propositional representation in a sim­ ple blocks-world domain. We find, empiri­ cally, that the deictic representations actu­ ally worsen learning performance. We con­ clude with a discussion of possible causes of these results and strategies for more effective learning in domains with objects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Reinforcement Learning with Deictic Representation in a Computer Game

Computer games are challenging test beds for machine learning research. Without applying abstraction and generalization techniques, many traditional machine learning techniques, such as reinforcement learning, will fail to learn efficiently. In this paper we examine extensions of reinforcement learning that scale to the complexity of computer games. In particular we look at hierarchical reinfor...

متن کامل

Deictic Option Schemas

Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this article we present a hierarchical reinforcement learning framework that employs aspects of deictic representation. We also present a Bayesian algorithm for learning the correct representation for a given sub-problem a...

متن کامل

The Thing We Tried That Worked: Utile Distinctions for Relational Reinforcement Learning

This paper introduces a relational function approximation technique based on McCallum’s UTree algorithm (McCallum, 1995). We have extended the original approach to handle relational observations using an attribute graph of observable objects and relationships (McGovern et al., 2003). Furthermore, we address the inherent challenges that arise with a relational representation. We use stochastic s...

متن کامل

Learning to Repair Plans and Schedules Using a Relational (deictic) Representation

Unplanned and abnormal events may have a significant impact on the feasibility of plans and schedules which requires to repair them ‘on-the-fly’ to guarantee due date compliance of orders-in-progress and negotiating delivery conditions for new orders. In this work, a repair-based rescheduling approach based on the integration of intensive simulations with logical and relational reinforcement le...

متن کامل

SmartGantt - An intelligent system for real time rescheduling based on relational reinforcement learning

With the current trend towards cognitive manufacturing systems to deal with unforeseen events and disturbances that constantly demand real-time repair decisions, learning/reasoning skills and interactive capabilities are important functionalities for rescheduling a shop-floor on the fly taking into account several objectives and goal states. In this work, the automatic generation and update thr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002