The Thing that we Tried Didn't Work very Well: Deictic Representation in Reinforcement Learning
نویسندگان
چکیده
Most reinforcement learning methods oper ate on propositional representations of the world state. Such representations are of ten intractably large and generalize poorly. Using a deictic representation is believed to be a viable alternative: they promise gener alization while allowing the use of existing reinforcement-learning methods. Yet, there are few experiments on learning with deic tic representations reported in the literature. In this paper we explore the effectiveness of two forms of deictic representation and a naive propositional representation in a sim ple blocks-world domain. We find, empiri cally, that the deictic representations actu ally worsen learning performance. We con clude with a discussion of possible causes of these results and strategies for more effective learning in domains with objects.
منابع مشابه
Hierarchical Reinforcement Learning with Deictic Representation in a Computer Game
Computer games are challenging test beds for machine learning research. Without applying abstraction and generalization techniques, many traditional machine learning techniques, such as reinforcement learning, will fail to learn efficiently. In this paper we examine extensions of reinforcement learning that scale to the complexity of computer games. In particular we look at hierarchical reinfor...
متن کاملDeictic Option Schemas
Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this article we present a hierarchical reinforcement learning framework that employs aspects of deictic representation. We also present a Bayesian algorithm for learning the correct representation for a given sub-problem a...
متن کاملThe Thing We Tried That Worked: Utile Distinctions for Relational Reinforcement Learning
This paper introduces a relational function approximation technique based on McCallum’s UTree algorithm (McCallum, 1995). We have extended the original approach to handle relational observations using an attribute graph of observable objects and relationships (McGovern et al., 2003). Furthermore, we address the inherent challenges that arise with a relational representation. We use stochastic s...
متن کاملLearning to Repair Plans and Schedules Using a Relational (deictic) Representation
Unplanned and abnormal events may have a significant impact on the feasibility of plans and schedules which requires to repair them ‘on-the-fly’ to guarantee due date compliance of orders-in-progress and negotiating delivery conditions for new orders. In this work, a repair-based rescheduling approach based on the integration of intensive simulations with logical and relational reinforcement le...
متن کاملSmartGantt - An intelligent system for real time rescheduling based on relational reinforcement learning
With the current trend towards cognitive manufacturing systems to deal with unforeseen events and disturbances that constantly demand real-time repair decisions, learning/reasoning skills and interactive capabilities are important functionalities for rescheduling a shop-floor on the fly taking into account several objectives and goal states. In this work, the automatic generation and update thr...
متن کامل