Behaviour-Based Reinforcement Learning

Authors

George Dimitri Konidaris

Jane Rankin

Douglas Howie

Abstract

Although behaviour-based robotics has been successfully used to develop autonomous mobile robots up to a certain point, further progress may require the integration of a learning model into the behaviour-based framework. Reinforcement learning is a natural candidate for this because it seems well suited to the problems faced by autonomous agents. However, previous attempts to use reinforcement learning in behaviour-based mobile robots have been simple combinations of these two methodologies rather than full integrations, and have suffered from severe scaling problems that appear to make them infeasible. Furthermore, the implicit assumptions that form the basis of reinforcement learning theory were not developed with the problems faced by autonomous agents in complex environments in mind. This dissertation introduces a model of reinforcement learning that is designed specifically for use in behaviour-based robots, taking the conditions faced by situated agents into account. The model layers a distributed and asynchronous reinforcement learning algorithm over a learned topological map and standard behavioural substrate to create a reinforcement learning complex. The topological map creates a small and task-relevant state space that aims to make reinforcement learning feasible, while the distributed and asynchronous nature of the model makes it compatible with behaviour-based design principles. The model is then validated through an experiment that requires a mobile robot to perform puck foraging in three separate artificial arenas. The dissertation describes the development of Dangerous Beans, a mobile robot capable of building a distributed topological map of its environment and performing reinforcement learning over it, and reports the results of using it to test three control strategies (random decision making, a standard reinforcement learning algorithm layered on top of a topological map, and the full model developed in this dissertation) in the arenas.
The results show that the model developed in this dissertation is able to learn rapidly in a real environment, and outperforms both the random strategy and the layered standard reinforcement learning algorithm. A discussion of the implications of these results follows, suggesting that situated learning and the integration of behaviour-based methods and layered learning models merit further study.

Similar resources

Cooperation Learning for Behaviour-based Neural-fuzzy Controller in Robot Navigation

Based on the previously proposed extended neural-fuzzy network, this paper presents a cooperation scheme of training-data-based learning and reinforcement learning for constructing sensor-based behaviour modules in robot navigation. In order to solve the reinforcement learning problem, a reinforcement-based neural-fuzzy control system (RNFCS) is provided, which consists of a neural-fuzzy controller...

Reinforcement Learning in Biologically-Inspired Collective Robotics: A Rough Set Approach

This thesis presents a rough set approach to reinforcement learning. This is made possible by considering behaviour patterns of learning agents in the context of approximation spaces. Rough set theory, introduced by Zdzisław Pawlak in the early 1980s, provides a ground for deriving pattern-based rewards within approximation spaces. Learning can be considered episodic. The framework provided by an...

The use of shock collars and their impact on the welfare of dogs: A review of the current literature

There is a wide range of different methods currently in use in dog training. The techniques are based upon operant conditioning, which is the process of learning whereby the animal forms an association between an action and the consequence to it of doing that action. Reinforcement of a behaviour means that the likelihood of the target behaviour being shown again is increased. Reinforcement can ...

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide a mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

Hierarchical Reinforcement Learning for Spoken Dialogue Systems

This thesis focuses on the problem of scalable optimization of dialogue behaviour in speech-based conversational systems using reinforcement learning. Most previous investigations in dialogue strategy learning have proposed flat reinforcement learning methods, which are more suitable for small-scale spoken dialogue systems. This research formulates the problem in terms of Semi-Markov Decision P...

Publication date: 2003
