q learning

Neural reinforcement learning for behaviour synthesis

Journal: :Robotics and Autonomous Systems 1997

Claude F. Touzet

We present the results of a research aimed at improving the Q-learning method through the use of artificial neural networks. Neural implementations are interesting due to their generalisation ability. Two implementations are proposed: one with a competitive multilayer perceptron and the other with a self-organising map. Results obtained on a task of learning an obstacle avoidance behaviour for ...

متن کامل

Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic

ژورنال: کنترل 2022

Mashayekhi, Hoda, Nazari, Mostafa,

In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...

متن کامل

Hybridisation of expertise and reinforcement learning in dialogue systems

2009

Romain Laroche Ghislain Putois Philippe Bretier Bernadette Bouchon-Meunier

This paper addresses the problem of introducing learning capabilities in industrial handcrafted automata-based Spoken Dialogue Systems, in order to help the developer to cope with his dialogue strategies design tasks. While classical reinforcement learning algorithms position their learning at the dialogue move level, the fundamental idea behind our approach is to learn at a finer internal deci...

متن کامل

Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Journal: :CoRR 2018

Vitchyr Pong Shixiang Gu Murtaza Dalal Sergey Levine

Model-free reinforcement learning (RL) is a powerful, general tool for learning complex behaviors. However, its sample efficiency is often impractically large for solving challenging real-world problems, even with off-policy algorithms such as Q-learning. A limiting factor in classic model-free RL is that the learning signal consists only of scalar rewards, ignoring much of the rich information...

متن کامل

Learning robot control

2012

Max Mindt

Learning is essential to expand the capabilities of a robot. But what is the meaning of learning for a robot and how is the implementation of a learning task? This question is a key question and in addition to what should be learned. This paper defines a general learning model, classify robot learning and develops a learning architecture for locomotion of a simple walker. These learning archite...

متن کامل

Combining independent and joint learning: a negotiation based approach

2012

Reinaldo A. C. Bianchi Ana L. C. Bazzan

This work presents a new class of multiagent reinforcement learning algorithms that takes advantage of negotiation in order to improve the process of action selection. In this class of algorithms, agents use communication to cooperate and negotiate over the joint actions, thus enhancing the process of action selection. In this paper a new algorithm in this class is proposed: the Negotiation-bas...

متن کامل

Making a Robot Learn to Play Soccer Using Reward and Punishment

2007

Heiko Müller Martin Lauer Roland Hafner Sascha Lange Artur Merke Martin A. Riedmiller

In this paper, we show how reinforcement learning can be applied to real robots to achieve optimal robot behavior. As example, we enable an autonomous soccer robot to learn intercepting a rolling ball. Main focus is on how to adapt the Q-learning algorithm to the needs of learning strategies for real robots and how to transfer strategies learned in simulation onto real robots.

متن کامل

Morphing Airfoils with Four Morphing Parameters

2008

Amanda Lampton Adam Niksch John Valasek

An episodic unsupervised learning simulation using the Q-Learning method is developed to learn the optimal shape and shape change policy for a problem with four state variables. Optimality is addressed by reward functions based on airfoil properties such as lift coefficient, drag coefficient, and moment coefficient about the leading edge representing optimal shapes for specified flight conditio...

متن کامل

Speeding Up the Learning in A Robot Simulator

2015

Mostafa Al-Emran

Q-learning is a one of the well-known Reinforcement Learning algorithms that has been widely used in various problems. The main contribution of this work is how to speed up the learning in a single agent environment (e.g. the robot). In this work, an attempt to optimize the traditional Q-learning algorithm has been done via using the Repeated Update Q-learning (RUQL) algorithm (the recent state...

متن کامل

Improving on the reinforcement learning of coordination in cooperative multi-agent systems

2002

Spiros Kapetanakis Daniel Kudenko

We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. These techniques are variants of Q-learning (Watkins, 1989) that are applicable to scenarios where mutual observation of actions is not possible. To date, reinforcement learning approaches for such independent agents did not guarantee convergence to the optimal ...

متن کامل