q learning

Attentional Mechanisms as a Strategy for Generalization in the Q-Learning Algorithm

2007

Carlos H. C. Ribeiro

In the last few years, reinforcement learning algorithms have been proposed as a more natural way of modelling animal learning. Unlike supervised learning methods, reinforcement learning addresses the basic problem faced by an animal when trying to control a discrete stochastic dynamic system: discover by trial and error a policy of actions that maximises some criterion of optimality, usually e...


Extended QDSEGA for controlling real robots -acquisition of locomotion patterns for snake-like robot

2003

Kazuyuki Ito Tetsushi Kamegawa Fumitoshi Matsuno

Reinforcement learning is very effective for robot learning because it does not need prior knowledge and has a higher capability for reactive and adaptive behaviors. In our previous works, we proposed a new reinforcement learning algorithm: "Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)". It is designed for complicated systems with large action-state space...


Signals Reinforcement Inputs Sensory Actions Skill Skill Skill

1998

Bruce L. Digney

While the need for hierarchies within control systems is apparent, it is also clear to many researchers that such hierarchies should be learned. Learning both the structure and the component behaviors is a difficult task. The benefit of learning the hierarchical structures of behaviors is that the decomposition of the control structure into smaller transportable chunks allows previously learned kn...


On Continuous-Action Q-Learning via Tile Coding Function Approximation

2004

Alexander A. Sherstov

Reinforcement learning (RL) is a powerful machine-learning methodology that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. There has been considerable work on applying RL, a method originally conceived for discrete state-action spaces, to problems with continuous states. The extension of RL to allow continuous actions, on the other h...


Design of a PSS3B Stabilizer Based on the KH Algorithm and Q-learning for Damping Low-Frequency Oscillations in a Single-Machine Power System

Journal: Iranian Electrical and Electronics Engineering 2017

Hossein Shayeghi, Adel Akbari Majd, Abdollah Younesi, Yashar Hashemi

The main purpose of this paper is to develop a supplementary signal using reinforcement learning (RL) to improve the performance of a power system stabilizer (PSS). RL is one of the most important issues in the field of artificial intelligence and is a popular method for solving Markov decision processes (MDPs). In this paper, a control method is developed based on Q-learning and used to improve...


Pricing in the Electricity Market Using an Adaptive Q-learning Algorithm and Market Power

Thesis: Ministry of Science, Research and Technology - Ferdowsi University of Mashhad - Engineering Research Institute 1391 (Solar Hijri)

Reza Kakoolarimi, Mohammad Bagher Naghibi Sistani

Over the last two to three decades, the electricity industry worldwide has begun a transition from vertically integrated structures toward competitive free markets. Despite the move toward a competitive environment, this transition has unfortunately not been completed, and markets with imperfect competition have emerged. In a market with imperfect competition, producers realize that if they bid a price above their marginal cost, they may earn more profit...


Identifying and Classifying the Mindfulness of Medical Students on Effective Factors on the Effectiveness of a Learning Management System (LMS)

Journal: Journal of the Center for Studies and Development of Medical Sciences Education, Shahid Sadoughi, Yazd 2019

Hasan Kavianii, Mahbobe Shah Javan

Introduction: A learning management system is one of the most effective methods in teaching and learning. The present study aims to identify and categorize the factors that affect the effectiveness of this system from the students' point of view. Methods: The present study is exploratory and uses the 'Q method'. The study participants were students of Isfahan University of Medical Sciences in academic year 2...


Fuzzy State Aggregation and Off-policy Reinforcement Learning for Stochastic Environments

2006

Dean C. Wardell

Reinforcement learning is one of the more attractive machine learning technologies, due to its unsupervised learning structure and ability to continually learn even as the environment it is operating in changes. This ability to learn in an unsupervised manner in a changing environment is applicable in complex domains through the use of function approximation of the domain’s policy. The function...


Image Sampling Using Q-Learning

Journal: International Journal of Computer Science and Engineering 2021


Q LEARNING REGRESSION NEURAL NETWORK

Journal: Neural Network World 2018
