نتایج جستجو برای: q learning

تعداد نتایج: 717428  

2007
Carlos H. C. Ribeiro

In the last few years, reinforcement learning algorithms have been proposed as a more natural way of modelling animal learning. Unlike supervised learning methods, reinforcement learning addresses the basic problem faced by an animal when trying to control a discrete stochastic dynamic system: discover by trial and error a policy of actions that maximises some criterium of optimality, usually e...

2003
Kazuyuki Ito Tetsushi Kamegawa Fumitoshi Matsuno

Reinforcement learning is very effec#ive for robot learning. Because it does not need prior knowledge and has higher capability of reactive and adaptive behaviors. In our previous works, we proposed new reinforce learning algorithm: "Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)". It is designed for complicated systems with large action-state space...

1998
Bruce L. Digney

While the need for hierarchies within control systems is apparent, it is also clear to many researchers that such hierarchies should be learned. Learning both the structure and the component behaviors is a diicult task. The beneet of learning the hierarchical structures of behaviors is that the decomposition of the control structure into smaller transportable chunks allows previously learned kn...

2004
Alexander A. Sherstov

Reinforcement learning (RL) is a powerful machine-learning methodology that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. There has been considerable work on applying RL, a method originally conceived for discrete state-action spaces, to problems with continuous states. The extension of RL to allow continuous actions, on the other h...

The main purpose of this paper is to develop a supplementary signal using reinforcement learning (RL) to improve the performance of power system stabilizer (PSS). RL is one of the most important issues in the field of artificial intelligence and is the popular method for solving Markov decision procedure (MDP). In this paper, a control method is developed based on Q-learning and used to improve...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه فردوسی مشهد - پژوهشکده فنی و مهندسی 1391

در طی دو- سه دهه ی اخیر صنعت برق در سرتاسر جهان، گذار از ساختارهای یک پارچه ی عمودی را به سمت بازارهای آزاد رقابتی آغاز کرده است. با وجود حرکت به سمت فضای رقابتی، متأسفانه این گذار به صورت کامل صورت نگرفته است، و بازارهایی با رقابت ناکامل ایجاد شده اند. در بازاری با رقابت ناکامل، تولید کننده گان درمی یابند که اگر قیمتی بالاتر از هزینه ی حدی شان پیشنهاد دهند ممکن است سود بیشتری به دست آورند. بنا...

Introduction: Learning management system is one of the most effective methods in teaching and learning The present study aims to identify and categorize effective factors on the effectiveness of this system from students' point of view. Methods: The present study uses exploratory and '' Q method'. The study participants were Students of Isfahan University of Medical Sciences in academic year 2...

2006
Dean C. Wardell

Reinforcement learning is one of the more attractive machine learning technologies, due to its unsupervised learning structure and ability to continually learn even as the environment it is operating in changes. This ability to learn in an unsupervised manner in a changing environment is applicable in complex domains through the use of function approximation of the domain’s policy. The function...

Journal: :International Journal of Computer Science and Engineering 2021

Journal: :Neural Network World 2018

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید