نتایج جستجو برای: keywords reinforcement learning

تعداد نتایج: 2453256  

Journal: :journal of artificial intelligence in electrical engineering 2014
mohammad esmaeil akbari noradin ghadimi

in this paper an adaptive pid controller for wind energy conversion systems (wecs) has been developed. theadaptation technique applied to this controller is based on reinforcement learning (rl) theory. nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

2013
Sudhir Raj Cheruvu Siva Kumar

Reinforcement learning has been active research area not only in machine learning but also in control engineering, operation research and robotics in recent years. It is a model free learning control method that can solve Markov decision problems. Q-learning is an incremental dynamic programming procedure that determines the optimal policy in a step-by-step manner. It is an online procedure for...

2012
Junya Nakase Koichi Moriyama Kiyoshi Kiyokawa Masayuki Numao Mayumi Oyama-Higa Satoshi Kurihara

In ambient information systems, not only extracting human behavior by sensor network but also adaptive autonomous interaction between the environment and humans is an important function. In this paper we propose a reinforcement learning framework to extract suitable interaction for each person from daily behavior. In the experiment, we show the feasibility of the proposed methodology. Keywords-...

2005
Simon Thiel Stavros Dalakakis Dieter Roller

This paper outlines the development of a learning retrieval agent. Task of this agent is to extract knowledge of the Active Semantic Network in respect to user-requests. Based on a reinforcement learning approach, the agent learns to interpret the user’s intention. Especially, the learning algorithm focuses on the retrieval of complex long distant relations. Increasing its learnt knowledge with...

2002
Jun Izawa Toshiyuki Kondo Koji Ito

The present paper discusses an optimal control method of biological robot arm which has redundancy of the mapping from the control input to the task goal. The control input space is divided into a couple of subspaces according to a priority order depending on the progress and stability of learning. In the proposed method, the search noise which is required for reinforcement learning is restrict...

2016
Kohei Arai

A new online clustering method based on not only reinforcement and competitive learning but also pursuit algorithm (Pursuit Reinforcement Competitive Learning: PRCL) as well as learning automata is proposed for reaching a relatively stable clustering solution in comparatively short time duration. UCI repository data which are widely used for evaluation of clustering performance in usual is used...

2013
Hitesh Shah Shiv Nadar

The synergy of the two paradigms, neural network and fuzzy inference system, has given rise to rapidly emerging filed, neuro-fuzzy systems. Evolving neuro-fuzzy systems are intended to use online learning to extract knowledge from data and perform a high-level adaptation of the network structure. We explore the potential of evolving neuro-fuzzy systems in reinforcement learning (RL) application...

Journal: :I. J. Robotics Res. 2013
Jens Kober J. Andrew Bagnell Jan Peters

Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this...

2012
Pucheng Zhou Bingrong Hong

How to coordinate the behaviors of the agents through learning is a challenging problem within multi-agent domains. Because of its complexity, recent work has focused on how coordinated strategies can be learned. Here we are interested in using reinforcement learning techniques to learn the coordinated actions of a group of agents, without requiring explicit communication among them. However, t...

2007
Hiroyuki Nakahara Kenji Doya

|The basal ganglia (BG) have been hypothesized to perform reinforcement learning by use of reinforcement signals provided by dopamine neurons. It is well known that there exist multiple BG-thalamocortical loops, but their functions are poorly understood. Here, we propose a computational model of how di erent BG loops are employed in visuomotor sequence learning using di erent representations of...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید