نتایج جستجو برای: keywords reinforcement learning

تعداد نتایج: 2453256  

Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...

2010
Sh. Yasini M. B. Naghibi - Sistani A. Karimpour

This paper employs a new approach to regulate the blood glucose level of type I diabetic patient under an intensive insulin treatment. The closed-loop control scheme incorporates expert knowledge about treatment by using reinforcement learning theory to maintain the normoglycemic average of 80 mg/dl and the normal condition for free plasma insulin concentration in severe initial state. The insu...

Journal: :Adaptive Behaviour 2002
Martin V. Butz Joachim Hoffmann

The concept of anticipations controlling behavior is introduced. Background is provided about the importance of anticipations from a psychological perspective. Based on the psychological background wrapped in a framework of anticipatory behavioral control, the anticipatory learning classifier system ACS2 is explained. ACS2 learns and generalizes on-line a predictive environmental model (a model...

2012
M. Sedighizadeh A. Rezazadeh

A self tuning PID control strategy using reinforcement learning is proposed in this paper to deal with the control of wind energy conversion systems (WECS). Actor-Critic learning is used to tune PID parameters in an adaptive way by taking advantage of the model-free and on-line learning properties of reinforcement learning effectively. In order to reduce the demand of storage space and to impro...

2009
Moamar Sayed Mouchaweh Bernard Riera

In this paper, we propose to develop the supervised classification method Fuzzy Pattern Matching to be in addition a non supervised one. The goal is to monitor dynamic systems with a limited prior knowledge about their functioning. The detection of the occurrence of new states as well as the reinforcement of the estimation of their membership functions are performed online thanks to the combina...

2017
Sean Lamont John Aslanides Jan Leike Marcus Hutter

In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are few examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform AIXIjs the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on a...

Journal: :IJSIR 2016
Daniel Hein Alexander Hentschel Thomas A. Runkler Steffen Udluft

This article introduces a model-based reinforcement learning (RL) approach for continuous state and action spaces. While most RL methods try to find closed-form policies, the approach taken here employs numerical on-line optimization of control action sequences. First, a general method for reformulating RL problems as optimization tasks is provided. Subsequently, Particle Swarm Optimization (PS...

2016
Ali Nadi Ali Edrissi

Relief demand and transportation links availability is the essential information that is needed for every natural disaster operation. This information is not in hand once a disaster strikes. Relief demand and network condition has been evaluated based on prediction method in related works. Nevertheless, prediction seems to be over or under estimated due to uncertainties and may lead to a failur...

2009
Poornima Balakrishna Rajesh Ganesan Ph.D Lance Sherry

Accurate estimation of taxi-out time in the presence of uncertainties in the National Airspace System (NAS) is essential for the development of a more efficient air traffic management system. The dynamic nature of operations in the NAS indicates that traditional regression methods characterized by constant parameters would be inadequate to capture variations in taxi-out time across a day. In th...

2007
Florent Guenter Micha Hersch Sylvain Calinon Aude Billard

The goal of developing algorithms for programming robots by demonstration is to create an easy way of programming robots such that it can be accomplished by anyone. When a demonstrator teaches a task to a robot, he/she shows some ways of fulfilling the task, but not all the possibilities. The robot must then be able to reproduce the task even when unexpected perturbations occur. In this case, i...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید