Derivative-free reinforcement learning: a review

Abstract

Reinforcement learning is about learning agent models that make the best sequential decisions in unknown environments. In an unknown environment, the agent needs to explore the environment while exploiting the collected information, which usually forms a sophisticated problem to solve. Derivative-free optimization, meanwhile, is capable of solving sophisticated problems. It commonly uses a sampling-and-updating framework to iteratively improve the solution, where exploration and exploitation also need to be well balanced. Therefore, derivative-free optimization deals with a similar core issue as reinforcement learning, and has been introduced into reinforcement learning approaches under the names of learning classifier systems and neuroevolution/evolutionary reinforcement learning. Although such methods have been developed for decades, derivative-free reinforcement learning has recently attracted increasing attention. However, a recent survey on this topic is still lacking. In this article, we summarize methods of derivative-free reinforcement learning to date, and organize them in aspects including parameter updating, model selection, exploration, and parallel/distributed methods. Moreover, we discuss some current limitations and possible future directions, hoping that this article could bring more attention to this topic and serve as a catalyst for developing novel and efficient approaches.
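As a rough illustration of the sampling-and-updating framework the abstract mentions, the sketch below runs a cross-entropy-style search over the two parameters of a linear policy. It is not taken from the article: the toy 1-D task, the function names `episode_return` and `cross_entropy_search`, and all hyperparameter values are assumptions chosen for brevity. The search uses only episode returns, never gradients.

```python
import random
import statistics


def episode_return(theta, steps=20, x0=1.0):
    """Return of a linear policy a = w*x + b on a hypothetical 1-D task."""
    w, b = theta
    x, total = x0, 0.0
    for _ in range(steps):
        a = max(-1.0, min(1.0, w * x + b))  # action clipped to [-1, 1]
        x = x + a                           # simple deterministic dynamics
        total -= x * x                      # reward: negative squared distance to 0
    return total


def cross_entropy_search(iterations=30, population=50, elite=10, seed=0):
    """Derivative-free policy search: sample, rank by return, refit the sampler."""
    rng = random.Random(seed)
    mean, std = [0.0, 0.0], [1.0, 1.0]
    for _ in range(iterations):
        # Sampling step: the Gaussian spread provides exploration.
        samples = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(population)]
        # Evaluation needs only returns, no derivatives of the policy.
        samples.sort(key=episode_return, reverse=True)
        best = samples[:elite]
        # Updating step: refit the sampler to the elite set (exploitation).
        for i in range(len(mean)):
            values = [s[i] for s in best]
            mean[i] = statistics.fmean(values)
            std[i] = statistics.pstdev(values) + 1e-3  # floor keeps some exploration
    return mean


if __name__ == "__main__":
    theta = cross_entropy_search()
    print("parameters:", theta, "return:", episode_return(theta))
```

The standard-deviation floor is one simple way to keep the sampler from collapsing too early; the balance between exploration and exploitation that the abstract refers to shows up here as the choice of population size, elite size, and that floor.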

Similar resources

Reinforcement Learning: Model-free

Simply put, reinforcement learning (RL) is a term used to indicate a large family of different algorithms that all share two key properties. First, the objective of RL is to learn appropriate behavior through trial-and-error experience in a task. Second, in RL, the feedback available to the learning agent is restricted to a reward signal that indicates how well the agent is behaving, but does ...

A Review of Reinforcement Learning

Monte Carlo methods, and temporal difference learning are teased apart, then tied back together in a unified way. Innovations such as backup diagrams, which decorate the book cover, help convey the power and excitement behind reinforcement learning methods to both novices and veterans like us. The book consists of three parts, one dedicated to the problem description and two others to a range o...

Multitask model-free reinforcement learning

Conventional model-free reinforcement learning algorithms are limited to performing only one task, such as navigating to a single goal location in a maze, or reaching one goal state in the Tower of Hanoi block manipulation problem. It has been thought that only model-based algorithms could perform goal-directed actions, optimally adapting to new reward structures in the environment. In this wor...

Reinforcement Learning in Neural Networks: A Survey

In recent years, research on reinforcement learning (RL) has focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. Using neural networks enables RL to search for optimal policies more efficiently in several real-life applicat...

Journal

Journal title: Frontiers of Computer Science

Year: 2021

ISSN: 1673-7350, 1673-7466

DOI: https://doi.org/10.1007/s11704-020-0241-4