Symbolic Regression Methods for Reinforcement Learning

نویسندگان

چکیده

Reinforcement learning algorithms can be used to optimally solve dynamic decision-making and control problems. With continuous-valued state input variables, reinforcement must rely on function approximators represent the value policy mappings. Commonly numerical approximators, such as neural networks or basis expansions, have two main drawbacks: they are black-box models offering no insight in mappings learned, require significant trial error tuning of their meta-parameters. In this paper, we propose a new approach constructing smooth functions form analytic expressions by means symbolic regression. We introduce three off-line methods for finding based transition model: iteration, direct solution Bellman equation. The illustrated four nonlinear problems: velocity under friction, one-link two-link pendulum swing-up, magnetic manipulation. results show that not only yield well-performing policies, but also compact, mathematically tractable easy plug into other algorithms. This makes them potentially suitable further analysis closed-loop system. A comparison with an alternative using shows our method outperforms network-based one.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementing neural reinforcement learning with symbolic methods: Possibilities and challenges

|Several diierent ways of using symbolic methods to improve reinforcement learning are identiied and discussed in some detail. Each demonstrates to some extent the advantages of combining RL and symbolic methods. These methods point to the potentials and the challenges of this line of research.

متن کامل

Towards Deep Symbolic Reinforcement Learning

Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require ver...

متن کامل

Variational methods for Reinforcement Learning

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transition matrix is obtained from which the optimal decision policy is formed. The classical maximum likelihood point estimate of the transition model does not reflect the uncertainty in the estimate of the transition model ...

متن کامل

Combining Reinforcement Learning with Symbolic Planning

One of the major difficulties in applying Q-learning to realworld domains is the sharp increase in the number of learning steps required to converge towards an optimal policy as the size of the state space is increased. In this paper we propose a method, PLANQ-learning, that couples a Q-learner with a STRIPS planner. The planner shapes the reward function, and thus guides the Q-learner quickly ...

متن کامل

Symbolic structures in numeric reinforcement for learning optimum robot trajectory

This article contributes to the understanding of how symbols can profitably be integrated into a numeric reinforcement learning approach so as to realize a learning robot in a continuous problem setting. An initial experiment clearly revealed that the underlying task could not be learned merely on the basis of numeric reinforcement values. The unstructured utilization of sensory values of the r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3119000