Mutual Information Weighing for Probabilistic Movement Primitives

نویسندگان

چکیده

Reinforcement Learning (RL) of trajectory data has been used in several fields, and it is relevance robot motion learning, which sampled trajectories are run their outcome evaluated with a reward value. The responsibility on the performance task can be associated to as whole, or distributed throughout its points (timesteps). In this work, we present novel method for attributing rewards each timestep separately by using Mutual Information (MI) bias model fitting trajectory.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Probabilistic Movement Primitives

Movement Primitives (MP) are a well-established approach for representing modular and re-usable robot movement generators. Many state-of-the-art robot learning successes are based MPs, due to their compact representation of the inherently continuous and high dimensional robot movements. A major goal in robot learning is to combine multiple MPs as building blocks in a modular control architectur...

متن کامل

A Probabilistic Representation for Dynamic Movement Primitives

Dynamic Movement Primitives have successfully been used to realize imitation learning, trial-and-error learning, reinforcement learning, movement recognition and segmentation and control. Because of this they have become a popular representation for motor primitives. In this work, we showcase how DMPs can be reformulated as a probabilistic linear dynamical system with control inputs. Through th...

متن کامل

Learning Movement Primitives

This paper discusses a comprehensive framework for modular motor control based on a recently developed theory of dynamic movement primitives (DMP). DMPs are a formulation of movement primitives with autonomous nonlinear differential equations, whose time evolution creates smooth kinematic control policies. Model-based control theory is used to convert the outputs of these policies into motor co...

متن کامل

Kernelized Movement Primitives

Imitation learning has been studied widely as a convenient way to transfer human skills to robots. This learning approach is aimed at extracting relevant motion patterns from human demonstrations and subsequently applying these patterns to different situations. Despite many advancements have been achieved, the solutions for coping with unpredicted situations (e.g., obstacles and external pertur...

متن کامل

Optimal Movement Primitives

The theory of Optimal Unsupervised Motor Learning shows how a network can discover a reduced-order controller for an unknown nonlinear system by representing only the most significant modes. Here, I extend the theory to apply to command sequences, so that the most significant components discovered by the network correspond to motion "primitives". Combinations of these primitives can be used to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Frontiers in artificial intelligence and applications

سال: 2022

ISSN: ['1879-8314', '0922-6389']

DOI: https://doi.org/10.3233/faia220359