Model-based inverse reinforcement learning for deterministic systems

نویسندگان

چکیده

This paper focuses on the development of an online data-driven model-based inverse reinforcement learning (MBIRL) technique for linear and nonlinear deterministic systems. Input output trajectories agent under observation, attempting to optimize unknown reward function, are used estimate function corresponding optimal value in real-time. To achieve MBIRL using limited data, a novel feedback-driven approach is developed. The feedback policy dynamic model observation estimated from measured data estimates generate synthetic drive MBIRL. Theoretical guarantees ultimate boundedness estimation errors general, convergence zero special cases, derived Lyapunov techniques. Proof concept numerical experiments demonstrates utility developed method solve problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Score-based Inverse Reinforcement Learning

This paper reports theoretical and empirical results obtained for the score-based Inverse Reinforcement Learning (IRL) algorithm. It relies on a non-standard setting for IRL consisting of learning a reward from a set of globally scored trajectories. This allows using any type of policy (optimal or not) to generate trajectories without prior knowledge during data collection. This way, any existi...

متن کامل

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

Model-Based Probabilistic Pursuit via Inverse Reinforcement Learning

In this paper we address the integrated prediction, planning, and control problem that enables a single follower robot (the “photographer”) to quickly re-establish visual contact with a moving target (the “subject”) that has escaped the follower’s field of view. Our work addresses this unavoidable scenario, which reactive controllers are typically ill-equipped to handle, by making intelligent p...

متن کامل

Inverse Reinforcement Learning in Swarm Systems

Inverse reinforcement learning (IRL) has become a useful tool for learning behavioral models from demonstration data. However, IRL remains mostly unexplored for multi-agent systems. In this paper, we show how the principle of IRL can be extended to homogeneous large-scale problems, inspired by the collective swarming behavior of natural systems. In particular, we make the following contribution...

متن کامل

Preference-learning based Inverse Reinforcement Learning for Dialog Control

Dialog systems that realize dialog control with reinforcement learning have recently been proposed. However, reinforcement learning has an open problem that it requires a reward function that is difficult to set appropriately. To set the appropriate reward function automatically, we propose preference-learning based inverse reinforcement learning (PIRL) that estimates a reward function from dia...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Automatica

سال: 2022

ISSN: ['1873-2836', '0005-1098']

DOI: https://doi.org/10.1016/j.automatica.2022.110242