Imitation Learning With Stability and Safety Guarantees

Abstract

A method is presented to learn neural network (NN) controllers with stability and safety guarantees through imitation learning (IL). Convex conditions are derived for linear time-invariant systems with NN controllers by merging Lyapunov theory with local quadratic constraints that bound the activation functions in the NN. These conditions are incorporated in the IL process, which simultaneously minimizes the IL loss and maximizes the volume of the region of attraction associated with the NN controller. An alternating direction method of multipliers based algorithm is proposed to solve the IL problem. The method is illustrated on a vehicle lateral control example.
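The abstract does not spell out the form of the local quadratic constraints. As a rough illustration only, the sketch below assumes a tanh activation and a local sector bound, which is one common way to bound an activation on a bounded input interval. The function names `local_sector_bounds` and `sector_qc_holds` and the interval [-2, 2] are hypothetical choices for this example and are not taken from the paper.

```python
import math

def local_sector_bounds(v_lo, v_hi):
    """Return (alpha, beta) such that tanh lies in the sector [alpha, beta]
    for every input v in [v_lo, v_hi]. Assumes v_lo < 0 < v_hi."""
    if not (v_lo < 0.0 < v_hi):
        raise ValueError("interval must contain 0 in its interior")
    # tanh(v)/v decreases as |v| grows, so the smallest slope is at an endpoint.
    alpha = min(math.tanh(v_lo) / v_lo, math.tanh(v_hi) / v_hi)
    beta = 1.0  # slope of tanh at the origin, its largest secant slope
    return alpha, beta

def sector_qc_holds(v, alpha, beta, tol=1e-12):
    """Check the quadratic constraint (phi - alpha*v) * (beta*v - phi) >= 0."""
    phi = math.tanh(v)
    return (phi - alpha * v) * (beta * v - phi) >= -tol

# Assumed pre-activation bounds for the illustration.
alpha, beta = local_sector_bounds(-2.0, 2.0)

# The constraint holds at every grid point inside the interval ...
grid = [-2.0 + 4.0 * k / 2000 for k in range(2001)]
inside = all(sector_qc_holds(v, alpha, beta) for v in grid)

# ... but may fail outside it, which is why the bound is only local.
outside = sector_qc_holds(4.0, alpha, beta)

print(alpha, beta, inside, outside)
```

A tighter interval gives a larger `alpha` and hence a narrower sector, which is the usual reason for using local rather than global bounds in this kind of analysis.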

Similar resources

Safe Model-based Reinforcement Learning with Stability Guarantees

Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorith...

Imitation Learning with THOR

The recently proposed House Of inteRactions (AI2THOR) framework [35] provides a simulation environment for high-quality 3D scenes. Together with THOR, a Target-driven model is introduced to improve generalization capabilities. Imitation learning, or learning by demonstration, is known to be more effective in communicating a task. In this project, we extend the Target-driven model by exploring both ...

Experimentation, Imitation, and Stochastic Stability

Do boundedly rational agents repeatedly playing a symmetric game with a unique symmetric equilibrium learn over time to play it? In this paper we model the dynamic interaction of two types of such agents, experimenters and imitators, whose behavior is characterized by simple rules of thumb. We find that the stochastic process describing their play is stable in the large: it converges globally a...

Imitation Learning with Demonstrations and Shaping Rewards

Imitation Learning (IL) is a popular approach for teaching behavior policies to agents by demonstrating the desired target policy. While the approach has led to many successes, IL often requires a large set of demonstrations to achieve robust learning, which can be expensive for the teacher. In this paper, we consider a novel approach to improve the learning efficiency of IL by providing a sha...

Imitation : learning and communication

This paper focuses on our works on imitation in autonomous robots. In a first part, we take into account recent studies in the field of developmental psychology and consider the two functions of imitation (learning and communication) that these studies have stressed. In a second part, we propose the idea that a proto imitative behavior can be induced in a mobile robot via a limitation of its vi...

Journal

Journal title: IEEE Control Systems Letters

Year: 2022

ISSN: 2475-1456

DOI: https://doi.org/10.1109/lcsys.2021.3077861