Imitation Learning With Stability and Safety Guarantees
نویسندگان
چکیده
A method is presented to learn neural network (NN) controllers with stability and safety guarantees through imitation learning (IL). Convex conditions are derived for linear time-invariant systems NN by merging Lyapunov theory local quadratic constraints bound the activation functions in NN. These incorporated IL process, which minimizes loss, maximizes volume of region attraction associated controller simultaneously. An alternating direction multipliers based algorithm proposed solve problem. The illustrated on a vehicle lateral control example.
منابع مشابه
Safe Model-based Reinforcement Learning with Stability Guarantees
Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorith...
متن کاملImitation Learning with THOR
The recently proposed House Of inteRactions (AI2THOR) framework [35] provides an simulation environment for high quality 3D scenes. Together with THOR, a Targetdriven model is introduced to improve generalization capabilities. Imitation learning or learning by demonstration is known to be more effective in communicating task. In this project, we extend the Target-driven model by exploring both ...
متن کاملExperimentation, Imitation, and Stochastic Stability
Do boundedly rational agents repeatedly playing a symmetric game with a unique symmetric equilibrium learn over time to play it? In this paper we model the dynamic interaction of two types of such agents, experimenters and imitators, whose behavior is characterized by simple rules of thumb. We find that the stochastic process describing their play is stable in the large: it converges globally a...
متن کاملImitation Learning with Demonstrations and Shaping Rewards
Imitation Learning (IL) is a popular approach for teaching behavior policies to agents by demonstrating the desired target policy. While the approach has lead to many successes, IL often requires a large set of demonstrations to achieve robust learning, which can be expensive for the teacher. In this paper, we consider a novel approach to improve the learning efficiency of IL by providing a sha...
متن کاملImitation : learning and communication
This paper focuses on our works on imitation in autonomous robots. In a first part, we take into account recent studies in the field of developmental psychology and consider the two functions of imitation (learning and communication) that these studies have stressed. In a second part, we propose the idea that a proto imitative behavior can be induced in a mobile robot via a limitation of its vi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Control Systems Letters
سال: 2022
ISSN: ['2475-1456']
DOI: https://doi.org/10.1109/lcsys.2021.3077861