Deep Q‐learning: A robust control approach

نویسندگان

چکیده

This work aims at constructing a bridge between robust control theory and reinforcement learning. Although, learning has shown admirable results in complex tasks, the agent's behavior is opaque. Meanwhile, system several tools for analyzing controlling dynamical systems. article places deep Q-learning into control-oriented perspective to study its dynamics with well-established techniques from control. An uncertain linear time-invariant model formulated by means of neural tangent kernel describe novel approach allows giving conditions stability (convergence) enables analysis frequency-domain. The makes it possible formulate controllers that inject rewards as input loss function achieve better convergence properties. Three output-feedback are synthesized: gain scheduling ℋ 2 $$ {\mathscr{H}}_2 , ∞ {\mathscr{H}}_{\infty } fixed-structure controllers. Compared traditional techniques, which involve heuristics, setting up agent tuning methodology more transparent literature. proposed does not use target network randomized replay memory. role overtaken input, also exploits temporal dependency samples (opposed memory buffer). Numerical simulations different OpenAI Gym environments suggest controlled can converge faster receive higher scores (depending on environment) compared benchmark double Q-learning.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Control of a Quadrotor

In this paper, a robust tracking control method for automatic take-off and trajectory tracking of a quadrotor helicopter is presented. The designed controller includes two parts: a position controller and an attitude controller. The attitude controller is designed by using the sliding mode control (SMC) method to track the desired pitch and roll angles, which are the output of position controll...

متن کامل

Automatic Generation Control: A Decentralized Robust Approach

Automatic Generation Control (AGC) problem has been one of the major subjects in electric power system design/operation and is becoming much more significant today in accordance with increasing size, changing structure and complexity of interconnected power systems. In practice, AGC systems use simple proportional-integral (PI) controllers. However, since the PI controller parameters are usuall...

متن کامل

A New Approach to Adaptive Robust Control

A new approach is given for the design of adaptive robust control in the frequency domain. Starting with an initial model of a stable plant and a robust stabilizing controller, the new (windsurfer) approach allows the bandwidth of the closed-loop system to he increased progressively through an iterative controlrelevant system identification and control design procedure. The method deals with bo...

متن کامل

Embedded Model Control approach to robust control

Robust control design is mainly devoted to guarantee closed-loop stability of a model-based control law in presence of parametric and structural uncertainties. The control law is usually a complex feedback law which is derived from a (nonlinear) model, possibly complemented with some mathematical envelope of the model uncertainty. Stability may be guarantee with the help of some ignorance coeff...

متن کامل

robust control of a quadrotor

in this paper, a robust tracking control method for automatic take-off and trajectory tracking of a quadrotor helicopter is presented. the designed controller includes two parts: a position controller and an attitude controller. the attitude controller is designed by using the sliding mode control (smc) method to track the desired pitch and roll angles, which are the output of position controll...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Robust and Nonlinear Control

سال: 2022

ISSN: ['1049-8923', '1099-1239']

DOI: https://doi.org/10.1002/rnc.6457