cart pole inverted pendulum

Search results for: cart pole inverted pendulum

Number of results: 51899

Partially Adaptive Control of an Inverted Pendulum and Cart System

Journal: Transactions of the Institute of Systems, Control and Information Engineers, 2001

A new controller for the inverted pendulum on a cart

Journal: International Journal of Robust and Nonlinear Control, 2008

Diff-DAC: Distributed Actor-Critic for Multitask Deep Reinforcement Learning

Journal: CoRR, 2017

Sergio Valcarcel Macua, Aleksi Tukiainen, Daniel García-Ocaña Hernández, David Baldazo, Enrique Munoz de Cote, Santiago Zazo

We propose a multiagent distributed actor-critic algorithm for multitask reinforcement learning (MRL), named Diff-DAC. The agents are connected, forming a (possibly sparse) network. Each agent is assigned a task and has access to data from this local task only. During the learning process, the agents are able to communicate some parameters to their neighbors. Since the agents incorporate their ...

Reinforcement learning transfer via sparse coding

2012

Haitham Bou-Ammar, Karl Tuyls, Matthew E. Taylor, Kurt Driessens, Gerhard Weiss

Although reinforcement learning (RL) has been successfully deployed in a variety of tasks, learning speed remains a fundamental problem for applying RL in complex environments. Transfer learning aims to ameliorate this shortcoming by speeding up learning through the adaptation of previously learned behaviors in similar tasks. Transfer techniques often use an inter-task mapping, which determines...

Adaptation of a Fuzzy Controller’s Scaling Gains Using Genetic Algorithms for Balancing an Inverted Pendulum

2011

Adrian-Vasile Duka

This paper examines the development of a genetic adaptive fuzzy control system for the Inverted Pendulum. The inverted pendulum is a classical problem in Control Engineering, used for testing different control algorithms. The goal is to balance the inverted pendulum in the upright position by controlling the horizontal force applied to its cart. Because it is unstable and has a complicated nonl...
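The balancing task described in this abstract can be illustrated with a minimal simulation sketch. It assumes the standard frictionless cart-pole equations of motion, Euler integration, and illustrative parameter values; none of these come from the paper. The proportional-derivative controller (`pd_controller`, a hypothetical name) is only a stand-in for the paper's genetic adaptive fuzzy controller, and it regulates the pole angle only, not the cart position.

```python
import math

def cartpole_step(state, force, dt=0.02, m_cart=1.0, m_pole=0.1,
                  half_len=0.5, g=9.8):
    """Advance the frictionless cart-pole model by one Euler step.

    state = (x, x_dot, theta, theta_dot); theta is measured from upright.
    All parameter values are illustrative assumptions.
    """
    x, x_dot, theta, theta_dot = state
    total = m_cart + m_pole
    sin_t, cos_t = math.sin(theta), math.cos(theta)
    # Shared term: applied force plus the centrifugal effect of the pole.
    temp = (force + m_pole * half_len * theta_dot ** 2 * sin_t) / total
    theta_acc = (g * sin_t - cos_t * temp) / (
        half_len * (4.0 / 3.0 - m_pole * cos_t ** 2 / total))
    x_acc = temp - m_pole * half_len * theta_acc * cos_t / total
    return (x + dt * x_dot, x_dot + dt * x_acc,
            theta + dt * theta_dot, theta_dot + dt * theta_acc)

def pd_controller(state, kp=50.0, kd=10.0):
    """Hypothetical stand-in controller: push the cart toward the lean."""
    _, _, theta, theta_dot = state
    return kp * theta + kd * theta_dot

def simulate(state, controller, steps):
    """Run the closed loop for a fixed number of steps."""
    for _ in range(steps):
        state = cartpole_step(state, controller(state))
    return state

if __name__ == "__main__":
    start = (0.0, 0.0, 0.05, 0.0)  # pole tilted 0.05 rad from upright
    print("uncontrolled:", simulate(start, lambda s: 0.0, 50))
    print("PD-controlled:", simulate(start, pd_controller, 200))
```

With zero force the tilt grows, which is the instability the abstract refers to; with the stand-in controller the angle decays toward zero while the cart is free to drift.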

Towards Verification of Artificial Neural Networks

2015

Karsten Scheibler, Leonore Winterer, Ralf Wimmer, Bernd Becker

We consider the safety verification of controllers obtained via machine learning. This is an important problem as the employed machine learning techniques work well in practice, but cannot guarantee safety of the produced controller, which is typically represented as an artificial neural network. Nevertheless, such methods are used in safety-critical environments. In this paper we take a typica...

A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Journal: Neurocomputing, 2012

Haibo He, Zhen Ni, Jian Fu

In this paper, we propose a novel adaptive dynamic programming (ADP) architecture with three networks (an action network, a critic network, and a reference network) to develop an internal goal representation for online learning and optimization. Unlike the traditional ADP design, which normally has an action network and a critic network, our approach integrates the third network, a reference network, int...

State Dependent Switching Control for Inverted Pendulum System

2005

Satoko YAMAKAWA, Atsushi YAMADA, Hideo FUJIMOTO

A switching control strategy for inverted pendulum systems is proposed based on the energy function and its transitions. We deal with a simplified second order model of cart-pendulum systems. Assume a virtual energy whose potential energy has an opposite sign. Paying attention to the motions of the pendulum, the energy changes are analyzed theoretically. According to these analyses, the conditi...

Stabilization of the Pendulum on a Rotor Arm by the Method of Controlled Lagrangians

1999

Anthony M. Bloch, Naomi Ehrich Leonard, Jerrold E. Marsden

This paper obtains feedback stabilization of an inverted pendulum on a rotor arm by the “method of controlled Lagrangians”. This approach involves modifying the Lagrangian for the uncontrolled system so that the Euler-Lagrange equations derived from the modified or “controlled” Lagrangian describe the closed-loop system. For the closed-loop equations to be consistent with available control inpu...
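The idea in this abstract can be written schematically as follows. This is a generic sketch rather than the paper's actual construction: the symbols $q$ (configuration), $L$ (original Lagrangian), $L_c$ (controlled Lagrangian), and $u$ (control force, acting only in the actuated directions) are notation assumed here for illustration.

```latex
% Uncontrolled system with a control force u added in the actuated directions:
\frac{d}{dt}\frac{\partial L}{\partial \dot{q}} - \frac{\partial L}{\partial q} = u
% Closed-loop system, described by the unforced Euler-Lagrange equations of L_c:
\frac{d}{dt}\frac{\partial L_c}{\partial \dot{q}} - \frac{\partial L_c}{\partial q} = 0
```

The feedback law $u$ is then whatever makes the two sets of equations agree, which is only possible if the modification to $L$ leaves the unactuated equations unchanged.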

Leveraging Human Movement in the Ultimate Display

2012

Rohan J. McAdam, Keith Nesbitt

Human movement is a “natural skill” employed to solve difficult problems in dynamics concerning the manipulation of a complex biomechanical system, the body, in an ever-changing environment. Continuous Interactive Simulation (CIS) is a technique that attempts to use this human capacity to solve problems in movement dynamics to solve problems concerning arbitrary dynamical systems. In this paper...
