The Adaptive Dynamic Programming Toolbox
نویسندگان
چکیده
The paper develops the adaptive dynamic programming toolbox (ADPT), which is a MATLAB-based software package and computationally solves optimal control problems for continuous-time control-affine systems. ADPT produces approximate feedback controls by employing technique solving Hamilton–Jacobi–Bellman equation approximately. A novel implementation method derived to optimize memory consumption throughout its execution. supports two working modes: model-based mode model-free mode. In former mode, computes provided system dynamics. latter are generated from measurements of trajectories, without requirement knowledge model. Multiple setting options in ADPT, such that various customized circumstances can be accommodated. Compared other popular toolboxes control, features computational precision time efficiency, illustrated with applications highly non-linear satellite attitude problem.
منابع مشابه
Self-teaching adaptive dynamic programming for Gomoku
In this paper adaptive dynamic programming (ADP) is applied to learn to play Gomoku. The critic network is used to evaluate board situations. The basic idea is to penalize the last move taken by the loser and reward the last move selected by the winner at the end of a game. The results show that the presented program is able to improve its performance by playing against itself and has approache...
متن کاملA dynamic programming approach to adaptive fractionation.
We conduct a theoretical study of various solution methods for the adaptive fractionation problem. The two messages of this paper are as follows: (i) dynamic programming (DP) is a useful framework for adaptive radiation therapy, particularly adaptive fractionation, because it allows us to assess how close to optimal different methods are, and (ii) heuristic methods proposed in this paper are ne...
متن کاملOptimal Asset Allocation using Adaptive Dynamic Programming
In recent years, the interest of investors has shifted to computerized asset allocation (portfolio management) to exploit the growing dynamics of the capital markets. In this paper, asset allocation is formalized as a Markovian Decision Problem which can be optimized by applying dynamic programming or reinforcement learning based algorithms. Using an artificial exchange rate, the asset allocati...
متن کاملBounded Rationality : The Adaptive Toolbox edited
It is a curious feature of 20-century academia that the branch of mathematics known as rational choice theory led two of the most influential approaches to human behaviour to make completely different assumptions about the workings of the human mind. Biologists use rational choice theory to model the effects of natural selection on populations of genes. For a given range of alleles and their ph...
متن کاملExtending the Radar Dynamic Range using Adaptive Pulse Compression
The matched filter in the radar receiver is only adapted to the transmitted signal version and its output will be wasted due to non-matching with the received signal from the environment. The sidelobes amplitude of the matched filter output in pulse compression radars are dependent on the transmitted coded waveforms that extended as much as the length of the code on both sides of the target loc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sensors
سال: 2021
ISSN: ['1424-8220']
DOI: https://doi.org/10.3390/s21165609