ARES: Adaptive Receding-Horizon Synthesis of Optimal Plans
Abstract
We introduce ARES, an efficient approximation algorithm for generating optimal plans (action sequences) that take an initial state of a Markov Decision Process (MDP) to a state whose cost is below a specified (convergence) threshold. ARES uses Particle Swarm Optimization, with adaptive sizing for both the receding horizon and the particle swarm. Inspired by Importance Splitting, the length of the horizon and the number of particles are chosen such that at least one particle reaches a next-level state, that is, a state where the cost decreases by a required delta from the previous-level state. The level relation on states and the plans constructed by ARES implicitly define a Lyapunov function and an optimal policy, respectively, both of which could be explicitly generated by applying ARES to all states of the MDP, up to some topological equivalence relation. We also assess the effectiveness of ARES by statistically evaluating its rate of success in generating optimal plans. The ARES algorithm resulted from our desire to clarify whether flying in V-formation is a flocking policy that optimizes energy conservation, clear view, and velocity alignment. That is, we wanted to see whether one could find optimal plans that bring a flock from an arbitrary initial state to a state exhibiting a single connected V-formation. For flocks with 7 birds, ARES is able to generate a plan that leads to a V-formation in 95% of 8,000 random initial configurations within 63 seconds, on average. ARES can also be easily customized into a model-predictive controller (MPC) with an adaptive receding horizon and statistical guarantees of convergence. To the best of our knowledge, our adaptive-sizing approach is the first to provide convergence guarantees in receding-horizon techniques.
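The level-by-level idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy dynamics (`step`), the cost function, the PSO coefficients, and the schedule for growing the horizon and doubling the swarm (`h_max`, `p0`, `p_max`) are all assumptions chosen for illustration. The sketch only shows the general pattern of accepting a plan segment once some particle reaches a state whose cost is at least `delta` below the previous level.

```python
import random

def step(state, action):
    # Hypothetical toy dynamics: move a 2-D point by a bounded action.
    return (state[0] + action[0], state[1] + action[1])

def cost(state):
    # Hypothetical cost: squared distance to the origin.
    return state[0] ** 2 + state[1] ** 2

def rollout(state, plan):
    # A plan is a flat list [ax0, ay0, ax1, ay1, ...].
    for i in range(0, len(plan), 2):
        state = step(state, (plan[i], plan[i + 1]))
    return state

def pso(state, horizon, particles, iters, rng, bound=0.5):
    # Plain global-best PSO over action sequences of length `horizon`.
    dim = 2 * horizon
    score = lambda p: cost(rollout(state, p))
    pos = [[rng.uniform(-bound, bound) for _ in range(dim)]
           for _ in range(particles)]
    vel = [[0.0] * dim for _ in range(particles)]
    pbest = [p[:] for p in pos]
    pbest_cost = [score(p) for p in pos]
    g = min(range(particles), key=pbest_cost.__getitem__)
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]
    for _ in range(iters):
        for i in range(particles):
            for d in range(dim):
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * rng.random() * (pbest[i][d] - pos[i][d])
                             + 1.5 * rng.random() * (gbest[d] - pos[i][d]))
                # Keep every action component inside the allowed bound.
                pos[i][d] = max(-bound, min(bound, pos[i][d] + vel[i][d]))
            c = score(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i][:], c
    return gbest, gbest_cost

def adaptive_plan(state, threshold, delta, rng, h_max=4, p0=5, p_max=40):
    # Assumed schedule: try horizons 1..h_max; if none reaches the next
    # level, double the swarm size and retry; give up beyond p_max.
    plan = []
    level_cost = cost(state)
    while level_cost > threshold:
        found, particles = False, p0
        while particles <= p_max and not found:
            for horizon in range(1, h_max + 1):
                segment, c = pso(state, horizon, particles, 30, rng)
                if c <= level_cost - delta or c <= threshold:
                    found = True
                    break
            if not found:
                particles *= 2
        if not found:
            return None  # no next-level state found within the budget
        state = rollout(state, segment)
        plan.extend(segment)
        level_cost = cost(state)
    return plan

if __name__ == "__main__":
    rng = random.Random(0)
    result = adaptive_plan((2.0, 1.0), threshold=0.01, delta=0.1, rng=rng)
    print("plan length (actions):", None if result is None else len(result) // 2)
```

In this sketch a longer horizon or a larger swarm is used only when a shorter or smaller one fails to reach the next level, which is one simple way to realize adaptive sizing. The toy system is deterministic, so the sketch does not model the stochastic side of an MDP.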
Similar resources
Design of Distributed Optimal Adaptive Receding Horizon Control for Supply Chain of Realistic Size under Demand Disturbances
Keywords: supply chain network, receding horizon control, demand, move suppression term. Supply chain networks are interconnection and dynamics of a demand network. Example subsystems, referred to as stages, include raw materials, distributors of the raw materials, manufacturers, distributors of the manufactured products, retailers, and customers. The main objectives of the control strategy for the s...
Matrix Functions and Matrix Equations
Solving large-scale algebraic Riccati equations (AREs) is one of the central tasks in solving optimal control problems for linear and, using receding-horizon techniques, also nonlinear instationary partial differential equations. Large-scale AREs also occur in several model reduction methods for dynamical systems. Due to sparsity and large dimensions of the resulting coefficient matrices, stand...
Minimum-Time Travel for a Vehicle with Acceleration Limits: Theoretical Analysis and Receding Horizon Implementation
A methodology is proposed to generate minimum-time optimal velocity profiles for a vehicle with prescribed acceleration limits along a specified path. The necessary optimality conditions are explicitly derived, allowing the construction of the optimal solution semi-analytically. A receding horizon implementation is also proposed for the on-line implementation of the velocity optimizer. Robustne...
Spacecraft Attitude Control Using Approximate Receding-Horizon Model-Error Control Synthesis
Model-error control synthesis is a nonlinear robust control approach that mitigates the effects of modeling errors and disturbances on a system by providing corrections to the nominal control input directly. In this paper model-error control synthesis is applied to the spacecraft attitude control problem, where the model-error vector is computed using a receding-horizon approximation. ...
Optimal Receding Horizon Filter for Continuous-Time Nonlinear Stochastic Systems
A receding horizon filtering problem for nonlinear continuous-time stochastic systems is considered. The paper presents the optimal receding horizon filtering equations. Derivation of the equations is based on the Kushner-Stratonovich and Fokker-Planck-Kolmogorov equations for conditional and unconditional density functions. This result could be a theoretical basis for the optimal control in no...