Inverse Optimal Control with Incomplete Observations
نویسندگان
چکیده
In this article, we consider the inverse optimal control problem given incomplete observations of an optimal trajectory. We hypothesize that the cost function is constructed as a weighted sum of relevant features (or basis functions). We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given candidate features. The rank of the recovery matrix indicates whether a subset of relevant features can be found among the candidate features and the corresponding weights can be recovered. Additional observations tend to increase the rank of the recovery matrix, thus enabling cost function recovery. We also show that the recovery matrix can be computed iteratively. Based on the recovery matrix, a methodology for using incomplete observations of the trajectory to recover the weights of specified features is established, and an efficient algorithm for recovering the feature weights by finding the minimal required observations is developed. We apply the proposed algorithm to learning the cost function of a simulated robot manipulator conducting free-space motions. The results demonstrate the stable, accurate and robust performance of the proposed approach compared to state of the art techniques.
منابع مشابه
Inverse Kinematics Resolution of Redundant Cooperative Manipulators Using Optimal Control Theory
The optimal path planning of cooperative manipulators is studied in the present research. Optimal Control Theory is employed to calculate the optimal path of each joint choosing an appropriate index of the system to be minimized and taking the kinematics equations as the constraints. The formulation has been derived using Pontryagin Minimum Principle and results in a Two Point Boundary Value Pr...
متن کاملInverse Kinematics Resolution of Redundant Cooperative Manipulators Using Optimal Control Theory
The optimal path planning of cooperative manipulators is studied in the present research. Optimal Control Theory is employed to calculate the optimal path of each joint choosing an appropriate index of the system to be minimized and taking the kinematics equations as the constraints. The formulation has been derived using Pontryagin Minimum Principle and results in a Two Point Boundary Value Pr...
متن کاملA non-linear optimal estimation inverse method for radio occultation measurements of temperature, humidity and surface pressure
An optimal estimation inverse method is presented which can be used to retrieve simultaneously vertical profiles of temperature and specific humidity, in addition to surface pressure, from satellite-to-satellite radio occultation observations of the Earth’s atmosphere. The method is a non-linear, maximum a posteriori technique which can accommodate most aspects of the real radio occultation pro...
متن کاملAn Analytical Solution for Inverse Determination of Residual Stress Field
An analytical solution is presented that reconstructs residual stress field from limited and incomplete data. The inverse problem of reconstructing residual stresses is solved using an appropriate form of the airy stress function. This function is chosen to satisfy the stress equilibrium equations together with the boundary conditions for a domain within a convex polygon. The analytical solu...
متن کاملExamination of Quadrotor Inverse Simulation Problem Using Trust-Region Dogleg Solution Method
In this paper, the particular solution technique for inverse simulation applied to the quadrotor maneuvering flight is investigated. The trust-region dogleg (DL) technique which is proposed alleviates the weakness of Newton’s method used for numerical differentiation of system states in the solution process. The proposed technique emphasizes global convergence solution to the inverse simulatio...
متن کامل