A Sums-of-Squares Extension of Policy Iterations
نویسندگان
چکیده
In order to address the imprecision often introduced by widening operators, policy iteration based on min-computations amounts to consider the characterization of reachable states of a program as an iterative computation of policies, starting from a post-fixpoint. Computing each policy and the associated invariant relies on a sequence of numerical optimizations. While the early papers rely on LP to address linear properties of linear programs, the current state of the art is still limited to the analysis of linear programs with at most quadratic invariant, relying on Semi-Definite Programming (SDP) solvers to compute the next policy, and LP solvers to solve the selected policy. We propose here to extend the class of programs considered through the use of Sums-of-Squares (SOS) optimizations. Our approach enables the precise analysis of switched systems with polynomial assigns and guards. The analysis presented has been implemented in Matlab and applied on existing programs, improving both the set of systems analyzable and the precision of analyzed ones.
منابع مشابه
Adaptive iterative reweighted least squares design of Lp FIR filters
This paper presents an efficient adaptive algorithm for designing FIR digital filters that are efficient according to an Lp error criteria. The algorithm is an extension of Burrus’ iterative reweighted least-squares (IRLS) method for approximating Lp filters. Such algorithm will converge for most significant cases in a few iterations. In some cases however, the transition bandwidth is such that...
متن کاملIterative reweighted least-squares design of FIR filters
ABSTRACT This paper presents an efficient adaptive algorithm for designing FIR digital filters that are efficient according to an error criteria. The algorithm is an extension of Burrus’ iterative reweighted least-squares (IRLS) method for approximating filters. Such algorithm will converge for most significant cases in a few iterations. In some cases however, the transition bandwidth is such t...
متن کاملA Rational Function Whose Integral Values Are Sums of Two Squares
A problem from the 1988 IMO asserts that for positive integers a and b the set of integral values assumed by (a + b)/(ab + 1) is exactly the set of positive squares. We present an extension of this result involving a rational function in three variables whose integral values consist of precisely those numbers expressible as a sum of two positive squares. This immediately implies that a certain ...
متن کاملLeast-squares methods for policy iteration
Approximate reinforcement learning deals with the essential problem of applying reinforcement learning in large and continuous state-action spaces, by using function approximators to represent the solution. This chapter reviews least-squares methods for policy iteration, an important class of algorithms for approximate reinforcement learning. We discuss three techniques for solving the core, po...
متن کاملRings of Integers, Gauss-jacobi Sums, and Their Applications
In this paper we shall explore the structure of the ring of algebraic integers in any quadratic extension of the field of rational numbers Q, develop the concepts of Gauss and Jacobi sums, and apply the theory of algebraic integers and that of Gauss-Jacobi sums to solving problems involving power congruences and power sums as well as to proving the quadratic and cubic reciprocity laws. In parti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1503.08090 شماره
صفحات -
تاریخ انتشار 2015