A Sums-of-Squares Extension of Policy Iterations

نویسندگان

  • Assalé Adjé
  • Pierre-Loïc Garoche
  • Victor Magron
چکیده

In order to address the imprecision often introduced by widening operators, policy iteration based on min-computations amounts to consider the characterization of reachable states of a program as an iterative computation of policies, starting from a post-fixpoint. Computing each policy and the associated invariant relies on a sequence of numerical optimizations. While the early papers rely on LP to address linear properties of linear programs, the current state of the art is still limited to the analysis of linear programs with at most quadratic invariant, relying on Semi-Definite Programming (SDP) solvers to compute the next policy, and LP solvers to solve the selected policy. We propose here to extend the class of programs considered through the use of Sums-of-Squares (SOS) optimizations. Our approach enables the precise analysis of switched systems with polynomial assigns and guards. The analysis presented has been implemented in Matlab and applied on existing programs, improving both the set of systems analyzable and the precision of analyzed ones.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive iterative reweighted least squares design of Lp FIR filters

This paper presents an efficient adaptive algorithm for designing FIR digital filters that are efficient according to an Lp error criteria. The algorithm is an extension of Burrus’ iterative reweighted least-squares (IRLS) method for approximating Lp filters. Such algorithm will converge for most significant cases in a few iterations. In some cases however, the transition bandwidth is such that...

متن کامل

Iterative reweighted least-squares design of FIR filters

ABSTRACT This paper presents an efficient adaptive algorithm for designing FIR digital filters that are efficient according to an error criteria. The algorithm is an extension of Burrus’ iterative reweighted least-squares (IRLS) method for approximating filters. Such algorithm will converge for most significant cases in a few iterations. In some cases however, the transition bandwidth is such t...

متن کامل

A Rational Function Whose Integral Values Are Sums of Two Squares

A problem from the 1988 IMO asserts that for positive integers a and b the set of integral values assumed by (a + b)/(ab + 1) is exactly the set of positive squares. We present an extension of this result involving a rational function in three variables whose integral values consist of precisely those numbers expressible as a sum of two positive squares. This immediately implies that a certain ...

متن کامل

Least-squares methods for policy iteration

Approximate reinforcement learning deals with the essential problem of applying reinforcement learning in large and continuous state-action spaces, by using function approximators to represent the solution. This chapter reviews least-squares methods for policy iteration, an important class of algorithms for approximate reinforcement learning. We discuss three techniques for solving the core, po...

متن کامل

Rings of Integers, Gauss-jacobi Sums, and Their Applications

In this paper we shall explore the structure of the ring of algebraic integers in any quadratic extension of the field of rational numbers Q, develop the concepts of Gauss and Jacobi sums, and apply the theory of algebraic integers and that of Gauss-Jacobi sums to solving problems involving power congruences and power sums as well as to proving the quadratic and cubic reciprocity laws. In parti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1503.08090  شماره 

صفحات  -

تاریخ انتشار 2015