Tile Coding Based on Hyperplane Tiles

نویسندگان

  • Daniele Loiacono
  • Pier Luca Lanzi
چکیده

In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-value function. We compared the performance of hyperplane tile coding with the usual tile coding on three well-known benchmark problems. Our results suggest that the hyperplane tiles improve the generalization capabilities of the tile coding approximator: in the hyperplane tile coding broad generalizations over the problem space result only in a soft degradation of the performance, whereas in the usual tile coding they might dramatically affect the performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Studying the time, place and the artist assignment of polychrome tile work of decorated archway

      The polychrome tile work of the Qajar era reveals the identical illustration, which has a different artistic style comparing to former eras. Shiraz tile makers of this period provide polychrome tiles with specific features according to costumes and public folklore and established Shiraz school of painted tile work. This article concentrates on the decorated tile work of an archway at the ...

متن کامل

Adaptive Tile Coding for Value Function Approximation

Reinforcement learning problems are commonly tackled by estimating the optimal value function. In many real-world problems, learning this value function requires a function approximator, which maps states to values via a parameterized function. In practice, the success of function approximators depends on the ability of the human designer to select an appropriate representation for the value fu...

متن کامل

Adaptive Tile Coding for Value Function Approximation

Reinforcement learning problems are commonly tackled by estimating the optimal value function. In many real-world problems, learning this value function requires a function approximator, which maps states to values via a parameterized function. In practice, the success of function approximators depends on the ability of the human designer to select an appropriate representation for the value fu...

متن کامل

A generalization of quad-trees applied to image coding

Although quad-trees are not the most successful strategy in image coding, some generalized subdivision schemes have been proposed recently. This work exploits a moderate generalization of quad-trees where tiles are not restricted to be split in both dimensions, which leads to a previously developed graph of anisotropic tiles called “bush”. An algorithm is developed to find the minimal number of...

متن کامل

Fast Rendering of Massive Textured Terrain Data

Rendering of textured terrain models has become a widely used technique in the field of GIS applications and virtual reality. In this paper, we propose a framework based on tile-pyramid model and linear quadtree tile-index, which enable the real-time rendering of out-of-core terrain data sets while guaranteeing geometric and texture accuracy. The digital elevation model tile pyramid and the ort...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008