Improvements on Learning Tetris with Cross Entropy

نویسندگان

Christophe Thiery

Bruno Scherrer

چکیده

For playing the game of Tetris well, training a controller by the cross-entropy method seems to be a viable way (Szita and Lőrincz, 2006; Thiery and Scherrer, 2009). We consider this method to tune an evaluation-based one-piece controller as suggested by Szita and Lőrincz and we introduce some improvements. In this context, we discuss the influence of the noise, and we perform experiments with several sets of features such as those introduced by Bertsekas and Tsitsiklis (1996), by Dellacherie (Fahey, 2003), and some original features. This approach leads to a controller that outperforms the previous known results. On the original game of Tetris, we show that with probability 0.95 it achieves at least 910, 000 ± 5% lines per game on average. On a simplified version of Tetris considered by most research works, it achieves 35, 000, 000 ± 20% lines per game on average. We used this approach when we took part with the program BCTS in the 2008 Tetris domain Reinforcement Learning Competition and won the competition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Notes Improvements on Learning Tetris with Cross-entropy

متن کامل

Learning Tetris Using the Noisy Cross-Entropy Method

The cross-entropy method is an efficient and general optimization algorithm. However, its applicability in reinforcement learning (RL) seems to be limited because it often converges to suboptimal policies. We apply noise for preventing early convergence of the cross-entropy method, using Tetris, a computer game, for demonstration. The resulting policy outperforms previous RL algorithms by almos...

متن کامل

Cross-Entropy Method for Reinforcement Learning

Reinforcement Learning methods have been succesfully applied to various optimalization problems. Scaling this up to real world sized problems has however been more of a problem. In this research we apply Reinforcement Learning to the game of Tetris which has a very large state space. We not only try to learn policies for Standard Tetris but try to learn parameterized policies for Generalized Te...

متن کامل

Tetris-: Exploring Human Performance via Cross Entropy Reinforcement Learning Models

What can a machine learning simulation tell us about human performance in a complex, real-time task such as TetrisTM? Although Tetris is often used as a research tool (Mayer, 2014), the strategies and methods used by Tetris players have seldom been the explicit focus of study. In Study 1, we use cross-entropy reinforcement learning (CERL) (Szita & Lorincz, 2006; Thiery & Scherrer, 2009) to expl...

متن کامل

Comparing reinforcement learning in humans and artificial intelligence through Tetris

Tetris has a long history in Artificial Intelligence (Fahey, 2014) and Cognitive Science (Mayer, in press). We combine both traditions to ask, ”what can Tetris and these two approaches to Tetris, tell us about human expertise?” In our research, we use Cross-Entropy Reinforcement Learning (Szita and Larincz, 2006) to produce two very different types of models; an unsupervised version trained dir...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

ICGA Journal

دوره 32 شماره

صفحات -

تاریخ انتشار 2009

Improvements on Learning Tetris with Cross Entropy

نویسندگان

چکیده

منابع مشابه

Notes Improvements on Learning Tetris with Cross-entropy

Learning Tetris Using the Noisy Cross-Entropy Method

Cross-Entropy Method for Reinforcement Learning

Tetris-: Exploring Human Performance via Cross Entropy Reinforcement Learning Models

Comparing reinforcement learning in humans and artificial intelligence through Tetris

عنوان ژورنال:

اشتراک گذاری