Improvements on Learning Tetris with Cross Entropy
نویسندگان
چکیده
For playing the game of Tetris well, training a controller by the cross-entropy method seems to be a viable way (Szita and Lőrincz, 2006; Thiery and Scherrer, 2009). We consider this method to tune an evaluation-based one-piece controller as suggested by Szita and Lőrincz and we introduce some improvements. In this context, we discuss the influence of the noise, and we perform experiments with several sets of features such as those introduced by Bertsekas and Tsitsiklis (1996), by Dellacherie (Fahey, 2003), and some original features. This approach leads to a controller that outperforms the previous known results. On the original game of Tetris, we show that with probability 0.95 it achieves at least 910, 000 ± 5% lines per game on average. On a simplified version of Tetris considered by most research works, it achieves 35, 000, 000 ± 20% lines per game on average. We used this approach when we took part with the program BCTS in the 2008 Tetris domain Reinforcement Learning Competition and won the competition.
منابع مشابه
Notes Improvements on Learning Tetris with Cross-entropy
For playing the game of Tetris well, training a controller by the cross-entropy method seems to be a viable way (Szita and Lőrincz, 2006; Thiery and Scherrer, 2009). We consider this method to tune an evaluation-based one-piece controller as suggested by Szita and Lőrincz and we introduce some improvements. In this context, we discuss the influence of the noise, and we perform experiments with ...
متن کاملLearning Tetris Using the Noisy Cross-Entropy Method
The cross-entropy method is an efficient and general optimization algorithm. However, its applicability in reinforcement learning (RL) seems to be limited because it often converges to suboptimal policies. We apply noise for preventing early convergence of the cross-entropy method, using Tetris, a computer game, for demonstration. The resulting policy outperforms previous RL algorithms by almos...
متن کاملCross-Entropy Method for Reinforcement Learning
Reinforcement Learning methods have been succesfully applied to various optimalization problems. Scaling this up to real world sized problems has however been more of a problem. In this research we apply Reinforcement Learning to the game of Tetris which has a very large state space. We not only try to learn policies for Standard Tetris but try to learn parameterized policies for Generalized Te...
متن کاملTetris-: Exploring Human Performance via Cross Entropy Reinforcement Learning Models
What can a machine learning simulation tell us about human performance in a complex, real-time task such as TetrisTM? Although Tetris is often used as a research tool (Mayer, 2014), the strategies and methods used by Tetris players have seldom been the explicit focus of study. In Study 1, we use cross-entropy reinforcement learning (CERL) (Szita & Lorincz, 2006; Thiery & Scherrer, 2009) to expl...
متن کاملComparing reinforcement learning in humans and artificial intelligence through Tetris
Tetris has a long history in Artificial Intelligence (Fahey, 2014) and Cognitive Science (Mayer, in press). We combine both traditions to ask, ”what can Tetris and these two approaches to Tetris, tell us about human expertise?” In our research, we use Cross-Entropy Reinforcement Learning (Szita and Larincz, 2006) to produce two very different types of models; an unsupervised version trained dir...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- ICGA Journal
دوره 32 شماره
صفحات -
تاریخ انتشار 2009