Q-Learning: A Tutorial and Extensions

Authors

  • George Cybenko
  • Robert Gray
  • Katsuhiro Moizumi
Abstract

In the past decade, research in neurocomputing has been divided into two relatively well-defined tracks: one track dealing with cognition and the other with behavior. Cognition deals with organizing, classifying and recognizing sensory stimuli. Behavior is more dynamic, involving sequences of actions and changing interactions with an external environment. The mathematical techniques that apply to these areas, at least from the point of neurocomputing, appear to have been quite separate as well. The purpose of this paper is to give an overview of some recent powerful mathematical results in behavioral neurocomputing, specifically the concept of Q-learning due to C. Watkins, and some new extensions. Finally, we propose ways in which the mathematics of cognition and the mathematics of behavior can move closer to build more unified systems of information processing and action.

Publication date: 1995