Search results for: keywords reinforcement learning

Number of results: 2453256

2005
Douglas Adams

Machine Learning is a field of research aimed at constructing intelligent machines that gain and improve their skills by learning and adaptation. As such, Machine Learning research addresses several classes of learning problems, including, for instance, supervised and unsupervised learning. Arguably, the most ubiquitous and realistic class of learning problems, faced by both living creatures and...

Journal: Computational Intelligence 2012
Spencer K. White Tony R. Martinez George L. Rudolph

ion in reinforcement learning. Artificial Intelligence, 112(1-2): 181–211. URBANOWICZ, R. J., and J. H. MOORE. 2009. Learning classifier systems: A complete introduction, review, and roadmap. Journal of Artificial Evolution and Applications, 2009. doi: 10.1155/2009/736398. WATKINS, C. J. 1989. Learning from delayed rewards. Ph.D. thesis, Cambridge University, Cambridge, UK. WHITE, S., T. R. MAR...

Journal: CoRR 2017
Megumi Miyashita Shiro Yano Toshiyuki Kondo

In recent years, attention has been focused on the relationship between black box optimization and reinforcement learning. Black box optimization is a framework for the problem of finding the input that optimizes the output represented by an unknown function. Reinforcement learning, by contrast, is a framework for finding a policy to optimize the expected cumulative reward from trial and error....

2003
Frank Hoffmann Örjan Ekeberg

In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course book Machine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) are good supplementary material. For further readi...

2004
Frank Hoffmann Örjan Ekeberg

In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course book Machine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) are good supplementary material. For further readin...

2018
Elien Segers Tom Beckers Hilde Geurts Laurence Claes Marina Danckaerts Saskia van der Oord

Citation: Segers E, Beckers T, Geurts H, Claes L, Danckaerts M and van der Oord S (2018) Working Memory and Reinforcement Schedule Jointly Determine Reinforcement Learning in Children: Potential Implications for Behavioral Parent Training. Front. Psychol. 9:394. doi: 10.3389/fpsyg.2018.00394 Working Memory and Reinforcement Schedule Jointly Determine Reinforcement Learning in Children: Potentia...

1999
David E. Moriarty Alan C. Schultz John J. Grefenstette

This article characterizes the evolutionary algorithm approach to reinforcement learning in relation to the more standard, temporal difference methods. We describe several research issues in reinforcement learning and discuss similarities and differences in how they are addressed by the two methods. A short survey of evolutionary reinforcement learning systems and their successful applications is...

Journal: The Journal of Neuroscience: the official journal of the Society for Neuroscience 2018
Raphael T Gerraty Juliet Y Davidow Karin Foerde Adriana Galvan Danielle S Bassett Daphna Shohamy

Complex learned behaviors must involve the integrated action of distributed brain circuits. While the contributions of individual regions to learning have been extensively investigated, much less is known about how distributed brain networks orchestrate their activity over the course of learning. To address this gap, we used fMRI combined with tools from dynamic network neuroscience to obtain t...

1992
Sebastian B. Thrun

Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in finite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. Whil...

Journal: مدیریت زنجیره تأمین (Supply Chain Management)
زهره کاهه رضا برادران کاظم زاده

In this paper, tender problems at an automobile company for procuring needed items from potential suppliers are solved with the Q-learning algorithm. In this case, the purchaser assigns orders for the needed parts to suppliers based on the proposals received from potential providers, including price and delivery time. The buyer's objective is minimizing the procurement costs thr...
