442 CHAPTER 22 MPFC as reinforcement learning regulator
نویسندگان
چکیده
Converging evidence suggest that the medial prefrontal cortex (MPFC) is involved in feedback categorization, performance monitoring, and task monitoring, and may contribute to the online regulation of reinforcement learning (RL) parameters that would affect decision-making processes in the lateral prefrontal cortex (LPFC). Previous neurophysiological experiments have shownMPFC activities encoding error likelihood, uncertainty, reward volatility, as well as neural responses categorizing different types of feedback, for instance, distinguishing between choice errors and execution errors. Rushworth and colleagues have proposed that the involvement ofMPFC in tracking the volatility of the task could contribute to the regulation of one of RL parameters called the learning rate. We extend this hypothesis by proposing that MPFC could contribute to the regulation of other RL parameters such as the exploration rate and default action values in case of task shifts. Here, we analyze the sensitivity to RL parameters of behavioral performance in two monkey decision-making tasks, one with a deterministic reward schedule and the other with a stochastic one. We show that there exist optimal parameter values specific to each of these tasks, that need to be found for optimal performance and that are usually handtuned in computational models. In contrast, automatic online regulation of these parameters using some heuristics can help producing a good, although non-optimal, behavioral performance in each task. We finally describe our computational model of MPFC–LPFC interaction used for online regulation of the exploration rate and its application to a human–robot interaction scenario. There, unexpected uncertainties are produced by the human introducing cued task changes or by cheating. The model enables the robot to autonomously learn to reset exploration in response to such uncertain cues and events. The combined results provide concrete evidence specifying how prefrontal cortical subregionsmay cooperate to regulate RL parameters. It also shows how such neurophysiologically inspired mechanisms can control advanced robots in the real Progress in Brain Research, Volume 202, ISSN 0079-6123, http://dx.doi.org/10.1016/B978-0-444-62604-2.00022-8 © 2013 Elsevier B.V. All rights reserved. 441 442 CHAPTER 22 MPFC as reinforcement learning regulator Author's personal copy world. Finally, themodel’s learningmechanisms that were challenged in the last robotic scenario provide testable predictions on the way monkeys may learn the structure of the task during the pretraining phase of the previous laboratory experiments.
منابع مشابه
Mediodorsal Thalamic Neurons Mirror the Activity of Medial Prefrontal Neurons Responding to Movement and Reinforcement during a Dynamic DNMTP Task
The mediodorsal nucleus (MD) interacts with medial prefrontal cortex (mPFC) to support learning and adaptive decision-making. MD receives driver (layer 5) and modulatory (layer 6) projections from PFC and is the main source of driver thalamic projections to middle cortical layers of PFC. Little is known about the activity of MD neurons and their influence on PFC during decision-making. We recor...
متن کاملMedial prefrontal cortex and the adaptive regulation of reinforcement learning parameters 22
Converging evidence suggest that the medial prefrontal cortex (MPFC) is involved in feedback categorization, performance monitoring, and task monitoring, and may contribute to the online regulation of reinforcement learning (RL) parameters that would affect decision-making processes in the lateral prefrontal cortex (LPFC). Previous neurophysiological experiments have shownMPFC activities encodi...
متن کاملA general role for medial prefrontal cortex in event prediction
A recent computational neural model of medial prefrontal cortex (mPFC), namely the predicted response-outcome (PRO) model (Alexander and Brown, 2011), suggests that mPFC learns to predict the outcomes of actions. The model accounted for a wide range of data on the mPFC. Nevertheless, numerous recent findings suggest that mPFC may signal predictions and prediction errors even when the predicted ...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملMedial prefrontal cortex and the adaptive regulation of reinforcement learning parameters.
Converging evidence suggest that the medial prefrontal cortex (MPFC) is involved in feedback categorization, performance monitoring, and task monitoring, and may contribute to the online regulation of reinforcement learning (RL) parameters that would affect decision-making processes in the lateral prefrontal cortex (LPFC). Previous neurophysiological experiments have shown MPFC activities encod...
متن کامل