keywords reinforcement learning

نتایج جستجو برای: keywords reinforcement learning

تعداد نتایج: 2453256 فیلتر نتایج به سال:

Reward Hierarchical Temporal Memory Model for Memorizing and Computing Reward Prediction Error by Neocortex

2012

Hansol Choi Jun-Cheol Park Jae Hyun Lim Jae Young Jun Dae-Shik Kim

In humans and animals, reward prediction error encoded by dopamine systems is thought to be important in the temporal difference learning class of reinforcement learning (RL). With RL algorithms, many brain models have described the function of dopamine and related areas, including the basal ganglia and frontal cortex. In spite of this importance, how the reward prediction error itself is compu...

متن کامل

Feedback Control of Traffic Signal Network of Less Traffic Sensors by Help of Machine Learning

2012

Takumi Wakahara Sadayoshi Mikami

As a way of resolving vehicle congestion, there is a feedback control approach which models a traffic network as a discrete dynamical system and derives feedback gain for controlling green light times of each junction. Since the input is the sensory observed traffic flow of each link, and since the state equation models both the topology and the parameters of the network, it is effective for ad...

متن کامل

Markov Game Controller Design Algorithms

2012

Rajneesh Sharma M. Gopal

Markov games are a generalization of Markov decision process to a multi-agent setting. Two-player zero-sum Markov game framework offers an effective platform for designing robust controllers. This paper presents two novel controller design algorithms that use ideas from game-theory literature to produce reliable controllers that are able to maintain performance in presence of noise and paramete...

متن کامل

A comparison of direct and model-based reinforcement learning

1997

Christopher G. Atkeson Juan Carlos Santamaría

This paper compares direct reinforcement learning (no explicit model) and model-based reinforcement learning on a simple task: pendulum swing up. We nd that in this task model-based approaches support reinforcement learning from smaller amounts of training data and eecient handling of changing goals.

متن کامل

A Survey on Reinforcement Learning in Global Optimization

1998

Shuangyu Shawn Chang

This report surveys recent development on the global combinatorial optimization using reinforcement learning methods. It introduces the general background of combinatorial optimization problems and reinforcement learning techniques, describes observations and previous works in this area, and focuses on Boyan and Moore's recent work, the STAGE algorithm with the assistant of reinforcement learning.

متن کامل

Incorporating Prior Knowledge and Previously Learned Information into Reinforcement Learning Agents

2000

Kevin R. Dixon Richard J. Malak Pradeep K. Khosla

Reinforcement learning has received much attention in the past decade. The primary thrust of this research has focused on tabula rasa learning methods. That is, the learning agent is initially unaware of its environment and must learn or re-learn everything. We feel that this is neither realistic nor effective. While the agent may start out with little or no knowledge of its environment, it mus...

متن کامل

Negative reinforcement learning is affected in substance dependence.

Journal: :Drug and alcohol dependence 2012

Laetitia L Thompson Eric D Claus Susan K Mikulich-Gilbertson Marie T Banich Thomas Crowley Theodore Krmpotich David Miller Jody Tanabe

BACKGROUND Negative reinforcement results in behavior to escape or avoid an aversive outcome. Withdrawal symptoms are purported to be negative reinforcers in perpetuating substance dependence, but little is known about negative reinforcement learning in this population. The purpose of this study was to examine reinforcement learning in substance dependent individuals (SDI), with an emphasis on ...

متن کامل

On-Line Learning Control by Association and Reinforcement

Journal: :IEEE transactions on neural networks 2000

Jennie Si Yu-Tsung Wang

This paper focuses on a systematic treatment for developing a generic online learning control system based on the fundamental principle of reinforcement learning or more specifically neural dynamic programming. This online learning system improves its performance over time in two aspects: 1) it learns from its own mistakes through the reinforcement signal from the external environment and tries...

متن کامل

cient Exploration In Reinforcement Learning Sebastian

1992

Sebastian B. Thrun

Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in nite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. Whil...

متن کامل

Intelligent Tutoring Systems using Reinforcement Learning to teach Autistic Students

2007

B. H. Sreenivasa Sarma Balaraman Ravindran

Many Intelligent Tutoring Systems have been developed using different Artificial Intelligence techniques. In this paper we propose to use Reinforcement Learning for building an intelligent tutoring system to teach autistic students, who can't communicate well with others. In reinforcement learning, a policy is updated for taking appropriate action to teach the student. The main advantage of usi...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید

" +i+ "

"; pgn_html+=pgn_li; } document.getElementById("pgn-ul").innerHTML=pgn_html var pgn_links = document.querySelectorAll('.mypgn'); pgn_links.forEach(function(pgn_link) { pgn_link.addEventListener('click', paginate) }) } function post_and_fetch(data,url) { showLoading() xhr = new XMLHttpRequest(); xhr.open('POST', url, true); xhr.setRequestHeader('Content-Type', 'application/json; charset=UTF-8'); xhr.onreadystatechange = function() { if (xhr.readyState === 4 && xhr.status === 200) { var resp = xhr.responseText; resp_json=JSON.parse(resp) resp_place = document.getElementById("search_result_div") resp_place.innerHTML = resp_json['results'] search_meta = resp_json['meta'] update_search_meta(search_meta) update_pagination() hideLoading() } }; xhr.send(JSON.stringify(data)); } function unfilter() { url=/search_year_filter/ var term=document.getElementById("search_meta_data").dataset.term var data={ "year":"unfilter", "term":term, "pgn":1 } filtered_res=post_and_fetch(data,url) } function deactivate_all_bars(){ var yrchart = document.querySelectorAll('.ct-bar'); yrchart.forEach(function(bar) { bar.dataset.active = false bar.style = "stroke:#71a3c5;" }) } year_chart.on("created", function() { var yrchart = document.querySelectorAll('.ct-bar'); yrchart.forEach(function(check) { check.addEventListener('click', checkIndex); }) }); function checkIndex(event) { var yrchart = document.querySelectorAll('.ct-bar'); var year_bar = event.target if (year_bar.dataset.active == "true") { unfilter_res = unfilter() year_bar.dataset.active = false year_bar.style = "stroke:#1d2b3699;" } else { deactivate_all_bars() year_bar.dataset.active = true year_bar.style = "stroke:#e56f6f;" filter_year = chart_data['labels'][Array.from(yrchart).indexOf(year_bar)] url=/search_year_filter/ var term=document.getElementById("search_meta_data").dataset.term var data={ "year":filter_year, "term":term, "pgn":1 } filtered_res=post_and_fetch(data,url) } } function showLoading() { document.getElementById("loading").style.display = "block"; setTimeout(hideLoading, 10000); // 10 seconds } function hideLoading() { document.getElementById("loading").style.display = "none"; } -->