distributed reinforcement learning

نتایج جستجو برای: distributed reinforcement learning

تعداد نتایج: 868955 فیلتر نتایج به سال:

Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains

2010

Martha White Adam M. White

The reinforcement learning community has explored many approaches to obtaining value estimates and models to guide decision making; these approaches, however, do not usually provide a measure of confidence in the estimate. Accurate estimates of an agent’s confidence are useful for many applications, such as biasing exploration and automatically adjusting parameters to reduce dependence on param...

متن کامل

Investigating Reinforcement Learning in Multiagent Coalition Formation

2004

Xin Li Leen-Kiat Soh

In this paper we investigate the use of reinforcement learning to address the multiagent coalition formation problem in dynamic, uncertain, real-time, and noisy environments. To adapt to the complex environmental factors, we equip each agent with the case-based reinforcement learning ability which is the integration of case-based reasoning and reinforcement learning. The agent can use case-base...

متن کامل

A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms

Journal: :Neural computation 1999

Csaba Szepesvári Michael L. Littman

Reinforcement learning is the problem of generating optimal behavior in a sequential decision-making environment given the opportunity of interacting with it. Many algorithms for solving reinforcement-learning problems work by computing improved estimates of the optimal value function. We extend prior analyses of reinforcement-learning algorithms and present a powerful new theorem that can prov...

متن کامل

A New Nonlinear Reinforcement Scheme for Stochastic Learning Automata

2010

DANA SIMIAN FLORIN STOICA

Reinforcement schemes represent the basis of the learning process for stochastic learning automata, generating their learning behavior. An automaton using a reinforcement scheme can decide the best action, based on past actions and environment responses. The aim of this paper is to introduce a new reinforcement scheme for stochastic learning automata. We test our schema and compare with other n...

متن کامل

Virtual to Real Reinforcement Learning for Autonomous Driving

Journal: :CoRR 2017

Yurong You Xinlei Pan Ziyan Wang Cewu Lu

Reinforcement learning is considered as a promising direction for driving policy learning. However, training autonomous driving vehicle with reinforcement learning in real environment involves non-affordable trial-and-error. It is more desirable to first train in a virtual environment and then transfer to the real environment. In this paper, we propose a novel realistic translation network to m...

متن کامل

ارزیابی ریسک تقلب در مزایای بیمه بیکاری با رویکرد داده کاوی تفحصی

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه علامه طباطبایی - دانشکده اقتصاد 1393

سید جواد طباطبایی منش, آتوسا گودرزی,

due to extraordinary large amount of information and daily sharp increasing claimant for ui benefits and because of serious constraint of financial barriers, the importance of handling fraud detection in order to discover, control and predict fraudulent claims is inevitable. we use the most appropriate data mining methodology, methods, techniques and tools to extract knowledge or insights from ...

Reinforcement learning

Journal: :Scholarpedia 2008

متن کامل

Use of Reinforcement Learning as a Challenge: A Review

2013

Rashmi Sharma Manish Prateek Ashok K. Sinha

Reinforcement learning has its origin from the animal learning theory. RL does not require prior knowledge but can autonomously get optional policy with the help of knowledge obtained by trial-and-error and continuously interacting with the dynamic environment. Due to its characteristics of self improving and online learning, reinforcement learning has become one of intelligent agent’s core tec...

متن کامل

A Distributed Reinforcement Learning Approach for Solving Optimization Problems

2011

Gabriela Czibula Istvan-Gergely Czibula Maria Iuliana Bocicor

Combinatorial optimization is the seeking for one or more optimal solutions in a well defined discrete problem space. The optimization methods are of great importance in practice, particularly in the engineering design process, the scientific experiments and the business decision-making. We are investigating in this paper a distributed reinforcement learning based approach for solving combinato...

متن کامل

Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs

2013

Bikramjit Banerjee

Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a powerful modeling technique for realistic multi-agent coordination problems under uncertainty. Prevalent solution techniques are centralized and assume prior knowledge of the model. Recently a Monte Carlo based distributed reinforcement learning approach was proposed, where agents take turns to learn best response...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید