نتایج جستجو برای: q learning
تعداد نتایج: 717428 فیلتر نتایج به سال:
Chain Form Reinforcement Learning (CFRL) was proposed for a reinforcement learning agent using low memory. In this paper, we introduce Sneak Form Reinforcement Learning (SFRL). SFRL is the method where we improve CFRL in terms of Contextual Learning. If a sequence of state-action pairs has a shortest path, a SFRL agent cuts and saves the path. To improve the performance of SFRL, we introduce Ma...
We present some results of our research in the field of Machine Learning applied to robotics problems. In particular we have investigated on: (i) the application of Learning Classifier Systems to the synthesis of robot controllers; (ii) learning of fuzzy controllers; (iii) learning of purposeful representations of the environment; (iv) and the application of versions of Q-learning to robot trai...
Q-Iearning is a reinforcement learning alg()rithm that learns expected utilities for stateaction transitions through successive interactions with the environment The algorithm '5 simplicity as well as its convergence properties have made it a popular algorithm for study However; its non-parametric representation of utilities limits its effectiveness in environments with large amounts of percept...
This work considers a stateless Q-learning agent in iterated Prisoner’s Dilemma (PD). We have already given a condition of PD payoffs and Q-learning parameters that helps stateless Q-learning agents cooperate with each other [2]. That condition, however, has a restrictive premise. This work relaxes the premise and shows a new payoff condition for mutual cooperation. After that, we derive the pa...
This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate rewards using a variation of Q-Learning algorithm. Unlike the conventional Q-Learning, the proposed algorithm compares current reward with immediate reward of past move and work accordingly. Relative reward based Q-learning is an approach towards interactive learning. Q-Learning is a model free re...
Baselines for Joint-Action Reinforcement Learning of Coordination in Cooperative Multi-agent Systems
We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. Specifically, we focus on a novel action selection strategy for Q-learning (Watkins 1989). The new technique is applicable to scenarios where mutual observation of actions is not possible. To date, reinforcement learning approaches for such independent agents di...
This paper introduces Progressive Reinforcement Learning, which augments standard Q-Learning with a mechanism for transferring experience gained in one problem to new but related problems. In this approach, an agent acquires experience of operating in a simple domain through experimentation. It then engages in a period of introspection, during which it rationalises the experience gained and for...
We study the problem of learning near-optimal behavior in finite Markov Decision Processes (MDPs) with a polynomial number of samples. These “PAC-MDP” algorithms include the wellknown E3 and R-MAX algorithms as well as the more recent Delayed Q-learning algorithm. We summarize the current state-of-the-art by presenting bounds for the problem in a unified theoretical framework. A more refined an...
One way to speed up reinforcement learning is to enable learning to happen simultaneously at multiple resolutions in space and time. This paper shows how to create a Q-learning managerial hierarchy in which high level managers learn how to set tasks to their sub-managers who, in turn, learn how to satisfy them. Sub-managers need not initially understand their managers’ commands. They simply lea...
Despite the advancement of research and development on multi-robot teams, a key challenge still remains as to how to develop effective mechanisms that enable the robots to autonomously generate, adapt, and enhance team behaviours while improving their individual performance simultaneously. After a literature review of various multi-agent learning approaches, the two most promising learning para...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید