Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning
نویسندگان
چکیده
منابع مشابه
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. The paper investigates a distributed reinforcement learning setup with no prior information on the global state transition and local ag...
متن کاملManifold Representations for Value-Function Approximation in Reinforcement Learning
Reinforcement learning (RL) has shown itself to be a successful paradigm for solving optimal control problems. However, that success has been mostly limited to problems with a finite set of states and actions. The problem of extending reinforcement learning techniques to the continuous state case has received quite a bit of attention in the last few years. One approach to solving reinforcement ...
متن کاملExplicit Manifold Representations for Value-Function Approximation in Reinforcement Learning
We are interested in using reinforcement learning for large, real-world control problems. In particular, we are interested in problems with continuous, multi-dimensional state spaces, in which traditional reinforcement learning approached perform poorly. Value-function approximation addresses some of the problems of traditional algorithms (for example, continuous state spaces), and has been sho...
متن کاملCBR for State Value Function Approximation in Reinforcement Learning
CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agent is faced with the problem of assessing the desirability of the state it finds itself in. If the state space is very large and/or continuous the availability of a suitable mechanism to approximate a value function – ...
متن کاملStatistical Linearization for Value Function Approximation in Reinforcement Learning
Reinforcement learning (RL) is a machine learning answer to the optimal control problem. It consists in learning an optimal control policy through interactions with the system to be controlled, the quality of this policy being quantified by the so-called value function. An important RL subtopic is to approximate this function when the system is too large for an exact representation. This paper ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Control of Network Systems
سال: 2021
ISSN: 2325-5870,2372-2533
DOI: 10.1109/tcns.2021.3061909