markov games

نتایج جستجو برای: markov games

تعداد نتایج: 126585 فیلتر نتایج به سال:

utilizing generalized learning automata for finding optimal policies in mmdps

Journal: :journal of computer and robotics 0

samaneh assar faculty of computer and information technology engineering, qazvin branch, islamic azad university, qazvin, iran behrooz masoumi faculty of computer and information technology engineering, qazvin branch, islamic azad university, qazvin, iran

multi agent markov decision processes (mmdps), as the generalization of markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for multi agent reinforcement learning. in this paper, a generalized learning automata based algorithm for finding optimal policies in mmdp is proposed. in the proposed algorithm, mmdp ...

متن کامل

Markov Decision Evolutionary Games

Journal: :IEEE Transactions on Automatic Control 2010

متن کامل

Schmidt games and Markov partitions

Journal: :Nonlinearity 2009

متن کامل

Some Remarks on Equilibria in Semi-markov Games

2011

A. S. Nowak

This paper is a first study of correlated equilibria in nonzerosum semi-Markov stochastic games. We consider the expected average payoff criterion under a strong ergodicity assumption on the transition structure of the games. The main result is an extension of the correlated equilibrium theorem proven for discounted (discrete-time) Markov games in our joint paper with Raghavan. We also provide ...

متن کامل

Value Function Approximation in Zero-Sum Markov Games

2002

Michail G. Lagoudakis Ronald Parr

This paper investigates value function approximation in the context of zero-sum Markov games, which can be viewed as a generalization of the Markov decision process (MDP) framework to the two-agent case. We generalize error bounds from MDPs to Markov games and describe generalizations of reinforcement learning algorithms to Markov games. We present a generalization of the optimal stopping probl...

متن کامل

Relational Markov Games

2004

Alberto Finzi Thomas Lukasiewicz

Towards a compact and elaboration-tolerant first-order representation of Markov games, we introduce relational Markov games, which combine standard Markov games with first-order action descriptions in a stochastic variant of the situation calculus. We focus on the zero-sum two-agent case, where we have two agents with diametrically opposed goals. We also present a symbolic value iteration algor...

متن کامل

Semi-markov strategies in stochastic games

Journal: :International Journal of Game Theory 1983

متن کامل

Correlated Q-Learning

2003

Amy Greenwald Keith Hall

Recently, there have been several attempts to design multiagent learning algorithms that learn equilibrium policies in general-sum Markov games, just as Q-learning learns optimal policies in Markov decision processes. This paper introduces correlated-Q learning, one such algorithm. The contributions of this paper are twofold: (i) We show empirically that correlated-Q learns correlated equilibri...

متن کامل

Rational and Convergent Model-Free Adaptive Learning for Team Markov Games1

2007

Francisco S. Melo M. Isabel Ribeiro Francisco A. Melo

In this paper, we address multi-agent decision problems where all agents share a common goal. This class of problems is suitably modeled using finite-state Markov games with identical interests. We tackle the problem of coordination and contribute a new algorithm, coordinated Qlearning (CQL). CQL combines Q-learning with biased adaptive play, a coordination mechanism based on the principle of f...

متن کامل

From Markov Chains to Stochastic Games

2003

ABRAHAM NEYMAN

Markov chains1 and Markov decision processes (MDPs) are special cases of stochastic games. Markov chains describe the dynamics of the states of a stochastic game where each player has a single action in each state. Similarly, the dynamics of the states of a stochastic game form a Markov chain whenever the players’ strategies are stationary. Markov decision processes are stochastic games with a ...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید