regret minimization

Search results for: regret minimization

Number of results: 37822

Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments

2001

Shie Mannor, Nahum Shimkin

Sampling of Alternatives in Random Regret Minimization Models

Journal: Transportation Science, 2016

Cristian Angelo Guevara, Caspar G. Chorus, Moshe E. Ben-Akiva

Sampling of alternatives is often required in discrete choice models to reduce the computational burden and to avoid describing a large number of attributes. This approach has been used in many areas, including modeling of route choice, vehicle ownership, trip destination, residential location, and activity scheduling. The need for sampling of alternatives is accentuated for Random Regret Minim...

Near-Optimal Design of Experiments via Regret Minimization

2017

Zeyuan Allen-Zhu, Yuanzhi Li, Aarti Singh, Yining Wang

We consider computationally tractable methods for the experimental design problem, where k out of n design points of dimension p are selected so that certain optimality criteria are approximately satisfied. Our algorithm finds a (1 + ε)-approximate optimal design when k is a linear function of p; in contrast, existing results require k to be super-linear in p. Our algorithm also handles all popu...

Hedging Under Uncertainty: Regret Minimization Meets Exponentially Fast Convergence

2017

Johanne Cohen, Amélie Héliou, Panayotis Mertikopoulos

This paper examines the problem of multi-agent learning in N-person non-cooperative games. For concreteness, we focus on the so-called “hedge” variant of the exponential weights (EW) algorithm, one of the most widely studied algorithmic schemes for regret minimization in online learning. In this multi-agent context, we show that a) dominated strategies become extinct (a.s.); and b) in generic g...

Robust approachability and regret minimization in games with partial monitoring

2011

Shie Mannor, Vianney Perchet, Gilles Stoltz

Approachability has become a standard tool in analyzing learning algorithms in the adversarial online learning setup. We develop a variant of approachability for games where there is ambiguity in the obtained reward that belongs to a set, rather than being a single vector. Using this variant we tackle the problem of approachability in games with partial monitoring and develop simple and efficie...

Multi-Agent Counterfactual Regret Minimization for Partial-Information Collaborative Games

2017

Matthew Hartley, Stephan Zheng

We study the generalization of counterfactual regret minimization (CFR) to partial-information collaborative games with more than 2 players. For instance, many 4-player card games are structured as 2v2 games, with each player only knowing the contents of their own hand. To study this setting, we propose a multi-agent collaborative version of Kuhn Poker. We observe that a straightforward applicat...

Random Regret Minimization: Exploration of a New Choice Model for Environmental and Resource Economics

Journal: Environmental and Resource Economics, 2011

Using counterfactual regret minimization to create competitive multiplayer poker agents

2010

Nicholas Abou Risk, Duane Szafron

Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers). A deterministic game has no chance element and in a perfect information game, all information is visible to all players. However, many real-world scenarios with competing agents are stochastic (non-deterministic) with...

MS&E 336 Lecture 14: Approachability and regret minimization

2007

Ramesh Johari

j≠i A_j. We let a_i denote a pure action for player i, and let s_i ∈ Δ(A_i) denote a mixed action for player i. We will typically view s_i as a vector in R^{A_i}, with s_i(a_i) equal to the probability that player i places on a_i. We let Π_i(a) denote the payoff to player i when the composite pure action vector is a, and by an abuse of notation also let Π_i(s) denote the expected payoff to player i when...

Solving Large Imperfect Information Games Using CFR+

Journal: CoRR, 2014

Oskari Tammelin

Counterfactual Regret Minimization and variants (e.g. Public Chance Sampling CFR and Pure CFR) have been known as the best approaches for creating approximate Nash equilibrium solutions for imperfect information games such as poker. This paper introduces CFR+, a new algorithm that typically outperforms the previously known algorithms by an order of magnitude or more in terms of computation time ...
