Search results for: minimax regret
Number of results: 12162
We provide a new analysis framework for the adversarial multi-armed bandit problem. Using the notion of convex smoothing, we define a novel family of algorithms with minimax optimal regret guarantees. First, we show that regularization via the Tsallis entropy, which includes EXP3 as a special case, matches the O(√NT) minimax regret with a smaller constant factor. Second, we show that a wide ...
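For orientation, here is a minimal sketch of the loss-based EXP3 update that the abstract names as a special case. The learning rate, variable names, and the omission of explicit uniform-exploration mixing are our simplifications, not the paper's construction:

```python
import numpy as np

def exp3(T, N, loss_fn, eta, rng=None):
    """Minimal EXP3 sketch for the adversarial N-armed bandit.

    loss_fn(t, arm) -> loss in [0, 1] chosen by the adversary.
    A learning rate eta ~ sqrt(log(N) / (N * T)) gives the classical
    O(sqrt(N T log N)) expected-regret bound.
    """
    rng = rng or np.random.default_rng(0)
    log_w = np.zeros(N)                       # log-weights, for numerical stability
    total_loss = 0.0
    for t in range(T):
        p = np.exp(log_w - log_w.max())
        p /= p.sum()                          # exponential-weights distribution
        arm = rng.choice(N, p=p)
        loss = loss_fn(t, arm)
        total_loss += loss
        log_w[arm] -= eta * loss / p[arm]     # importance-weighted loss estimate
    return total_loss

# Example: arm 3 always has loss 0, every other arm has loss 1.
print(exp3(T=2000, N=10, loss_fn=lambda t, a: 0.0 if a == 3 else 1.0,
           eta=np.sqrt(np.log(10) / (10 * 2000))))
```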
We consider the problem of estimating, in the presence of model uncertainties, a random vector x that is observed through a linear transformation H and corrupted by additive noise. We first assume that both the covariance of x and the transformation H are not completely specified, and develop the linear estimator that minimizes the worst-case mean-squared error (MSE) across all possible covaria...
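The worst-case-MSE criterion in this abstract can be illustrated by brute force over a finite set of candidate covariances; the closed-form minimax estimator the paper develops is not reproduced here, and the helper names below are ours:

```python
import numpy as np

def lmmse(C_x, H, C_w):
    """Linear MMSE estimator matrix G for y = H x + w: x_hat = G y."""
    return C_x @ H.T @ np.linalg.inv(H @ C_x @ H.T + C_w)

def mse(G, C_x, H, C_w):
    """MSE of the linear estimator x_hat = G y under covariances (C_x, C_w)."""
    A = G @ H - np.eye(C_x.shape[0])
    return np.trace(A @ C_x @ A.T + G @ C_w @ G.T)

def minimax_lmmse(candidates, H, C_w):
    """Brute-force minimax: among LMMSE estimators matched to each candidate
    covariance of x, pick the one with the smallest worst-case MSE."""
    Gs = [lmmse(C, H, C_w) for C in candidates]
    worst = [max(mse(G, C, H, C_w) for C in candidates) for G in Gs]
    return Gs[int(np.argmin(worst))]

# Example: a scalar observation of a 2-d vector, two candidate covariances.
H = np.array([[1.0, 0.0]])
C_w = np.array([[0.1]])
G = minimax_lmmse([np.eye(2), 2 * np.eye(2)], H, C_w)
```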
We study minimax strategies for the online prediction problem with expert advice. It has been conjectured that a simple adversary strategy, called COMB, is near optimal in this game for any number of experts. Our results and new insights make progress in this direction by showing that, up to a small additive term, COMB is minimax optimal in the finite-time three expert problem. In addition, we ...
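The COMB adversary mentioned above is simple enough to sketch: it orders the experts by cumulative loss and assigns alternating losses down that ranking, like the teeth of a comb. Tie-breaking and which parity the leader receives vary across analyses; this is one illustrative convention, not the paper's exact construction:

```python
import numpy as np

def comb_losses(cum_loss):
    """COMB adversary sketch: sort experts from leading (smallest cumulative
    loss) to trailing, then assign alternating 0/1 losses across that order."""
    order = np.argsort(cum_loss)                    # best expert first
    losses = np.zeros_like(cum_loss, dtype=float)
    losses[order] = np.arange(len(cum_loss)) % 2    # 0,1,0,1,... down the ranking
    return losses

print(comb_losses(np.array([3.0, 1.0, 2.0])))
# [0. 0. 1.]: the leader (index 1) gets 0, alternating down the ranking
```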
We study a new class of online learning problems where each of the online algorithm’s actions is assigned an adversarial value, and the loss of the algorithm at each step is a known and deterministic function of the values assigned to its recent actions. This class includes problems where the algorithm’s loss is the minimum over the recent adversarial values, the maximum over the recent values,...
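A tiny example of the loss structure this abstract describes, with the window length and combining function as illustrative choices of ours:

```python
def step_loss(recent_values, combine=min):
    """Composite-loss sketch: the adversary assigns a value to each of the
    learner's actions, and the loss at a step is a known deterministic
    function (e.g. min or max) of the values of the recent actions."""
    return combine(recent_values)

# Adversarial values of the last few actions; loss over a window of 3.
history = [0.7, 0.2, 0.9, 0.4]
print(step_loss(history[-3:], combine=min))  # 0.2
print(step_loss(history[-3:], combine=max))  # 0.9
```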
We analyze the minimax regret of the adversarial bandit convex optimization problem. Focusing on the one-dimensional case, we prove that the minimax regret is Θ̃(√T) and partially resolve a decade-old open problem. Our analysis is non-constructive, as we do not present a concrete algorithm that attains this regret rate. Instead, we use minimax duality to reduce the problem to a Bayesian setti...
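The duality step the abstract alludes to can be written as an exchange of infimum and supremum; a standard statement of this kind (not the paper's exact theorem) is

\[
\inf_{\text{alg}} \; \sup_{\text{adversary}} \; \mathbb{E}\!\left[\mathrm{Regret}_T\right]
\;=\;
\sup_{\pi \,\in\, \Delta(\text{adversaries})} \; \inf_{\text{alg}} \;
\mathbb{E}_{\pi}\!\left[\mathrm{Regret}_T\right],
\]

so bounding the minimax regret reduces to bounding the Bayesian regret against a worst-case prior π, for which no explicit algorithm needs to be exhibited.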
A decision maker has to recommend a treatment, knows that any outcome will be in [0, 1] but only has minimal information about the likelihood of outcomes (there is no prior). The decision maker can design a finite number of experiments in which treatments are tested. For the case of two treatments we present a rule for designing experiments and making a recommendation that attains minimax regret...
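As an illustrative finite version of this setting (Bernoulli outcomes, n trials per treatment, a grid over the unknown means; not the paper's optimal experiment design), the worst-case expected regret of a candidate recommendation rule can be computed by brute force:

```python
import numpy as np
from math import comb
from itertools import product

def minimax_regret(rule, n, grid=21):
    """Worst-case expected regret of a rule for two Bernoulli treatments,
    each tested n times. rule(s1, s2) -> probability of recommending
    treatment 1 after observing s1 and s2 successes out of n trials."""
    worst = 0.0
    for p1 in np.linspace(0, 1, grid):
        for p2 in np.linspace(0, 1, grid):
            exp_regret = 0.0
            for s1, s2 in product(range(n + 1), repeat=2):
                prob = (comb(n, s1) * p1**s1 * (1 - p1)**(n - s1)
                        * comb(n, s2) * p2**s2 * (1 - p2)**(n - s2))
                r1 = rule(s1, s2)
                chosen_mean = r1 * p1 + (1 - r1) * p2
                exp_regret += prob * (max(p1, p2) - chosen_mean)
            worst = max(worst, exp_regret)
    return worst

# Empirical-success rule: recommend whichever treatment performed better.
print(minimax_regret(lambda s1, s2: 1.0 if s1 > s2 else (0.5 if s1 == s2 else 0.0), n=5))
```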
Mechanism design has found considerable application to the construction of agent-interaction protocols. In the standard setting, the type (e.g., utility function) of an agent is not known by other agents, nor is it known by the mechanism designer. When this uncertainty is quantified probabilistically, a mechanism induces a game of incomplete information among the agents. However, in many settin...
In the standard mechanism design setting, the type (e.g., utility function) of an agent is not known by other agents, nor is it known by the mechanism designer. When this uncertainty is quantified probabilistically, a mechanism induces a game of incomplete information among the agents. However, in many settings, uncertainty over utility functions cannot easily be quantified. We consider the pro...
We consider the random design regression with square loss. We propose a method that aggregates empirical risk minimizers (ERM) over appropriately chosen random subsets and reduces to ERM in the extreme case, and we establish exact oracle inequalities for its risk. We show that, under the ε^(−p) growth of the empirical ε-entropy, the excess risk of the proposed method attains the rate n^(−2/(2+p)) for p ∈ (0,...
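A hedged sketch of the aggregation idea as described: fit ERM on random subsets of one half of the sample, then combine the fitted predictors with exponential weights from their held-out square loss. The subset scheme, weighting, and base class below are illustrative assumptions, not the paper's "appropriately chosen" construction:

```python
import numpy as np

def fit_ls(X, y):
    """Least-squares ERM over linear predictors (illustrative base class)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return lambda Xn: Xn @ beta

def aggregate_erm(X, y, fit_erm=fit_ls, n_subsets=10, subset_frac=0.5,
                  temperature=1.0, rng=None):
    """Fit ERM on random subsets of the first half of the data, then
    aggregate the fitted predictors with exponential weights computed
    from their square loss on the held-out second half."""
    rng = rng or np.random.default_rng(0)
    n = len(y) // 2
    (X1, y1), (X2, y2) = (X[:n], y[:n]), (X[n:], y[n:])
    fs, risks = [], []
    for _ in range(n_subsets):
        idx = rng.choice(n, size=max(1, int(subset_frac * n)), replace=False)
        f = fit_erm(X1[idx], y1[idx])               # ERM on a random subset
        fs.append(f)
        risks.append(np.mean((f(X2) - y2) ** 2))    # held-out square loss
    w = np.exp(-temperature * n * (np.array(risks) - min(risks)))  # shifted for stability
    w /= w.sum()
    return lambda Xn: sum(wi * fi(Xn) for wi, fi in zip(w, fs))
```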