نتایج جستجو برای: minimax regret

تعداد نتایج: 12162  

2015
Jacob D. Abernethy Chansoo Lee Ambuj Tewari

We provide a new analysis framework for the adversarial multi-armed bandit problem. Using the notion of convex smoothing, we define a novel family of algorithms with minimax optimal regret guarantees. First, we show that regularization via the Tsallis entropy, which includes EXP3 as a special case, matches the O( √ NT ) minimax regret with a smaller constant factor. Second, we show that a wide ...

2003
Yonina C. Eldar Neri Merhav

We consider the problem of estimating, in the presence of model uncertainties, a random vector x that is observed through a linear transformation H and corrupted by additive noise. We first assume that both the covariance of x and the transformation H are not completely specified, and develop the linear estimator that minimizes the worst-case mean-squared error (MSE) across all possible covaria...

2017
Yasin Abbasi-Yadkori Peter L. Bartlett Victor Gabillon

We study minimax strategies for the online prediction problem with expert advice. It has been conjectured that a simple adversary strategy, called COMB, is near optimal in this game for any number of experts. Our results and new insights make progress in this direction by showing that, up to a small additive term, COMB is minimax optimal in the finite-time three expert problem. In addition, we ...

2014
Ofer Dekel Jian Ding Tomer Koren Yuval Peres

We study a new class of online learning problems where each of the online algorithm’s actions is assigned an adversarial value, and the loss of the algorithm at each step is a known and deterministic function of the values assigned to its recent actions. This class includes problems where the algorithm’s loss is the minimum over the recent adversarial values, the maximum over the recent values,...

2015
Sébastien Bubeck Ofer Dekel Tomer Koren Yuval Peres

We analyze the minimax regret of the adversarial bandit convex optimization problem. Focusing on the one-dimensional case, we prove that the minimax regret is Θ̃( √ T ) and partially resolve a decade-old open problem. Our analysis is non-constructive, as we do not present a concrete algorithm that attains this regret rate. Instead, we use minimax duality to reduce the problem to a Bayesian setti...

2006
Karl H. Schlag

A decision maker has to recommend a treatment, knows that any outcome will be in [0; 1] but only has minimal information about the likelihood of outcomes (there is no prior). The decision maker can design a …nite number of experiments in which treatments are tested. For the case of two treatments we present a rule for designing experiments and making a recommendation that attains minimax regret...

2004
Nathanael Hyafil Craig Boutilier

Mechanism design has found considerable application to the construction of agent-interaction protocols. In the standard setting, the type (e.g., utility function) of an agent is not known by other agents, nor is it known by the mechanism designer. When this uncertainty is quantified probabilistically, a mechanism induces a game of incomplete information among the agents. However, in many settin...

2005
Nathanael Hyafil Craig Boutilier

In the standard mechanism design setting, the type (e.g., utility function) of an agent is not known by other agents, nor is it known by the mechanism designer. When this uncertainty is quantified probabilistically, a mechanism induces a game of incomplete information among the agents. However, in many settings, uncertainty over utility functions cannot easily be quantified. We consider the pro...

2015
Sébastien Bubeck Ofer Dekel Tomer Koren Yuval Peres

We analyze the minimax regret of the adversarial bandit convex optimization problem. Focusing on the one-dimensional case, we prove that the minimax regret is Θ̃( √ T ) and partially resolve a decade-old open problem. Our analysis is non-constructive, as we do not present a concrete algorithm that attains this regret rate. Instead, we use minimax duality to reduce the problem to a Bayesian setti...

Journal: :CoRR 2013
Alexander Rakhlin Karthik Sridharan Alexandre B. Tsybakov

We consider the random design regression with square loss. We propose a method that aggregates empirical minimizers (ERM) over appropriately chosen random subsets and reduces to ERM in the extreme case, and we establish exact oracle inequalities for its risk. We show that, under the −p growth of the empirical -entropy, the excess risk of the proposed method attains the rate n− 2 2+p for p ∈ (0,...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید