regret analysis

Bandit Convex Optimization: √T Regret in One Dimension

2015

Sébastien Bubeck Ofer Dekel Tomer Koren Yuval Peres

We analyze the minimax regret of the adversarial bandit convex optimization problem. Focusing on the one-dimensional case, we prove that the minimax regret is Θ̃(√T) and partially resolve a decade-old open problem. Our analysis is non-constructive, as we do not present a concrete algorithm that attains this regret rate. Instead, we use minimax duality to reduce the problem to a Bayesian setti...

Treatment Decision Regret Among Long-Term Survivors of Localized Prostate Cancer: Results From the Prostate Cancer Outcomes Study.

Journal: Journal of Clinical Oncology: Official Journal of the American Society of Clinical Oncology, 2017

Richard M Hoffman Mary Lo Jack A Clark Peter C Albertsen Michael J Barry Michael Goodman David F Penson Janet L Stanford Antoinette M Stroup Ann S Hamilton

Purpose: To determine the demographic, clinical, decision-making, and quality-of-life factors that are associated with treatment decision regret among long-term survivors of localized prostate cancer. Patients and Methods: We evaluated men who were age ≤ 75 years when diagnosed with localized prostate cancer between October 1994 and October 1995 in one of six SEER tumor registries and who complet...

Advance Selling When Consumers Regret

Journal: Management Science, 2012

Javad Nasiry Ioana Popescu

We characterize the effect of anticipated regret on consumer decisions and on firm profits and policies in an advance selling context where buyers have uncertain valuations. Advance purchases trigger action regret if valuations turn out to be lower than the price paid, whereas delaying purchase may cause inaction regret from missing a discount or facing a stockout. Consumers whom we describe as ...

Regret Aversion, Regret Neutrality, and Risk Aversion in Production

Journal: SSRN Electronic Journal, 2017

An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits

2016

Peter Auer Chao-Kai Chiang

We present an algorithm that achieves almost optimal pseudo-regret bounds against adversarial and stochastic bandits. Against adversarial bandits the pseudo-regret is O(K√(n log n)) and against stochastic bandits the pseudo-regret is O(∑_i (log n)/Δ_i). We also show that no algorithm with O(log n) pseudo-regret against stochastic bandits can achieve Õ(√n) expected regret against adaptive...

Cultural grounding of regret: Regret in self and interpersonal contexts

Journal: Cognition & Emotion, 2011

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

Journal: Foundations and Trends® in Machine Learning, 2012

Minimizing Simple and Cumulative Regret in Monte-Carlo Tree Search

2014

Tom Pepels Tristan Cazenave Mark H. M. Winands Marc Lanctot

Regret minimization is important in both the Multi-Armed Bandit problem and Monte-Carlo Tree Search (MCTS). Recently, simple regret, i.e., the regret of not recommending the best action, has been proposed as an alternative to cumulative regret in MCTS, i.e., regret accumulated over time. Each type of regret is appropriate in different contexts. Although the majority of MCTS research applies the...

Regret and Regulation

Journal: The Geneva Risk and Insurance Review, 2014
