نتایج جستجو برای: regret

تعداد نتایج: 5407  

Journal: :Electronic Colloquium on Computational Complexity (ECCC) 2010
Sascha Geulen Berthold Vöcking Melanie Winkler

Suppose a decision maker has to purchase a commodity over time with varying prices and demands. In particular, the price per unit might depend on the amount purchased and this price function might vary from step to step. The decision maker has a buffer of bounded size for storing units of the commodity that can be used to satisfy demands at later points in time. We seek for an algorithm decidin...

2011
Mikhail Kapralov Rina Panigrahy

Consider a sequence of bits where we are trying to predict the next bit from the previous bits. Assume we are allowed to say ‘predict 0’ or ‘predict 1’, and our payoff is +1 if the prediction is correct and −1 otherwise. We will say that at each point in time the loss of an algorithm is the number of wrong predictions minus the number of right predictions so far. In this paper we are interested...

Journal: :Science 2005
David M Eagleman

of the Orbitofrontal Cortex in the Experience of Regret’’ Regret is an emotion that accompanies negative outcomes to decisions for which we have been responsible (1). Given a choice between two gambles, subjects will experience regret if the unchosen gamble yields a higher reward than the chosen gamble (2). Camille et al. (3) hypothesized that the anticipation of regret is useful for steering d...

2013
Ofer Dekel Elad Hazan

We consider regret minimization in adversarial deterministic Markov Decision Processes (ADMDPs) with bandit feedback. We devise a new algorithm that pushes the state-of-theart forward in two ways: First, it attains a regret of O(T ) with respect to the best fixed policy in hindsight, whereas the previous best regret bound was O(T ). Second, the algorithm and its analysis are compatible with any...

2007
Mmboniseni Mulaudzi Ilse Schoeman

We model how decisions about the allocation of available funds to loans and Treasuries are dependent on perceptions about risk and regret in the banking industry. Our discussion is based on utility theory where a regret attribute is considered alongside a risk component as an integral part of the objective function. In addition, we provide a comparison between a riskand a regret-averse banks. I...

2013
Jeremiah Blocki Nicolas Christin Anupam Datta Arunesh Sinha

Online learning algorithms that minimize regret provide strong guarantees in situations that involve repeatedly making decisions in an uncertain environment, e.g. a driver deciding what route to drive to work every day. While regret minimization has been extensively studied in repeated games, we study regret minimization for a richer class of games called bounded memory games. In each round of ...

Journal: :The Annals of Applied Probability 1993

Journal: :Mathematics of Operations Research 2006

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید