regret minimization

Search results for: regret minimization

Number of results: 37822

Regret Minimization Under Partial Monitoring

Journal: Mathematics of Operations Research, 2006

Algorithms for Average Regret Minimization

Journal: Proceedings of the AAAI Conference on Artificial Intelligence, 2019

Risk minimization, regret minimization and progressive hedging algorithms

Journal: Mathematical Programming, 2020

Decision making using minimization of regret

Journal: International Journal of Approximate Reasoning, 2004

Meta-Learning for Simple Regret Minimization

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence, 2023

We develop a meta-learning framework for simple regret minimization in bandits. In this framework, a learning agent interacts with a sequence of bandit tasks, which are sampled i.i.d. from an unknown prior distribution, and learns its meta-parameters to perform better on future tasks. We propose the first Bayesian and frequentist meta-learning algorithms for this setting. The Bayesian algorithm has access to a prior distribution over the meta-parameters, and its meta simple regret over m bandit tasks ...

Regret Minimization in Discounted-Sum Games

Journal: Electronic Proceedings in Theoretical Computer Science, 2020

A Generalized Random Regret Minimization Model

Journal: SSRN Electronic Journal, 2013

Competing With Strategies

2013

Wei Han Alexander Rakhlin Karthik Sridharan

We study the problem of online learning with a notion of regret defined with respect to a set of strategies. We develop tools for analyzing the minimax rates and for deriving regret-minimization algorithms in this scenario. While the standard methods for minimizing the usual notion of regret fail, through our analysis we demonstrate existence of regret-minimization methods that compete with suc...

Efficient Constrained Regret Minimization

Journal: CoRR, 2012

Mehrdad Mahdavi Tianbao Yang Rong Jin

Online learning constitutes a mathematical and compelling framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the learner receives a reward for the played action. The goal of the learner is to maximize his total reward. However, there are situations in which, in additio...

Iterated regret minimization: A new solution concept

Journal: Games and Economic Behavior, 2012
