نتایج جستجو برای: regret minimization

تعداد نتایج: 37822  

Journal: :Mathematics of Operations Research 2006

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2019

Journal: :International Journal of Approximate Reasoning 2004

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

We develop a meta-learning framework for simple regret minimization in bandits. In this framework, learning agent interacts with sequence of bandit tasks, which are sampled i.i.d. from an unknown prior distribution, and learns its meta-parameters to perform better on future tasks. propose the first Bayesian frequentist algorithms setting. The algorithm has access distribution over meta m tasks ...

Journal: :Electronic Proceedings in Theoretical Computer Science 2020

2013
Wei Han Alexander Rakhlin Karthik Sridharan

We study the problem of online learning with a notion of regret defined with respect to a set of strategies. We develop tools for analyzing the minimax rates and for deriving regret-minimization algorithms in this scenario. While the standard methods for minimizing the usual notion of regret fail, through our analysis we demonstrate existence of regret-minimization methods that compete with suc...

Journal: :CoRR 2012
Mehrdad Mahdavi Tianbao Yang Rong Jin

Online learning constitutes a mathematical and compelling framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the learner receives a reward for the played action. The goal of the learner is to maximize his total reward. However, there are situations in which, in additio...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید