thompson

Thompson Sampling for Multi-Objective Multi-Armed Bandits Problem

2015

Saba Yahyaa

The multi-objective multi-armed bandit (MOMAB) problem is a sequential decision process with stochastic rewards. Each arm generates a vector of rewards instead of a single scalar reward. Moreover, these multiple rewards might be conflicting. The MOMAB-problem has a set of Pareto optimal arms and an agent’s goal is not only to find that set but also to play evenly or fairly the arms in that set....

متن کامل

Learning and Optimization for Sequential Decision Making 02 / 01 / 16 Lecture 4 : Thompson Sampling ( part 1 )

2016

Erik Waingarten

Consider the problem of learning a parametric distribution from observations. A frequentist approach to learning considers parameters to be fixed, and uses the data learn those parameters as accurately as possible. For example, consider the problem of learning Bernoulli distribution’s parameter ( a random variable is distributed as Bernoulli(μ) is 1 with probability μ and 0 with probability 1 −...

متن کامل

Vitamin A and Diarrheal Pathogen Infections

2006

Kurt Z. Long Jose Ignacio Santos Jorge L. Rosado Catalina Lopez-Saucedo Rocio Thompson-Bonilla Maricela Abonce Herbert L. DuPont Ellen Hertzmark Teresa Estrada-Garcia

Kurt Z. Long, Jose Ignacio Santos, Jorge L. Rosado, Catalina Lopez-Saucedo, Rocio Thompson-Bonilla, Maricela Abonce, Herbert L. DuPont, Ellen Hertzmark, and Teresa Estrada-Garcia Department of Nutrition and Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, and University of Texas Medical School and School of Public Health, Houston; Hospital Infantil de Mexico F...

متن کامل

Verbal and Nonverbal Cues Activate Concepts Differently, at Different Times

2013

Pierce Edmiston Gary Lupyan

Although the word “dog” and an unambiguous barking sound may point to the same concept DOG, verbal labels and nonverbal cues appear to activate conceptual information in systematically different ways (Lupyan & Thompson-Schill, 2012). Here we investigate these differences in more detail. We replicate the finding that labels activate a more prototypical representation than do sounds, and find tha...

متن کامل

Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors

Journal: :CoRR 2017

Yichi Zhou Jun Zhu Jingwei Zhuo

Thompson sampling has impressive empirical performance for many multi-armed bandit problems. But current algorithms for Thompson sampling only work for the case of conjugate priors since these algorithms require to infer the posterior, which is often computationally intractable when the prior is not conjugate. In this paper, we propose a novel algorithm for Thompson sampling which only requires...

متن کامل

The Seasonal Health Questionnaire is more effective at detecting seasonal affective disorder than the Seasonal Pattern Adjustment Questionnaire.

Journal: :Evidence-based mental health 2004

John M Eagles

Thompson C, Thompson S, Smith R. Prevalence of seasonal affective disorder in primary care; a comparison of the seasonal health questionnaire and the seasonal pattern assessment questionnaire. J Affect Disord 2004;78:219–26. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ...

متن کامل

اصل دفاع از خود و تعارض جان مادر و جنین از دیدگاه تامسون

ژورنال: اخلاق و تاریخ پزشکی 2016

آل بویه, علیرضا, دهقانی نیستانی, زینب,

Self-defence is everyone’s right. Based on this right, one can defend himself against any eminent threat, even if it cause the predator’s death. This ethical principle is an applicable principle in ethics in war. However, the principle of self-defence has been applied in other situation such as justification of abortion if mother’s life is threatened by her fetus. Judith Thompson is a philosoph...

متن کامل

Are Bayesian Inferences Weak for Wasserman's Example?

Journal: :Communications in Statistics - Simulation and Computation 2010

Longhai Li

An example was given in the textbook All of Statistics (Wasserman, 2004, pages 186-188) for arguing that, in the problems with a great many parameters Bayesian inferences are weak, because they rely heavily on the likelihood function that captures information of only a tiny fraction of the total parameters. Alternatively he suggested non-Bayesian Horwitz-Thompson estimator, which cannot be obta...

متن کامل

Satisficing in Time-Sensitive Bandit Learning

2018

Daniel Russo Benjamin Van Roy

Much of the recent literature on bandit learning focuses on algorithms that aim to converge on an optimal action. One shortcoming is that this orientation does not account for time sensitivity, which can play a crucial role when learning an optimal action requires much more information than near-optimal ones. Indeed, popular approaches such as upper-confidence-bound methods and Thompson samplin...

متن کامل

: Maya Archaeologist . J. Eric S. Thompson.

Journal: :American Anthropologist 1964

متن کامل