Simple learning models can illuminate biased results from choice titration experiments
Abstract
The choice titration procedure presents a subject with a repeated choice between a standard option that always provides the same reward and an adjusting option whose reward schedule is adjusted according to the subject’s previous choices. The procedure is designed to find the point of indifference between the two schedules, which is then used to estimate a utility equivalence point between the two options. Analyzing the titration procedure as a Markov birth-death process, we show that a large class of reinforcement learning models invariably generates a titration bias, and that the bias varies non-linearly with the reward value. We treat several titration procedures, presenting analytic results for some simple learning models and simulation results for more complex models. These findings suggest that results from titration experiments are likely to be biased and that inferences based on them may need to be reconsidered.
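To make the birth-death view concrete, here is a minimal sketch that is not taken from the paper. It assumes a hypothetical grid of reward levels for the adjusting option, a one-step rule (choosing the adjusting option lowers its reward by one level, choosing the standard option raises it by one level), and a logistic choice rule with fixed, fully known values. The names `choose_adjusting_prob`, `stationary_distribution`, `levels`, and `temperature` are illustrative assumptions. Because this chooser does no learning, it serves only as a baseline: on a grid symmetric about the standard reward, its long-run mean sits at the standard value. The models discussed in the abstract would replace the fixed choice probabilities with ones driven by learned estimates.

```python
import math

def choose_adjusting_prob(adjusting, standard, temperature):
    """Logistic probability of picking the adjusting option (assumed rule)."""
    return 1.0 / (1.0 + math.exp(-(adjusting - standard) / temperature))

def stationary_distribution(levels, standard, temperature):
    """Stationary distribution of a birth-death chain over reward levels.

    Assumed titration rule: picking the adjusting option moves its reward
    one level down; picking the standard option moves it one level up.
    """
    pick = [choose_adjusting_prob(a, standard, temperature) for a in levels]
    weights = [1.0]
    for i in range(1, len(levels)):
        up = 1.0 - pick[i - 1]   # probability of moving from level i-1 up to i
        down = pick[i]           # probability of moving from level i down to i-1
        # Detailed balance: pi[i-1] * up == pi[i] * down
        weights.append(weights[-1] * up / down)
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical grid of adjusting rewards, symmetric about the standard reward.
levels = [float(v) for v in range(0, 21)]
standard = 10.0
pi = stationary_distribution(levels, standard, temperature=2.0)

# Long-run mean of the adjusting reward, i.e. the estimated indifference point.
mean_level = sum(a * p for a, p in zip(levels, pi))
print(f"estimated indifference point: {mean_level:.3f}")
```

Comparing `mean_level` with `standard` is one way to quantify a titration bias once the choice rule is swapped for a learning model; the detailed-balance recursion stays the same as long as the choice probability depends only on the current level.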
Similar resources
Rational Choice Theory: A Cultural Reconsideration
Economists have heralded the formulation of the expected utility theorem as a universal method of choice under uncertainty. In their seminal paper, Stigler and Becker (1977) declared that “human behavior can be explained by a generalized calculus of utility-maximizing behavior” (p.76). The universality of rational choice theory has been widely criticized by psychologists, ...
Temporally-Biased Sampling for Online Model Management
To maintain the accuracy of supervised learning models in the presence of evolving data streams, we provide temporally-biased sampling schemes that weight recent data most heavily, with inclusion probabilities for a given data item decaying exponentially over time. We then periodically retrain the models on the current sample. This approach speeds up the training process relative to training on...
Learning in Economics Experiments
Keywords: reinforcement learning, belief learning, experiments, probability matching, market price-choice games, computer simulations. This paper shows how simple psychological models of reinforcement and belief learning can account for dynamic patterns of adjustment in economics experiments.
Using the XCS Classifier System for Multi-objective Reinforcement Learning Problems
We investigate the performance of a learning classifier system in some simple multi-objective, multi-step maze problems, using both random and biased action-selection policies for exploration. Results show that the choice of action-selection policy can significantly affect the performance of the system in such environments. Further, this effect is directly related to population size, and we rel...
Theoretical Models of Learning to Learn
A machine can only learn if it is biased in some way. Typically the bias is supplied by hand, for example through the choice of an appropriate set of features. However, if the learning machine is embedded within an environment of related tasks, then it can learn its own bias by learning sufficiently many tasks from the environment [4, 6]. In this paper two models of bias learning (or equivalently,...