Understanding Simpson’s Paradox
نویسنده
چکیده
Simpson’s paradox is often presented as a compelling demonstration of why we need statistics education in our schools. It is a reminder of how easy it is to fall into a web of paradoxical conclusions when relying solely on intuition, unaided by rigorous statistical methods. In recent years, ironically, the paradox assumed an added dimension when educators began using it to demonstrate the limits of statistical methods, and why causal, rather than statistical considerations are necessary to avoid those paradoxical conclusions (Arah, 2008; Pearl, 2009, pp. 173–182; Wasserman, 2004). My comments are divided into two parts. First, I will give a brief summary of the history of Simpson’s paradox and how it has been treated in the statistical literature in the past century. Next I will ask what is required to declare the paradox “resolved,” and argue that modern understanding of causal inference has met those requirements.
منابع مشابه
Comment: Understanding Simpson’s Paradox
I thank the editor, Ronald Christensen, for the opportunity to discuss this important topic and to comment on the article by Armistead. Simpson’s paradox is often presented as a compelling demonstration of why we need statistics education in our schools. It is a reminder of how easy it is to fall into a web of paradoxical conclusions when relying solely on intuition, unaided by rigorous statist...
متن کاملComputational Social Scientist Beware: Simpson's Paradox in Behavioral Data
Observational data about human behavior is often heterogeneous, i.e., generated by subgroups within the population under study that vary in size and behavior. Heterogeneity predisposes analysis to Simpson’s paradox, whereby the trends observed in data that has been aggregated over the entire population may be substantially different from those of the underlying subgroups. I illustrate Simpson’s...
متن کاملHow Likely is Simpson's Paradox in Path Models?
Simpson’s paradox is a phenomenon arising from multivariate statistical analyses that often leads to paradoxical conclusions; in the field of e-collaboration as well as many other fields where multivariate methods are employed. We derive a general inequality for the occurrence of Simpson’s paradox in path models with or without latent variables. The inequality is then used to estimate the proba...
متن کاملThe ubiquity of the Simpson’s Paradox
Correspondence: [email protected] Department of Mathematics and Statistics of McMaster University, 1280 Main Street West, Hamilton, (ON) L8S-4K1, Canada Abstract The Simpson’s Paradox is the phenomenon that appears in some datasets, where subgroups with a common trend (say, all negative trend) show the reverse trend when they are aggregated (say, positive trend). Even if this issue has ...
متن کاملSimpson’s paradox, moderation, and the emergence of quadratic relationships in path models: An information systems illustration
While Simpson’s paradox is well-known to statisticians, it seems to have been largely neglected in many applied fields of research, including the field of information systems. This is problematic because of the strange nature of the phenomenon, the wrong conclusions and decisions to which it may lead, and its likely frequency. We discuss Simpson’s paradox and interpret it from the perspective o...
متن کامل