Lecture 24, Causal Discovery
ثبت نشده
چکیده
ly, the algorithm works as follows: • Start with a complete undirected graph on all variables. • For each pair of variables, see if conditioning on some set of variables makes them conditionally independent; if so, remove their edge. • Identify all colliders by checking for conditional dependence; orient the edges of colliders. • Try to orient undirected edges by consistency with already-oriented edges; do this recursively until no more edges can be oriented. Pseudo-code is in the appendix. Call the result of the SGS algorithm Ĝ. If all of the assumptions above hold, and the algorithm is correct in its guesses about when variables are conditionally independent, then Ĝ = G. In practice, of course, conditional independence guesses are really statistical tests based on finite data, so we should write the output as Ĝn, to indicate that it is based on only n samples. If the conditional independence test is consistent, then lim n→∞ Pr ( Ĝn 6= G ) = 0 In other words, the SGS algorithm converges in probability on the correct causal structure; it is consistent for all graphs G. Of course, at finite n, the probability of error — of having the wrong structure — is (generally!) not zero, but this Pearl (2009); Janzing (2007) makes a related suggestion). Arguably then using order in time to orient edges in a causal graph begs the question, or commits the fallacy of petitio principii. But of course every syllogism does, so this isn’t a distinctively statistical issue. (Take the classic: “All men are mortal; Socrates is a man; therefore Socrates is mortal.” How can we know that all men are mortal until we know about the mortality of this particular man, Socrates? Isn’t this just like asserting that tomatoes and peppers must be poisonous, because they belong to the nightshade family of plants, all of which are poisonous?) While these philosophical issues are genuinely fascinating, this footnote has gone on long enough, and it is time to return to the main text.
منابع مشابه
Comparative Benchmarking of Causal Discovery Techniques
In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference...
متن کاملLecture 21 “ Treating the Treated : Varieties of Causal Inference ”
The last decade has seen a vigorous revival of interest in models of causal inference in econometrics and statistics more generally. In this lecture, I would like offer a few comments on this literature that try to draw out some themes from earlier aspects of the course. In this survey we will meet again several old friends: instrumental variables, quantile regression, and perhaps most fundamen...
متن کاملThe effectiveness of Guided Discovery Learning on the learning and satisfaction of nursing students
Introduction: Revision of the traditional teaching methods as well as employment of modern and active learning method through educational systems is tangible. Application of such methods are quite common in different scientific areas. Therefore, performing modern educational approaches such as self-directed and long-life learning such as Guided Discovery Learning (GDL) is a step toward the stud...
متن کاملDemonstrating SerenA: Chance Encounters in the Space of Ideas
Citation for published version (APA): Forth, J., Giannimaras, T., Wiggins, G. A., Stewart, R., Bental, D., Aylett, R., ... Woods, M. (2013). SerenA: a multi-site pervasive agent environment that supports serendipitous discovery in research. In Y. Demazeau , T. Ishida, J. M. Corchado, & J. Bajo (Eds.), Advances on Practical Applications of Agents and Multi-Agent Systems: 11th International Confe...
متن کاملA Study of Causal Discovery With Weak Links and Small Samples
Weak causal relationships and small sample size pose two significant difficulties to the automatic discovery of causal models from observational data. This paper examines the influence of weak causal links and varying sample sizes on the discovery of causal models. The experimental results i l lustrate the effect of larger sample sizes for discovering causal models reliably and the relevance of...
متن کامل