A tool for interactive Subgroup Discovery
نویسندگان
چکیده
We describe an approach and a tool for the discovery of subgroups within the framework of distribution rule mining. Distribution rules are a kind of association rules particularly suited for the exploratory study of numerical variables of interest. Being an exploratory technique, the result of a distribution mining process is typically a very large number of patterns. Exploring such results is thus a complex task and limits the use of the technique. To overcome this shortcoming we developed a tool, written in Java, which supports subgroup discovery in a post-processing step. The tool engages the analyst in an interactive process of subgroup discovery by means of a graphical interface with well defined statistical grounds, where domain knowledge can be used during the identification of such subgroups amid the population. We show a case study to analyze the results of students in a large scale university admission examination. Key-Words: Data Mining. Subgroup Discovery. Post-processing. Visualization. Association Rules, Distributions.
منابع مشابه
Interactive Discovery of Interesting Subgroup Sets
Although subgroup discovery aims to be a practical tool for exploratory data mining, its wider adoption is hampered by redundancy and the re-discovery of common knowledge. This can be remedied by parameter tuning and manual result filtering, but this requires considerable effort from the data analyst. In this paper we argue that it is essential to involve the user in the discovery process to so...
متن کاملVisual Interactive Subgroup Discovery with Numerical Properties of Interest
Subgroup discovery consists in finding subsets of individuals from a given population which have distinctive collective properties with regard to one or more properties of interest. The interest of a subgroup can be objectively assessed using appropriate statistics, but it can also be evaluated by a data analyst or domain expert. In this paper we propose an approach to subgroup discovery via di...
متن کاملمقایسه تأثیر سه رویکرد یاددهی ـ یادگیری بر عملکرد یادگیری دانشآموزان در درسزیستشناسی
Present study was designed to investigate the effects of three teaching- learning approaches including discovery, interactive and transmission approaches on the students learning performance in biology lesson. In this quasi- experimental research three experimental groups (N1=60, N2=71, N3=63) were used in order to identify any significant difference between the students learning performance wh...
متن کاملNovel Techniques for Efficient and Effective Subgroup Discovery
Large volumes of data are collected today in many domains. Often, there is so much data available, that it is difficult to identify the relevant pieces of information. Knowledge discovery seeks to obtain novel, interesting and useful information from large datasets. One key technique for that purpose is subgroup discovery. It aims at identifying descriptions for subsets of the data, which have ...
متن کاملExpert Discovery: A web mining approach
Expert discovery is a quest in search of finding an answer to a question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” Expert with domain knowledge in any field is crucial for consulting in industry, academia and scientific community. Aim of this study is to address the issues for expert-finding task in real-world community. Collabor...
متن کامل