Specializing CRISP-DM for Evidence Mining
نویسندگان
چکیده
The use of all forms of computer and communication devices is changing human interaction and thinking. Electronic traces of actions and activities are continually being left behind most often unknowingly so. This situation creates opportunities for criminal investigators to make use of these traces and marks to uncover evidence. In this evidentiary discovery process several problems are experienced including the linking of unstructured pieces of data to an evidence trail. Analysis is a crucial aspect of the overall Cyber Forensic process for which adequate support is not provided. In this article it is argued that in order to alleviate the situation around analysis and further the foundations of cyber forensics it is necessary to learn from another field that also needs to deal with vast amounts of information and develop methods for automated interpretation thereof; the field of Knowledge Discovery and Data Mining (KDD). A specialization of a well known KDD process (CRISP-DM) is developed and named CRISP-EM. The process of specialization is described and some of the results are shown. It is further shown that the CRISP-EM methodology supports a structured approach in defining the research gaps in evidence mining.
منابع مشابه
A cost model to estimate the effort of data mining projects (DMCoMo)
CRISP-DM is the standard to develop Data Mining projects. CRISP-DM proposes processes and tasks that you have to carry out to develop a Data Mining project. A task proposed by CRISP-DM is the cost estimation of the Data
متن کاملCRISP-DM: Towards a Standard Process Model for Data Mining
The CRISP-DM (CRoss Industry Standard Process for Data Mining) project proposed a comprehensive process model for carrying out data mining projects. The process model is independent of both the industry sector and the technology used. In this paper we argue in favor of a standard process model for data mining and report some experiences with the CRISP-DM process model in practice. We applied an...
متن کاملThe Search for Gold Nuggets Using CRISP-DM Without a Seasoned Miner
The rise of data mining has brought many changes to people’s lives but also to companies and the importance of data analysis. Companies always had a tendency to gather as much data as possible but it has only been recently due to the developments in IT that large quantities of data can be analyzed in a fast and easy way. This new field gave rise to the methodology of Cross Industry Standard Pro...
متن کاملCustomer Retention Based on the Number of Purchase: A Data Mining Approach
Purpose: this study wants to find any relationship between the numbers of purchase and the income the customer brings to the company. The attempt is to find those customers who buy more than one life insurance policy and represent the signs of good payments at the same time by the help of data mining tools. Design/ methodology/ approach: the approach of this research is to use data mining tools...
متن کاملAdapting CRISP-DM Process for Social Network Analytics: Application to Healthcare
One of the key limitations about research involving big data is the lack of a sound methodological process that drives the conceptual and analytical questions posed to the data. In this study, we adapt the popular CRISP-DM process to analyze large volumes of unstructured data to generate analytical insights. We add specificity to the CRISP-DM methodology. Specifically, we propose “Cross Industr...
متن کامل