A Statistical Perspective on Knowledge Discovery in Databases 4.1 Recent Statistical Contributions
نویسندگان
چکیده
The quest to nd models usefully characterizing data is a process central to the scientiic method, and has been carried out on many fronts. Researchers from an expanding number of elds have designed algorithms to discover rules or equations that capture key relationships between variables in a database. The task of this chapter is to provide a perspective on statistical techniques applicable to KDD; accordingly, we review below some major advances in statistics in the last few decades. We next highlight some distinctives of what may be called a \statistical viewpoint." Finally we overview some innuential classical and modern statistical methods for practical model induction. It would be unfortunate if the KDD community dismissed statistical methods on the basis of courses that they took on statistics several to many years ago. The following provides a rough chronology of \recent" signiicant contributions in statistics that are relevant to the KDD community. The noteworthy fact is that this time period coincides with the signiicant increases in computing horsepower and memory, powerful and expressive programming languages, and general accessibility to computing that has propelled us into
منابع مشابه
Clustering and Knowledge Discovery in Spatial Databases
In the past decades, clustering has been widely used in areas such as pattern recognition, data analysis, and image processing. Recently, clustering has been recognized as a useful method for knowledge discovery in spatial databases. To eeciently detect clusters from large spatial databases with limited amount of available memory, special database techniques have been developed. In this article...
متن کاملData Mining & Knowledge Discovery in Databases: An AI Perspective
Data mining and Knowledge discovery has several important application areas. Data mining and knowledge discovery have been topics considered at many AI, database and statistical conferences. Knowledge discovery generally refers to the process of identifying valid, novel and understandable patterns. Knowledge discovery from large databases, often called data mining, refers to the application of ...
متن کاملبررسی کاربردهای داده کاوی در نظام سلامت
Introduction: Extensive amounts of data stored in medical databases require the development of specialized tools for accessing the data, data analysis, knowledge discovery, and the effective use of the data. Data mining is one of the most important methods. The article sketches the used Data Mining techniques, and illustrates their applicability to medical diagnostic and prognostic problems. ...
متن کاملA Statistical Perspective on KDD
The quest to find models usefully characterizing data is a process central to the scientific method and has been carried out on many fronts. Researchers from an expanding number of fields have designed algorithms to discover rules or equations that capture key relationships between variables in a database. Some modern heuristic modeling approaches seem to have gained in popularity partly as a w...
متن کاملConcept drift detection in event logs using statistical information of variants
In recent years, business process management (BPM) has been highly regarded as an improvement in the efficiency and effectiveness of organizations. Extracting and analyzing information on business processes is an important part of this structure. But these processes are not sustainable over time and may change for a variety of reasons, such as the environment and human resources. These changes ...
متن کامل