A Kdd Process to Retrieve and Aggregate Data from Relational Databases
نویسندگان
چکیده
Relational databases are a standard for representing data models. SQL is the most widely used language for querying such databases. Consequently, in many research domains, scientists extract data from relational databases, compute them and do statistical treatments. But they have to deal with the complexity of relational databases models. In addition, it takes a long time for the scientists to manually retrieve and compute data. That's why we propose a system which automatically does. It contains the following layers: parameterizable extraction of data, automatic process of SQL queries, data aggregation, statistical parameters computation, writing the results to tables and final data processing by the scientist, thanks to a statistical analysis software. A use case on the research on a soil quality index from a large relational database will be presented.
منابع مشابه
Knowledge Discovery and Data Mining in Databases
Knowledge Discovery in Databases (KDD) is the process of automatic discovery of previously unknown patterns, rules, and other regular contents implicitly present in large volumes of data. Data Mining (DM) denotes discovery of patterns in a data set previously prepared in a specific way. DM is often used as a synonym for KDD. However, strictly speaking DM is just a central phase of the entire pr...
متن کاملUsing SQL primitives and parallel DB servers to speed up knowledge discovery in large relational databases
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stored in commercial databases. We argue that high efficiency in KDD can be achieved by combining two approaches, namely mapping KDD functionality onto standard DBMS operations and executing KDD tasks on a parallel SQL server. We propose generic KDD primitives which underly the candidate-rule evaluat...
متن کاملKDD-93: Progress and Challenges in Knowledge Discovery in Databases
Shapiro 1992) devoted or closely related to discovery in databases. The application side is of interest to any business or organization with large databases. KDD applications have been reported in many areas of business, government, and science (Parsaye and Chignell 1993; Inmon and Osterfelt 1991; Piatetsky-Shapiro and Frawley 1991). The notion of discovery in databases has been given various n...
متن کاملA Database Interface for Clustering in Large Spatial Databases
Both the number and the size of spatial databases are rapidly growing because of the large amount of data obtained from satellite images, X-ray crystallography or other scientific equipment. Therefore, automated knowledge discovery becomes more and more important in spatial databases. So far, most of the methods for knowledge discovery in databases (KDD) have been based on relational database s...
متن کاملA Calculus for Fuzzy Queries on Fuzzy Entity-Relationship Model
Most query languages are designed to retrieve information from databases containing precise and certain data using precisely specified commands. Application of fuzzy set theory to relational data models has been studied extensively in recent years. This paper presents a calculus for fuzzy queries on a fuzzy entity-relationship model. The paper, first, defines a fuzzy entity-relationship model c...
متن کامل