Data Mining Geoscientific Data Sets Using Self Organizing Maps
نویسنده
چکیده
Geoscientists are increasingly challenged by the joint interpretation of ever-expanding amounts of new and historic, spatially-located exploration data (e.g., geochemistry, geophysics, geology, mineralogy, elevation data, etc.). And because, we can gather data faster than it can be interpreted, the availability of geographic information systems (GIS) has, to some extent, compounded, rather than reduced this problem. Research into the analysis and interpretation methods for data held in a GIS is in its infancy. A limited number of “advanced” interpretation methods have been developed; however, these often rely on a priori knowledge, training, or assumptions about mineralisation models. Objective, unsupervised methods for the spatial analysis of disparate data sets are needed. We have investigated and developed a new computational “tool” to assist in the interpretation of spatially located mineral exploration data sets. Our procedures are based on the data-ordering and visualization capabilities of the Self Organizing Map (SOM), combined with interactive software to investigate and display the spatial context of the derived SOM “clusters”. These computational procedures have the capacity to improve the efficiency and effectiveness of geoscientists as they attempt to discover and understand the often subtle signals associated with specific geological processes (e.g., mineralization), and separate them from the effects of overprinting noise caused by other processes such as metamorphism or weathering. Based on the principles of “ordered vector quantization”, the SOM approach has the advantage that all input data samples are represented as vectors in a data-space defined by the number of observations (variables) for each sample. The SOM procedure is an exploratory data analysis technique whereby patterns and relationships within a database are internally derived (unsupervised) based on measures of vector similarity (e.g., Euclidean distance and the dot product). The outputs of a SOM analysis are highly visual, which assists the analyst in understanding the data’s internal relationships.
منابع مشابه
Interval set clustering of web users using modified Kohonen self-organizing maps based on the properties of rough sets
Web usage mining involves application of data mining techniques to discover usage patterns from the web data. Clustering is one of the important functions in web usage mining. The likelihood of bad or incomplete web usage data is higher than the conventional applications. The clusters and associations in web usage mining do not necessarily have crisp boundaries. Researchers have studied the pos...
متن کاملData Mining Using Self-Organizing Kohonen Maps: A Technique for Effective Data Clustering & Visualization
Exploratory data mining using artificial neural networks offers an alternative dimension to data mining, in particular techniques geared towards data clustering and classification. In this paper, we argue the case for using neural networks as a viable data mining tool that can provide statistical insights and models from large data-sets. We demonstrate how Self-Organizing Kohonen Maps, an unsup...
متن کاملESOM-Maps: tools for clustering, visualization, and classification with Emergent SOM
An overview on the usage of emergent self organizing maps is given. U-Maps visualize the distance structures of high dimensional data sets. P-Maps show their density structures and U*-Maps combine the advantages of the mentioned maps to a visualization suitable to detect nontrivial cluster structures. A concise summary on the usage of Emergent Self-organizing Maps (ESOM) for data mining is give...
متن کاملActa Polytechnica Scandinavica Mathematics, Computing and Management in Engineering Series No. 82 Data Exploration Using Self-organizing Maps Data Exploration Using Self-organizing Maps. Acta Polytechnica
Finding structures in vast multidimensional data sets be they measurement data statistics or textual documents is di cult and time consuming Interesting novel relations between the data items may be hidden in the data The self organizing map SOM algorithm of Kohonen can be used to aid the exploration the structures in the data sets can be illustrated on special map displays In this work the met...
متن کاملData Mining Using Synergies Between Self-Organizing Maps and Inductive Learning of Fuzzy Rules
Identifying structures in large data sets raises a number of problems. On the one hand, many methods cannot be applied to larger data sets, while, on the other hand, the results are often hard to interpret. We address these problems by a novel three-stage approach. First, we compute a small representation of the input data using a self-organizing map. This reduces the amount of data and allows ...
متن کاملSurvey and Comparison of Quality Measures for Self-Organizing Maps
Self-Organizing Maps have a wide range of beneficial properties for data mining, like vector quantization and projection. Several measures exist that quantify the quality of either of these properties. The scope of this work is to describe and compare some of the most well-known measures. This is done by conducting a series of experiments for different map topologies with several well-known dat...
متن کامل