Methods for interpreting a self-organized map in data analysis
نویسندگان
چکیده
The Self-Organizing Map (SOM) can be used for forming overviews of multivariate data sets and for visualizing them on graphical map displays. Each map location represents certain kinds of data items and the value of a variable in the representations can be visualized in the corresponding locations on the map display. Such component plane displays contain all the information needed for interpreting the map but information about the relations of the variables remains implicit. We have developed methods that visualize explicitly the contribution of each variable in the organization of the map at diierent locations. It is also possible to measure the contribution of each variable in the cluster structure within an area of the map to summarize, for instance, the characteristics of clusters. 1. Introduction The SOM algorithm 2, 3] forms a mapping of a usually two-dimensional map lattice into the high-dimensional data space. There is a model vector connected to each point of the discrete lattice. The model vectors are situated in the data space; they act as an ordered set of models of diierent types of data items. The map can be used as an ordered groundwork for illustrating diierent aspects of the data set. In addition to visualizing the values of the original variables as component planes (examples are shown in Fig. 2a) the map can be used to visualize the clustering tendency of the data in diierent regions of the data space. The model vectors follow the distribution of the data items and therefore the distances between the model vectors connected to neighboring points on the map lattice are shorter in clustered areas than in sparser regions. The so-called U-matrix display 4], an example of which is shown in Fig. 1, depicts the distances between neighboring model vectors as gray levels.
منابع مشابه
Statistical Prediction of Probable Seismic Hazard Zonation of Iran Using Self-organized Artificial Intelligence Model
The Iranian plateau has been known as one of the most seismically active regions of the world, and it frequently suffers destructive and catastrophic earthquakes that cause heavy loss of human life and widespread damage. Earthquakes are regularly felt on all sides of the region. Prediction of the occurrence location of the future earthquakes along with determining the probability percentage can...
متن کاملClassification of Streaming Fuzzy DEA Using Self-Organizing Map
The classification of fuzzy data is considered as the most challenging areas of data analysis and the complexity of the procedures has been obstacle to the development of new methods for fuzzy data analysis. However, there are significant advances in modeling systems in which fuzzy data are available in the field of mathematical programming. In order to exploit the results of the researches on ...
متن کاملNGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...
متن کاملApplication of Self Organized Maps and Curvilinear Component Analysis to the Discrimination of the Vesuvius Seismic Signals
This paper reports on the unsupervised analysis of seismic signals recorded by four stations situated on the Vesuvius area in Naples, Italy. The dataset under examination is composed of earthquakes and false events like thunders, quarry blasts and man-made undersea explosions. The goal is to use these specific data for comparing the performance of three projection methods that are well known to...
متن کاملLandforms identification using neural network-self organizing map and SRTM data
During an 11 days mission in February 2000 the Shuttle Radar Topography Mission (SRTM) collected data over 80% of the Earth's land surface, for all areas between 60 degrees N and 56 degrees S latitude. Since SRTM data became available, many studies utilized them for application in topography and morphometric landscape analysis. Exploiting SRTM data for recognition and extraction of topographic ...
متن کامل