Maps for the Visualization of high-dimensional Data Spaces
نویسنده
چکیده
-The U-Matrix is a canonical tool for the display of distance structures in data space using emergent SOM (ESOM). The U-Matrix defined originally for planar map spaces is extended in this work to toroid neuron spaces. Embedding the neuron space in a finite but borderless space, such as a torus, avoids border effects of planar spaces. A planar display of a toroid map space disrupts, however, coherent U-Matrix structures. Tiling multiple instances of the U-Matrix solves this problem at the cost of multiple images of data points. The P-Matrix, as defined here, is a display of the density relationships in the data space using Pareto Density Estimation. While the P-Matrix is useful for clustering, it can also be used for a non-ambiguous display of a non planar neuron space. Centering the display for high density regions and removing ambiguous images of data points leads to U-Maps and P-Maps. U-Maps depict the distance structure of a data space as a borderless three dimensional landscape whose floor space is ordered according to the topology preserving features of ESOM. P-Maps display the density structures. Both maps are specially suited for data mining and knowledge discovery.
منابع مشابه
Simulation of Smoke Emission from Fires in High-Rise Buildings Using the 3D Model Generated from 2-Dimensional Cadastral Data
Having a 3-Dimensional model of high-rise buildings can be used in disaster management such as fire cases to reduce casualties. The fundamental dilemma in 3D building modeling is the unavailability of suitable data sources. However, available cadastral 2D maps could be used as low-cost and attainable resources for 3D building modeling. Smoke will be a great threat to people's health during a f...
متن کاملNon-linear ergodic theorems in complete non-positive curvature metric spaces
Hadamard (or complete $CAT(0)$) spaces are complete, non-positive curvature, metric spaces. Here, we prove a nonlinear ergodic theorem for continuous non-expansive semigroup in these spaces as well as a strong convergence theorem for the commutative case. Our results extend the standard non-linear ergodic theorems for non-expansive maps on real Hilbert spaces, to non-expansive maps on Ha...
متن کاملVisualizing Similarities in High Dimensional Input Spaces with a Growing and Splitting Neural Network
The recognition of similarities in high dimensional input spaces and their visualization in low dimensional output spaces is a highly demanding application area for unsupervised artificial neural networks. Some of the problems inherent to structuring high dimensional input data may be shown with an application such as text document classification. One of the most prominent ways to represent tex...
متن کاملU-maps: topograpic visualization techniques for projections of high dimensional data
The visualization of distance structures in high dimensional data as topographic maps (U-matrix) is a standard method for Emergent Self Organizing Maps (ESOM). This work describes the extension of this visualization to other projections like principal component analysis (PCA), independent component analysis (ICA), multidimensional scaling (MDS), Sammon’s mapping, or Isomap. Each of the methods ...
متن کاملVisualization of high-dimensional data with relational perspective map
Received: 9 August 2003 Revised: 14 September 2003 Accepted: 15 September 2003 Abstract This paper introduces a method called relational perspective map (RPM) to visualize distance information in high-dimensional spaces. Like conventional multidimensional scaling, the RPM algorithm aims to produce proximity preserving 2-dimensional (2-D) maps. The main idea of the RPM algorithm is to simulate a...
متن کامل