XGvis: Interactive Data Visualization with Multidimensional Scaling
نویسندگان
چکیده
We discuss interactive techniques for multidimensional scaling (MDS) and a system, named \XGvis", that implements these techniques. MDS is a method for visualizing proximity data, that is, data where objects are characterized by dissimilarity values for all pairs of objects. MDS constructs maps of these objects in IR k by interpreting the dissimilarities as distances. MDS in its conventional batch implementations is prone to uncertainties with regard to 1) local minima in the underlying optimization, 2) sensitivity to the choice of the optimization criterion, 3) artifacts in point conngurations, and 4) local inadequacy of the point conngurations. These uncertainties will be addressed by the following interactive techniques: 1) algorithm animation, random restarts, and manual editing of conngurations, 2) interactive control over parameters that determine the criterion and its minimization, 3) diagnostics for pinning down artifactual point conngurations, and 4) restricting MDS to subsets of objects and subsets of pairs of objects. MDS was originally developed for the social sciences, but it is now also used for laying out graphs. Graph layout is usually done in 2D, but we allow layouts in arbitrary dimensions. We permit missing values, which can be used to implement multidimensional unfolding. We show applications to the mapping of computer usage data, to the dimension reduction of marketing segmentation data, to the layout of mathematical graphs and social network graphs, and nally to the reconstruction of molecules in nano technology. 1 XGvis uses the XGobi system for visualizing point conngurations. The XGvis system, which implements these techniques, is freely available with the XGobi distribution from
منابع مشابه
Visualization Methodology for Multidimensional Scaling
We describe methodology for multidimensional scaling based on interactive data visualization. This methodology was enabled by software in which MDS is integrated in a multivariate data visualization system. The software, called “XGvis”, is described in a companion paper (Buja, Swayne, Littman, Dean and Hofmann 2001), that lays out the implemented functionality in some detail; in the current pap...
متن کاملXgobi: Interactive Dynamic Data Visualization in the X Window System
XGobi is a data visualization system with state-of-the-art interactive and dynamic methods for the manipulation of views of data. It implements 2-D displays of projections of points and lines in high-dimensional spaces, as well as parallel coordinate displays and textual views thereof. Projection tools include dotplots of single variables, plots of pairs of variables, 3-D data rotations, variou...
متن کاملData Visualization With Multidimensional Scaling
We discuss methodology for multidimensional scaling (MDS) and its implementation in two software systems, GGvis and XGvis. MDS is a visualization technique for proximity data, that is, data in the form of N × N dissimilarity matrices. MDS constructs maps (“configurations,” “embeddings”) in IRk by interpreting the dissimilarities as distances. Two frequent sources of dissimilarities are high-dim...
متن کاملSanger-driven MDSLocalize - a comparative study for genomic data
Multidimensional scaling (MDS) methods are designed to establish a one-to-one correspondence of input-output relationships. While the input may be given as high-dimensional data items or as adjacency matrix characterizing data relations, the output space is usually chosen as low-dimensional Euclidean, ready for visualization. MDSLocalize, an existing method, is reformulated in terms of Sanger’s...
متن کاملProxiViz: an Interactive Visualization Technique to Overcome Multidimensional Scaling Artifacts
Projection algorithms such as multidimensional scaling are often used to visualize high-dimensional data. However, when attempting to interpret the visualization of the resulting 2D projection, users are faced with artifacts. This poster introduces ProxiViz: an interactive technique to provide better insights about the original data-space. Primary results of a controlled experiment show that Pr...
متن کامل