Analyzing the Role of Dimension Arrangement for Data Visualization in Radviz
نویسندگان
چکیده
The Radial Coordinate Visualization (Radviz) technique has been widely used to effectively evaluate the existence of patterns in highly dimensional data sets. A crucial aspect of this technique lies in the arrangement of the dimensions, which determines the quality of the posterior visualization. Dimension arrangement (DA) has been shown to be an NP-problem and different heuristics have been proposed to solve it using optimization techniques. However, very little work has focused on understanding the relation between the arrangement of the dimensions and the quality of the visualization. In this paper we first present two variations of the DA problem: (1) a Radviz independent approach and (2) a Radviz dependent approach. We then describe the use of the Davies-Bouldin index to automatically evaluate the quality of a visualization i.e., its visual usefulness. Our empirical evaluation is extensive and uses both real and synthetic data sets in order to evaluate our proposed methods and to fully understand the impact that parameters such as number of samples, dimensions, or cluster separability have in the relation between the optimization algorithm and the visualization tool.
منابع مشابه
Multidimensional clusters in RadViz
The paper reviews those properties of RadViz visualization method [2] mapping data from n dimensional space into the plane which are important for identification of clusters in the multidimensional original data. It uses 2 characteristic examples of datasets which clearly point to a certain drawback of the original RadViz mapping. The identified problem can be resolved using 2 minor modificatio...
متن کاملProperties of normalized radial visualizations
This paper defines and establishes properties of a class of normalized radial visualizations (NRVs) that includes the RadViz mapping onto the two-dimensional unit disk. A NRV normalizes data to map highdimensional records into lower dimensional space, where records’ images are convex combinations of points called dimensional anchors. NRVs are radial visualizations because dimensional anchors ar...
متن کاملVisualization-based cancer microarray data classification analysis
MOTIVATION Methods for analyzing cancer microarray data often face two distinct challenges: the models they infer need to perform well when classifying new tissue samples while at the same time providing an insight into the patterns and gene interactions hidden in the data. State-of-the-art supervised data mining methods often cover well only one of these aspects, motivating the development of ...
متن کاملVisualization-Aided Classification Ensembles Discriminate Lung Adenocarcinoma and Squamous Cell Carcinoma Samples Using Their Gene Expression Profiles
INTRODUCTION The widespread application of microarray experiments to cancer research is astounding including lung cancer, one of the most common fatal human tumors. Among non-small cell lung carcinoma (NSCLC), there are two major histological types of NSCLC, adenocarcinoma (AC) and squamous cell carcinoma (SCC). RESULTS In this paper, we proposed to integrate a visualization method called Rad...
متن کاملPoint Sensitivity for Radial Visualization under Dimensional Anchor Motion
This paper extends prior work with normalized radial visualizations (NRVs) that includes the RadViz mapping onto the two-dimensional unit disk. Here we examine point sensitivity under varying assumptions about dimensional anchor motion. First, we describe the role of the barycenter of the dimensional anchors as the position where records map to under a NRV when all of their dimensional values a...
متن کامل