Geodesic Clustering
ثبت نشده
چکیده
We propose a new class of distances for the purpose of data clustering, called the geodesic distance, and introduce a geodesic extension of K-medoids algorithm. We analyze the theoretical properties of the geodesic distance within a clustering framework and prove that the geodesic K-medoids algorithm converges to the correct clustering assignment in the asymptotic regime, even in the presence of outliers. We also present experimental evidence on the abilities of geodesic Kmedoids to handle clustering problems involving outliers, nonlinearly separable clusters, and varying densities in the clusters. The results are compared to a few hierarchical and spectral clustering algorithms on several clustering problems.
منابع مشابه
A Self-adaptive Spectral Clustering Based on Geodesic Distance and Shared Nearest Neighbors
Spectral clustering is a method of subspace clustering which is suitable for the data of any shape and converges to global optimal solution. By combining concepts of shared nearest neighbors and geodesic distance with spectral clustering, a self-adaptive spectral clustering based on geodesic distance and shared nearest neighbors was proposed. Experiments show that the improved spectral clusteri...
متن کاملColor Object Recognition Using General Fuzzy Min Max Neural Network
A hybrid approach based on Fuzzy Logic and neural networks with the combination of the classic Hu & Zernike moments joined with Geodesic descriptors is used to keep the maximum amount of information that are given by the color of the image. These moments are calculated for each color level and geodesic descriptors are applied directly to binary images to get information about the general shape ...
متن کاملENTROPY OF GEODESIC FLOWS ON SUBSPACES OF HECKE SURFACE WITH ARITHMETIC CODE
There are dierent ways to code the geodesic flows on surfaces with negative curvature. Such code spaces give a useful tool to verify the dynamical properties of geodesic flows. Here we consider special subspaces of geodesic flows on Hecke surface whose arithmetic codings varies on a set with innite alphabet. Then we will compare the topological complexity of them by computing their topological ...
متن کاملA Local Smoothing and Geodesic Distance Based Clustering Algorithm for High Dimensional Noisy Data; Utilizing Embedded Geometric Structures
A method that utilizes embedded geometric structure is proposed, to enhance clustering. The geometric structure is identified via a combination of a local smoothing method – local linear projection (LLP) – and a computational method for the geodesic distance. It is found that comparing to existing algorithms, it is more efficient to deal with noisy and “structured” data. Simulations for both sy...
متن کاملStructure and time evolution of an Internet dating community
We present statistics for the structure and time evolution of a network constructed from user activity in an Internet community. The vastness and precise time resolution of an Internet community offers unique possibilities to monitor social network formation and dynamics. Time evolution of well-known quantities, such as clustering, mixing (degree–degree correlations), average geodesic length, d...
متن کامل