Band depth based initialization of K-means for functional data clustering
نویسندگان
چکیده
Abstract The k -Means algorithm is one of the most popular choices for clustering data but well-known to be sensitive initialization process. There a substantial number methods that aim at finding optimal initial seeds -Means, though none them universally valid. This paper presents an extension longitudinal such methods, BRIk algorithm, relies on set centroids derived from bootstrap replicates and use versatile Modified Band Depth. In our approach we improve method by adding step where fit appropriate B-splines observations resampling process allows computational feasibility handling issues as noise or missing data. We have two techniques providing suitable seeds, each stressing respectively multivariate functional nature Our results with simulated real sets indicate F unctional Data A pproach BRIK (FABRIk) D ata E xtension (FDEBRIk) are more effective than previous proposals initialize in terms recovery.
منابع مشابه
Persistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملCanonical k-Means Clustering for Functional Data
Thaddeus Tarpey and Eva Petkova 1 Department Mathematics and Statistics, Wright State University, Dayton, Ohio 45435, [email protected]. 2 Department of Child and Adolescent Psychiatry, New York University, New York, NY 10016-6023 Abstract Cluster analysis is a powerful tool for discovering sources of heterogeneity in data. However, clinically interesting sources of heterogeneity, such...
متن کاملpersistent k-means: stable data clustering algorithm based on k-means algorithm
identifying clusters or clustering is an important aspect of data analysis. it is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. it is a main task of exploratory data mining, and a common technique for statistical data analysis this paper proposed an improved version of k-means algorithm, namely persistent k...
متن کاملEfficient and Fast Initialization Algorithm for K- means Clustering
The famous K-means clustering algorithm is sensitive to the selection of the initial centroids and may converge to a local minimum of the criterion function value. A new algorithm for initialization of the K-means clustering algorithm is presented. The proposed initial starting centroids procedure allows the K-means algorithm to converge to a “better” local minimum. Our algorithm shows that ref...
متن کاملCluster center initialization algorithm for K-means clustering
Performance of iterative clustering algorithms which converges to numerous local minima depend highly on initial cluster centers. Generally initial cluster centers are selected randomly. In this paper we propose an algorithm to compute initial cluster centers for K-means clustering. This algorithm is based on two observations that some of the patterns are very similar to each other and that is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Advances in data analysis and classification
سال: 2022
ISSN: ['1862-5355', '1862-5347']
DOI: https://doi.org/10.1007/s11634-022-00510-w