means cluster

نتایج جستجو برای: means cluster

تعداد نتایج: 537032 فیلتر نتایج به سال:

Streaming k-means approximation

2009

Nir Ailon Ragesh Jaiswal Claire Monteleoni

We provide a clustering algorithm that approximately optimizes the k-means objective, in the one-pass streaming setting. We make no assumptions about the data, and our algorithm is very light-weight in terms of memory, and computation. This setting is applicable to unsupervised learning on massive data sets, or resource-constrained devices. The two main ingredients of our theoretical work are: ...

متن کامل

Notes on using Determinantal Point Processes for Clustering with Applications to Text Clustering

Journal: :CoRR 2014

Apoorv Agarwal Anna Choromanska Krzysztof Choromanski

In this paper, we compare three initialization schemes for the KMEANS clustering algorithm: 1) random initialization (KMEANSRAND), 2) KMEANS++, and 3) KMEANSD++. Both KMEANSRAND and KMEANS++ have a major that the value of k needs to be set by the user of the algorithms. (Kang 2013) recently proposed a novel use of determinantal point processes for sampling the initial centroids for the KMEANS a...

متن کامل

On Clustering Histograms with k-Means by Using Mixed α-Divergences

Journal: :Entropy 2014

Frank Nielsen Richard Nock Shun-ichi Amari

Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-X used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the α-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retr...

متن کامل

Performance Enhancement of K-Means Clustering Algorithms for High Dimensional Data sets

2014

Amita Verma

Data mining has been defined as "The nontrivial extraction of implicit, previously unknown, and potentially useful information from data". Clustering is the automated search for group of related observations in a data set. The K-Means method is one of the most commonly used clustering techniques for a variety of applications. This paper proposes a method for making the K-Means algorithm more ef...

متن کامل

Cost-Effective Clustering through Active Feature-value Acquisition

2008

Many datasets include feature values that are missing but may be acquired at a cost. In this paper, we consider the clustering task for such datasets, and address the problem of acquiring missing feature values that improve clustering quality in a cost-effective manner. Since acquiring all missing information may be unnecessarily expensive, we propose a framework for iteratively selecting featu...

متن کامل

On the Consistency of k-means++ algorithm

Journal: :CoRR 2017

Mieczyslaw A. Klopotek

We prove in this paper that the expected value of the objective function of the k-means++ algorithm for samples converges to population expected value. As k-means++, for samples, provides with constant factor approximation for k-means objectives, such an approximation can be achieved for the population with increase of the sample size. This result is of potential practical relevance when one is...

متن کامل

The k-means-u* algorithm: non-local jumps and greedy retries improve k-means++ clustering

Journal: :CoRR 2017

Bernd Fritzke

We present a new clustering algorithm called k-means-u* which in many cases is able to significantly improve the clusterings found by k-means++, the current de-facto standard for clustering in Euclidean spaces. First we introduce the k-means-u algorithm which starts from a result of k-means++ and attempts to improve it with a sequence of non-local “jumps” alternated by runs of standard k-means....

متن کامل

Comparison of K-Means and Fuzzy C-Means Algorithms on Different Cluster Structures

Journal: :Journal of Agricultural Informatics 2015

متن کامل

K-Means Cluster Analysis for Image Segmentation

2014

S. M. Aqil Burney Humera Tariq

Does K-Means reasonably divides the data into k groups is an important question that arises when one works on Image Segmentation? Which color space one should choose and how to ascertain that the k we determine is valid? The purpose of this study was to explore the answers to aforementioned questions. We perform K-Means on a number of 2-cluster, 3cluster and k-cluster color images (k>3) in RGB ...

متن کامل

Multimorbidity patterns with K-means nonhierarchical cluster analysis

Journal: :BMC Family Practice 2018

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید