نتایج جستجو برای: means cluster

تعداد نتایج: 537032  

2016
Chaoyue Liu Mikhail Belkin

Clustering, in particular k-means clustering, is a central topic in data analysis. Clustering with Bregman divergences is a recently proposed generalization of k-means clustering which has already been widely used in applications. In this paper we analyze theoretical properties of Bregman clustering when the number of the clusters k is large. We establish quantization rates and describe the lim...

2004
Guihong Cao Dawei Song Peter Bruza

One way of representing semantics could be via a high dimensional conceptual space constructed by certain lexical semantic space models. Concepts (words), represented as a vector of other words in the semantic space, can be categorized via clustering techniques into a number of regions reflecting different contexts. The conventional clustering algorithms, e.g., K-means method, however, normally...

Journal: :J. Classification 2010
Mark Ming-Tso Chiang Boris G. Mirkin

The issue of determining “the right number of clusters” in K-Means has attracted considerable interest, especially in the recent years. Cluster overlap appears to be a factor most affecting the clustering results. This paper proposes an experimental setting for comparison of different approaches at data generated from Gaussian clusters with the controlled parameters of betweenand within-cluster...

Journal: :Pattern Recognition Letters 2003
Yiu-ming Cheung

This paper presents a generalized version of the conventional k-means clustering algorithm [Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, 1, University of California Press, Berkeley, 1967, p. 281]. Not only is this new one applicable to ellipse-shaped data clusters without dead-unit problem, but also performs correct clustering without pre-assigning the exact...

2007
Antoine Naud Shiro Usui

Abstract. An application of cluster analysis to identify topics in a collection of posters abstracts from the Society for Neuroscience (SfN) Annual Meeting in 2006 is presented. The topics were identified by selecting from the abstracts belonging to each cluster the terms with the highest scores using different ranking schemes. The ranking scheme based on logentropy showed better performance in...

Journal: :CoRR 2011
Aaron Gerow Mark T. Keane

Using a corpus of over 17,000 financial news reports (involving over 10M words), we perform an analysis of the argument-distributions of the UPand DOWN-verbs used to describe movements of indices, stocks, and shares. Using measures of the overlap in the argument distributions of these verbs and k-means clustering of their distributions, we advance evidence for the proposal that the metaphors re...

Journal: :Pattern Recognition Letters 1996
Mohd Belal Al-Daoud Stuart A. Roberts

One of the most widely used clustering techniques is the kmeans algorithm. Solutions obtained from this technique are dependent on the initialisation of cluster centres. In this article, two initialisation methods are developed. These methods are particularly suited to problems involving very large data sets. The methods have been applied to di erent data sets and good results are obtained.

Journal: :JCIT 2010
Jun Tang

This paper proposed improved K-means clustering algorithm based on user tag. It first used social annotation data to expand the vector space model of K-means. Then, it applied the links involved in social tagging network to enhance the clustering performance. Experimental result shows that the proposed improved K-means clustering algorithm based on user tag is effective.

Journal: :Remote Sensing 2014
Melanie Becker Joecila Santos da Silva Stéphane Calmant Vivien Robinet Laurent Linguet Frédérique Seyler

In the Congo Basin, the elevated vulnerability of food security and the water supply implies that sustainable development strategies must incorporate the effects of climate change on hydrological regimes. However, the lack of observational hydro-climatic data over the past decades strongly limits the number of studies investigating the effects of climate change in the Congo Basin. We present th...

Journal: :Symmetry 2017
Hai Wang Xiongyou Peng Xue Xiao Yan Liu

A modified method for better superpixel generation based on simple linear iterative clustering (SLIC) is presented and named BSLIC in this paper. By initializing cluster centers in hexagon distribution and performing k-means clustering in a limited region, the generated superpixels are shaped into regular and compact hexagons. The additional cluster centers are initialized as edge pixels to imp...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید