Introducing a Hybrid Swarm Intelligence Based Technique for Document Clustering
نویسندگان
چکیده
Swarm intelligence (SI) is widely used in many complex optimization problems. It is a collective behavior of social systems such as honey bees (bee algorithm, BA) and birds (particle swarm optimization, PSO). This paper presents a detailed overview of Particle Swarm Optimization (PSO), its variants and hybridization of PSO with Bee Algorithm (BA). This paper also surveys various SI techniques presented by the researchers. The objective is to utilize the capability of this technique for document clustering which will be utilized to solve the issues of clustering by applying modifications to the Bee Algorithm and Particle
منابع مشابه
Text Clustering Quality Improvement using a hybrid Social spider optimization
Text document clustering is one of the most widely studied data mining problems. It organizes text documents into groups such that each group has similar text documents. While grouping text documents, several issues have been observed. Accuracy and Efficiency are the main issues in text document clustering. Recently, as clustering problem can be mapped to optimization problem, evolutionary opti...
متن کاملStock Price Prediction using Machine Learning and Swarm Intelligence
Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...
متن کاملComputational Intelligence Methods for Clustering of Sense Tagged Nepali Documents
This paper presents a method using hybridization of self organizing map (SOM ), particle swarm optimization(PSO) and k-means clustering algorithm for document clustering. Document representation is an important step for clustering purposes. The common way of represent a text is bag of words approach. This approach is simple but has two drawbacks viz. synonymy and polysemy which arise because of...
متن کاملDiscrete PSO with GA Operators for Document Clustering
The paper presents Discrete PSO algorithm for document clustering problems. This algorithm is hybrid of PSO with GA operators. The proposed system is based on population-based heuristic search technique, which can be used to solve combinatorial optimization problems, modeled on the concepts of cultural and social rules derived from the analysis of the swarm intelligence (PSO) with GA operators ...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کامل