Annotating Web Image using Parallel Graph Bipartition and Word Clustering
Abstract
This paper proposes a novel web image annotation method based on clustering candidate annotations and parallel graph bipartition. First, surrounding texts and other textual information in the hosting pages are extracted as candidate annotations. The candidate annotation sets of Web images are usually fairly large, so we cluster the candidate annotations to reduce computational complexity. Next, the centroids of the resulting clusters and the distances between them are used to construct a graph. A parallel 0.87856-approximation heuristic MAX-CUT algorithm is then applied to partition the graph. Finally, one part of the graph partition is selected as the final annotation result. Experimental results show that our method works more effectively than existing methods.
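The graph construction and bipartition steps described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the cluster labels and 2-D centroid vectors are hypothetical toy data, Euclidean distance stands in for whatever distance the paper uses, and a sequential 1-flip local search replaces the paper's parallel 0.87856 MAX-CUT algorithm, which the abstract does not spell out. The rule for choosing which part becomes the final annotation is not given in the abstract, so the sketch stops at the two parts.

```python
import itertools
import math
import random


def build_graph(centroids):
    """Complete weighted graph: edge weight = Euclidean distance between centroids."""
    return {
        (i, j): math.dist(centroids[i], centroids[j])
        for i, j in itertools.combinations(range(len(centroids)), 2)
    }


def cut_weight(weights, side):
    """Total weight of edges whose endpoints lie on different sides of the cut."""
    return sum(w for (i, j), w in weights.items() if side[i] != side[j])


def local_search_max_cut(weights, n, seed=0):
    """1-flip local search (a stand-in heuristic, not the paper's algorithm).

    Starting from a random bipartition, move single vertices across the cut
    for as long as doing so strictly increases the cut weight.
    """
    rng = random.Random(seed)
    side = [rng.randint(0, 1) for _ in range(n)]
    improved = True
    while improved:
        improved = False
        for v in range(n):
            before = cut_weight(weights, side)
            side[v] ^= 1  # tentatively move v to the other side
            if cut_weight(weights, side) > before + 1e-12:
                improved = True
            else:
                side[v] ^= 1  # no gain: move v back
    return side


# Hypothetical cluster labels and centroids (assumed toy data, not from the paper).
labels = ["beach", "sea", "sand", "login", "menu"]
centroids = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0)]

weights = build_graph(centroids)
side = local_search_max_cut(weights, len(centroids))

part_a = [labels[i] for i, s in enumerate(side) if s == 0]
part_b = [labels[i] for i, s in enumerate(side) if s == 1]
print("part A:", part_a)
print("part B:", part_b)
```

A 1-flip local optimum always cuts at least half of the total edge weight, which is a weaker guarantee than the 0.87856 ratio named in the abstract; the sketch is meant to show the data flow from centroids to a bipartition, not to reproduce the paper's performance.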
Similar resources
Finding Community Base on Web Graph Clustering
Search pointers organize the main part of the applications on the Internet. However, because of information-management hardware, the high volume of data, and word similarities across different fields, most answers to users' questions are not correct. Web graph clustering, and placing clusters in the corresponding answers, therefore helps users reach their intended results. Community (web communit...
Contents Special Issue : Recent Advances in Computer Science and Engineering
SPECIAL ISSUE PAPERS An Effective Adaptive Multi-objective Particle Swarm for Multimodal Constrained Function Optimization Yongquan Zhou and Shengyu Pei Research on Formal Verification Technique for Aircraft Safety-Critical Software Yongfeng Yin, Bin Liu, and Duo Su Determination of Optimal SVM Parameters by Using GA/PSO Yuan Ren and Guangchen Bai Water Resources Allocation Effect Evaluation Ba...
High Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation
Image segmentation is one of the most common steps in digital image processing. There are many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task, mainly due to noise, low contrast, and steep...
Graph-based Word Clustering using a Web Search Engine
Word clustering is important for automatic thesaurus construction, text classification, and word sense disambiguation. Recently, several studies have reported using the web as a corpus. This paper proposes an unsupervised algorithm for word clustering based on a word similarity measure by web counts. Each pair of words is queried to a search engine, which produces a co-occurrence matrix. By cal...
Journal title: JCP
Volume 5
Publication date: 2010