similarity measurement web mining

Search results for: similarity measurement web mining

Number of results: 829535

NaCTeM at SemEval-2016 Task 1: Inferring sentence-level semantic similarity from an ensemble of complementary lexical and sentence-level features

2016

Piotr Przybyla Nhung T. H. Nguyen Matthew Shardlow Georgios Kontonatsios Sophia Ananiadou

We present a description of the system submitted to the Semantic Textual Similarity (STS) shared task at SemEval 2016. The task is to assess the degree to which two sentences carry the same meaning. We have designed two different methods to automatically compute a similarity score between sentences. The first method combines a variety of semantic similarity measures as features in a machine lea...

A General Model for Similarity Measurement between Objects

2015

Manh Hung Nguyen Thi Hoi Nguyen

The problem of detecting the similarity or difference between objects arises regularly in several application domains such as e-commerce, social networks, expert systems, data mining, decision support systems, etc. This paper introduces a general model for measuring the similarity between objects based on their attributes. In this model, the similarity on each attribute is defined with dif...

Automatic discovery of similarity relationships through Web mining

Journal: Decision Support Systems, 2003

Dmitri Roussinov J. Leon Zhao

This work demonstrates how the World Wide Web can be mined in a fully automated manner to discover the semantic similarity relationships among the concepts surfaced during an electronic brainstorming session, thereby improving the accuracy of automated clustering of meeting messages. Our novel Context Sensitive Similarity Discovery (CSSD) method takes advantage of the meeting context when sel...

Ranking Hyperlinks Approach for Focused Web Crawler

2014

Aye Nandar Hlaing

The World Wide Web is growing rapidly and many search engines do not cover all the visible pages. Therefore, a more effective crawling method is required to collect more accurate data. In this paper, we introduce an effective focused web crawler containing smart methods. In text analysis, similarity measurement applies to different parts of the Web pages including title, body, anchor text and U...

Improving Definite Anaphora Resolution by Effective Weight Learning and Web-Based Knowledge Acquisition

Journal: IEICE Transactions, 2011

Dian-Song Wu Tyne Liang

In this paper, effective Chinese definite anaphora resolution is addressed by using feature weight learning and Web-based knowledge acquisition. The presented salience measurement is based on entropy-based weighting for selecting antecedent candidates. The knowledge acquisition model aims to extract more semantic features, such as gender, number, and semantic compatibility by employing multip...

Web Personalization using Web Mining

Journal: International Journal for Research in Applied Science and Engineering Technology, 2018

Cleaning Web Pages for Effective Web Content Mining

2006

Jing Li Christie I. Ezeife

Classifying and mining noise-free web pages will improve the accuracy of search results as well as search speed, and may benefit web page organization applications (e.g., keyword-based search engines and taxonomic web page categorization applications). Noise on web pages is irrelevant to the main content of the pages being mined, and includes advertisements, navigation bars, and copyright noti...

Computing Term Similarity by Knowledge from Big Data

2013

Peipei Li Haixun Wang Kenny Q. Zhu Zhongyuan Wang Xindong Wu

Computing semantic similarity between two terms is essential for a variety of text analytics and understanding applications. However, existing approaches are more suitable for semantic similarity between words rather than the more general multi-word expressions (MWEs), and they do not scale very well. Therefore, we propose a lightweight and effective approach for semantic similarity using a lar...

Improvement Tfidf for News Document Using Efficient Similarity

2012

Abdolkarim Elahi

This study proposes a new method for document clustering. Clustering is a very powerful data mining technique for topic discovery from documents. In document clustering, there must be high similarity among documents within a cluster and low similarity between documents of different clusters. The cosine function measures the similarity of two documents. When the clusters are not well separated, partiti...
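
The cosine measure mentioned in this abstract can be illustrated with a minimal sketch. It assumes whitespace tokenization and raw term counts; the function name `cosine_similarity` is hypothetical, and the paper's own TF-IDF weighting is not visible in the truncated abstract, so this is not the author's method.

```python
import math
from collections import Counter


def cosine_similarity(doc_a: str, doc_b: str) -> float:
    """Cosine of the angle between raw term-frequency vectors of two texts.

    Assumption: tokens are lowercase whitespace-separated words and the
    weights are plain counts, not the TF-IDF weights a real system would use.
    """
    tf_a = Counter(doc_a.lower().split())
    tf_b = Counter(doc_b.lower().split())

    # Only terms present in both documents contribute to the dot product.
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())

    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))

    # An empty document has no direction, so define its similarity as 0.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Documents with identical term distributions score close to 1, and documents with no shared terms score 0, which is the property a clustering method relies on when grouping similar documents.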

WebSim: A Pathway to Unveiling Term Relationships using a Web Search Technology

2006

Seokkyung Chung Jongeun Jun Dennis McLeod

We present WebSim (Web-based Similarity metric), whose feature extraction and similarity model are based on a conventional Web search engine. By utilizing the search engine, we can obtain the freshest content for each term, which represents up-to-date knowledge on the term. In comparison with previous text mining approaches that use a certain amount of crawled Web documents as a corpus, our me...
