Search results for: jaccards similarity coefficient

Number of results: 273628

2010
Jun'ichi Kazama Stijn De Saeger Kow Kuroda Masaki Murata Kentaro Torisawa

Existing word similarity measures are not robust to data sparseness since they rely only on the point estimation of words’ context profiles obtained from a limited amount of data. This paper proposes a Bayesian method for robust distributional word similarities. The method uses a distribution of context profiles obtained by Bayesian estimation and takes the expectation of a base similarity meas...

Journal: CoRR 2017
Marzieh Saeidi Alessandro Venerandi Licia Capra Sebastian Riedel

In this paper, we investigate whether text from a Community Question Answering (QA) platform can be used to predict and describe real-world attributes. We experiment with predicting a wide range of 62 demographic attributes for neighbourhoods of London. We use text from the QA platform Yahoo! Answers and compare our results to those obtained from Twitter microblogs. Outcomes show that the...

Journal: Computer and Information Science 2010
Nurhilyana Anuar Abu Bakar Md Sultan

The Dice coefficient is a technique for measuring the similarity of objects and is widely used in digital libraries, the sciences, and other fields. This project is a first attempt to employ the Dice coefficient for selecting papers in a conference management system. An experimental result with limited test cases indicates that the Dice coefficient could potentially be used in a broad spectrum of respective applicat...
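For readers comparing the Dice coefficient in this entry with the Jaccard coefficient named in the search query, the following is a minimal sketch of both measures on sets. It is not taken from the paper: the function names and the example keyword sets are hypothetical, and treating two empty sets as fully similar is an assumed convention.

```python
def dice_coefficient(a, b):
    """Dice coefficient: 2|A ∩ B| / (|A| + |B|)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # assumed convention: two empty sets count as identical
    return 2 * len(a & b) / (len(a) + len(b))


def jaccard_coefficient(a, b):
    """Jaccard coefficient: |A ∩ B| / |A ∪ B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # assumed convention: two empty sets count as identical
    return len(a & b) / len(a | b)


# Hypothetical keyword sets for a submitted paper and a reviewer profile.
paper_keywords = {"similarity", "clustering", "retrieval"}
reviewer_keywords = {"similarity", "retrieval", "ranking", "indexing"}

dice = dice_coefficient(paper_keywords, reviewer_keywords)        # 2*2 / (3+4)
jaccard = jaccard_coefficient(paper_keywords, reviewer_keywords)  # 2 / 5
print(f"Dice = {dice:.3f}, Jaccard = {jaccard:.3f}")
```

The two measures rank pairs identically, since J = D / (2 - D); Dice simply gives more weight to the shared elements.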

2008
Sung-Hyuk Cha

Abstract: Distance or similarity measures are of fundamental importance to pattern classification, clustering, and information retrieval problems. Various distance/similarity measures that are applicable to compare two nominal type histograms are reviewed and categorized in both syntactic and semantic relationships. A correlation coefficient and a hierarchical clustering technique are adopted t...

2010
Niraj Aswani Robert J. Gaizauskas

In this paper, we present an approach to measure the transliteration similarity of English-Hindi word pairs. Our approach has two components. First we propose a bi-directional mapping between one or more characters in the Devanagari script and one or more characters in the Roman script (pronounced as in English). This allows a given Hindi word written in Devanagari to be transliterated into the...

2007

Distance or similarity measures are essential to solve many pattern recognition problems such as classification, clustering, and retrieval problems. Various distance/similarity measures that are applicable to compare two probability density functions, pdf in short, are reviewed and categorized in both syntactic and semantic relationships. A correlation coefficient and a hierarchical clustering ...

2012
Safaa I. Hajeer

Document retrieval is the process of matching a stated user query against a set of free-text records (documents); it is one major technique for organizing and managing information. This project studied which of the different statistical measures in IR is most effective for document retrieval using a unified set of documents. The results show that the Cosine Simil...

2014
Zdeněk Šulc

The paper deals with selected similarity measures that can be used for hierarchical clustering of nominal variables. These variables are commonly used in questionnaire surveys. Cluster analysis can be applied when a reduction of dataset size is desired. This paper examines several similarity measures for nominal variable clustering, which have been introduced in recent year...

2011
T. Rekha Kottackal Poulose Martin V. B. Sreekumar Joseph Madassery

Random amplified polymorphic DNA fingerprinting was performed to assess the genetic diversity among rarely cultivated traditional indica rice (Oryza sativa L.) varieties collected from a tribal hamlet of Kerala State, India. A total of 664 DNA bands amplified by 15 primers exhibited 72.9% polymorphism (an average of 32.3 polymorphic bands per primer). The varieties Jeerakasala and Kalladiyaran ...

2012
D. Tom-Dery

Illegal small-scale gold mining brings several benefits to developing countries like Ghana, manifested mainly as employment and revenue but simultaneously impacts negatively on the immediate environment. The study tested the hypothesis that density and diversity of key native tree and shrub species differ in the mined and unmined areas of Nangodi in the Talensi-Nabdam District of the Upper East...
