text similarity

Search results for: text similarity

Number of results: 268086

Monolingual Distributional Similarity for Text-to-Text Generation

2012

Juri Ganitkevitch Benjamin Van Durme Chris Callison-Burch

Previous work on paraphrase extraction and application has relied on either parallel datasets, or on distributional similarity metrics over large text corpora. Our approach combines these two orthogonal sources of information and directly integrates them into our paraphrasing system’s log-linear model. We compare different distributional similarity feature-sets and show significant improvements...

Text-to-Text Semantic Similarity for Automatic Short Answer Grading

2009

Michael Mohler Rada Mihalcea

In this paper, we explore unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based measures of text similarity, evaluate the effect of domain and size on the corpus-based measures, and also introduce a novel technique to improve the performance of the system by integrating automatic feedback from the student answers. Overall...

Identifying Metaphoric Antonyms in a Corpus Analysis of Finance Articles

Journal: CoRR 2011

Aaron Gerow Mark T. Keane

Using a corpus of 17,000+ financial news reports (involving over 10M words), we perform an analysis of the argument-distributions of the UP and DOWN-verbs used to describe movements of indices, stocks and shares. In Study 1 people identified antonyms of these verb sets in a free-generation task and a match-the-opposite task and the most commonly identified antonyms were compiled. In Study 2, we ...

Effective Early Termination Techniques for Text Similarity Join Operator

2005

Selma Ayse Özalp Özgür Ulusoy

The text similarity join operator joins two relations if their join attributes are textually similar to each other, and it has a variety of application domains including integration and querying of data from heterogeneous resources; cleansing of data; and mining of data. Although the text similarity join operator is widely used, its processing is expensive due to the huge number of similarity comp...
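To make the cost described in this abstract concrete, here is a minimal sketch of a naive text similarity join. It is not the authors' operator or their early-termination technique: the bag-of-words cosine measure, the function names `cosine` and `similarity_join`, and the threshold value are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def cosine(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words vectors of two strings (assumed measure)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def similarity_join(left, right, threshold=0.5):
    """Naive nested-loop join: every pair is compared, so cost is len(left) * len(right)."""
    return [(l, r) for l in left for r in right if cosine(l, r) >= threshold]


# Hypothetical join attributes from two relations.
pairs = similarity_join(
    ["text similarity join"],
    ["similarity join operator", "image recognition"],
)
```

The nested loop is what makes the operator expensive on large relations: every pair of join attributes triggers one similarity computation.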

Efficient Hybrid Semantic Text Similarity using Wordnet and a Corpus

Journal: International Journal of Advanced Computer Science and Applications 2016

Legal case document similarity: You need both network and text

Journal: Information Processing and Management 2022

Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task: network-based and text-based. Prior approaches consider citations only to prior-cases (also called precedents) (PCNet). This approach misses signals inherent in Statutes (written la...

Measuring Patent Similarity Based on Text Mining and Image Recognition

Journal: Systems 2023

Patent application is one of the important ways to protect innovation achievements that have great commercial value for enterprises; it is the initial step for enterprises to set their business development track, as well as a powerful means of protecting their core competitiveness. The emergence of a large amount of patent data makes effective detection difficult, and infringement cases occur frequently. Manual measurement is slow, costly...

Secure Computation Protocol of Text Similarity against Malicious Attacks for Text Classification in Deep-Learning Technology

Journal: Electronics 2023

With the development of deep learning, the demand for similarity matching between texts in text classification is becoming increasingly high. How to match texts quickly under the premise of keeping private information secure has become a research hotspot. However, most existing protocols currently have full set limitations, and the applicability of these methods is limited when the data size is large and scattered. Therefore, this ...

Text Reuse Detection using a Composition of Text Similarity Measures

2012

Daniel Bär Torsten Zesch Iryna Gurevych

Detecting text reuse is a fundamental requirement for a variety of tasks and applications, ranging from journalistic text reuse to plagiarism detection. Text reuse is traditionally detected by computing similarity between a source text and a possibly reused text. However, existing text similarity measures exhibit a major limitation: They compute similarity only on features which can be derived ...
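As a rough illustration of computing similarity between a source text and a possibly reused text, the sketch below scores word n-gram containment. This is one common content-based overlap measure, not the composition of measures the paper proposes; the function names `word_ngrams` and `containment` and the default n-gram size are assumptions.

```python
def word_ngrams(text: str, n: int = 3) -> set:
    """Set of lowercase word n-grams in the text (whitespace tokenization is assumed)."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def containment(source: str, candidate: str, n: int = 3) -> float:
    """Fraction of the candidate's word n-grams that also occur in the source."""
    cand = word_ngrams(candidate, n)
    if not cand:
        # Candidate is shorter than n words, so there is nothing to compare.
        return 0.0
    return len(cand & word_ngrams(source, n)) / len(cand)


# Hypothetical texts: the candidate copies a contiguous span of the source.
score = containment("the quick brown fox jumps", "quick brown fox jumps")
```

A score near 1 suggests most of the candidate's phrasing appears verbatim in the source; paraphrased reuse would score low on this measure because it only sees surface content.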

Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment

2017

Natalie Vargas Carlos Ramisch Helena de Medeiros Caseli

We propose a method for joint unsupervised discovery of multiword expressions (MWEs) and their translations from parallel corpora. First, we apply independent monolingual MWE extraction in source and target languages simultaneously. Then, we calculate translation probability, association score and distributional similarity of co-occurring pairs. Finally, we rank all translations of a given MWE ...
