Computing semantic similarity from bilingual dictionaries

نویسندگان

  • Steffen Eger
  • Ineta Sejane
چکیده

In this paper, we address the task of calculating monoand bilingual semantic similarity. We introduce a method that, in order to arrive at a measure of semantic relatedness, exploits the information implicitly contained in bilingual dictionaries. Through experiments we show that our method performs well, with a performance comparable to approaches based on hierarchical knowledge bases and corpus statistics. The advantage of our approach is that it solely relies on easily available bilingual dictionaries and that it is capable of computing monoand bilingual semantic relatedness at the same time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On multiword lexical units and their role in maritime dictionaries

Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...

متن کامل

Measuring the Semantic Distance between Languages from a Statistical Analysis of Bilingual Dictionaries

A bilingual dictionary is a valuable linguistic resource which records, among other things, the di erences in the segmentation of semantic space by the two languages and hence the di culty in producing faithful translations between the two languages. Statistical analysis of nearly a hundred dictionaries has allowed us to determine how best to measure the semantic distance between languages from...

متن کامل

Creating Reverse Bilingual Dictionaries

Bilingual dictionaries are expensive resources and not many are available when one of the languages is resource-poor. In this paper, we propose algorithms for creation of new reverse bilingual dictionaries from existing bilingual dictionaries in which English is one of the two languages. Our algorithms exploit the similarity between word-concept pairs using the English Wordnet to produce revers...

متن کامل

Bilingual emb e ddings with random walks over multilingual wordnets

Bilingual word embeddings represent words of two languages in the same space, and allow to transfer knowledge from one language to the other without machine translation. The main approach is to train monolingual embeddings first and then map them using bilingual dictionaries. In this work, we present a novel method to learn bilingual embeddings based on multilingual knowledge bases (KB) such as...

متن کامل

Improving Calculation of Contextual Similarity for Constructing a Bilingual Dictionary via a Third Language

A novel method is proposed for measuring contextual similarity by “weighted overlapping ratio (WOR)” to construct a bilingual dictionary of a new language pair from two bilingual dictionaries sharing one language. The WOR alleviates the effect of a noisy seed dictionary resulting from merger of two bilingual dictionaries via a third language. Combined use of two word-association measures for ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010