Computing semantic similarity from bilingual dictionaries
نویسندگان
چکیده
In this paper, we address the task of calculating monoand bilingual semantic similarity. We introduce a method that, in order to arrive at a measure of semantic relatedness, exploits the information implicitly contained in bilingual dictionaries. Through experiments we show that our method performs well, with a performance comparable to approaches based on hierarchical knowledge bases and corpus statistics. The advantage of our approach is that it solely relies on easily available bilingual dictionaries and that it is capable of computing monoand bilingual semantic relatedness at the same time.
منابع مشابه
On multiword lexical units and their role in maritime dictionaries
Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...
متن کاملMeasuring the Semantic Distance between Languages from a Statistical Analysis of Bilingual Dictionaries
A bilingual dictionary is a valuable linguistic resource which records, among other things, the di erences in the segmentation of semantic space by the two languages and hence the di culty in producing faithful translations between the two languages. Statistical analysis of nearly a hundred dictionaries has allowed us to determine how best to measure the semantic distance between languages from...
متن کاملCreating Reverse Bilingual Dictionaries
Bilingual dictionaries are expensive resources and not many are available when one of the languages is resource-poor. In this paper, we propose algorithms for creation of new reverse bilingual dictionaries from existing bilingual dictionaries in which English is one of the two languages. Our algorithms exploit the similarity between word-concept pairs using the English Wordnet to produce revers...
متن کاملBilingual emb e ddings with random walks over multilingual wordnets
Bilingual word embeddings represent words of two languages in the same space, and allow to transfer knowledge from one language to the other without machine translation. The main approach is to train monolingual embeddings first and then map them using bilingual dictionaries. In this work, we present a novel method to learn bilingual embeddings based on multilingual knowledge bases (KB) such as...
متن کاملImproving Calculation of Contextual Similarity for Constructing a Bilingual Dictionary via a Third Language
A novel method is proposed for measuring contextual similarity by “weighted overlapping ratio (WOR)” to construct a bilingual dictionary of a new language pair from two bilingual dictionaries sharing one language. The WOR alleviates the effect of a noisy seed dictionary resulting from merger of two bilingual dictionaries via a third language. Combined use of two word-association measures for ex...
متن کامل