Updating LSP dictionaries with collocational information
نویسنده
چکیده
Despite the big amount of general language dictionaries in electronic form, those coming from specialised areas are still “under construction”. There are two main reasons for this: firstly, the need for these dictionaries was/is less essential than the need for general language dictionaries, since these were/are aiming mainly to specialists, and secondly many specialised areas are changing over time, resulting to dictionaries that need continuing updating. Due to this, techniques that improve the automatic or semiautomatic construction and updating of specialised dictionaries are and will always be welcome. In this work we are concerned with the updating of dictionaries for Languages for Special Purposes (LSPs) with information coming from collocations. The collocations to be used are extracted from LSP corpora of not necessarily big size.
منابع مشابه
Collocational Clashes in the Persian Translations of Tuesdays with Morrie
This study aimed at finding features of collocational deviations in the translations of Tuesdays with Mor- rie. In this direction, categories of collocations and collocational clashes, as well as causes of collocation- al clashes were explored. The present work investigated five Persian translations of the novel. All the books were examined completely and all possible collocational clashes were...
متن کاملExtending WordNet with Fine-Grained Collocational Information via Supervised Distributional Learning
WordNet is probably the best known lexical resource in Natural Language Processing. While it is widely regarded as a high quality repository of concepts and semantic relations, updating and extending it manually is costly. One important type of relation which could potentially add enormous value to WordNet is the inclusion of collocational information, which is paramount in tasks such as Machin...
متن کاملSense Tagging: Don't Look for the Meaning but for the Use
Automatic sense-tagging is one of the next challenges that corpus linguists have to face. So far, results are modest, despite numerous efforts, and sense-tagging appears as a vexing task. Difficulties stem from various sources, and in particular the extraction of disambiguating information from the context. However, one of the main problems comes upstream of the disambiguating process and lies ...
متن کاملSignificant Triples: Adjective+Noun+Verb Combinations
We investigate the identification and, to some extent, the classification of collocational word groups that consist of an adjectival modifier (A), an accusative object noun (N), and a verb (V) by means of parsing a newspaper corpus with a lexicalized probabilistic grammar.1 Data triples are extracted from the resulting Viterbi parses and subsequently evaluated by means of a statistical associat...
متن کاملApplying a Hybrid Query Translation Method to Japanese/English Cross-Language Patent Retrieval
This paper applies an existing query translation method to cross-language patent retrieval. In our method, multiple dictionaries are used to derive all possible translations for an input query, and collocational statistics are used to resolve translation ambiguity. We used Japanese/English parallel patent abstracts to perform comparative experiments, where our method outperformed a simple dicti...
متن کامل