Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations

نویسندگان

Kata Gábor

Marianna Apidianaki

Benoît Sagot

Éric Villemonte de la Clergerie

چکیده

An important trend in recent works on lexical semantics has been the development of learning methods capable of extracting semantic information from text corpora. The majority of these methods are based on the distributional hypothesis of meaning and acquire semantic information by identifying distributional patterns in texts. In this article, we present a distributional analysis method for extracting nominalization relations from monolingual corpora. The acquisition method makes use of distributional and morphological information to select nominalization candidates. We explain how the learning is performed on a dependency annotated corpus and describe the nominalization results. Furthermore, we show how these results served to enrich an existing lexical resource, the WOLF (Wordnet Libre du Français). We present the techniques that we developed in order to integrate the new information into WOLF, based on both its structure and content. Finally, we evaluate the validity of the automatically obtained information and the correctness of its integration into the semantic resource. The method proved to be useful for boosting the coverage of WOLF and presents the advantage of filling verbal synsets, which are particularly difficult to handle due to the high level of verbal polysemy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AnCora-Nom: A Spanish Lexicon of Deverbal Nominalizations

This paper describes a new lexical resource: Ancora-Nom, a Spanish lexicon of deverbal nominalizations. At present, it contains 1,655 lexical entries and 3,094 senses. Each sense has a denotation type associated, and the mapping of nominal complements with arguments and the corresponding theta roles is also annotated. A particular interest of this lexicon is that it has been automatically extra...

متن کامل

AD -Classifier: Automatically Assigning Denotation Types to ominalizations

This paper presents the ADN-Classifier, an Automatic classification system of Spanish Deverbal Nominalizations aimed at identifying its semantic denotation (i.e. event, result, underspecified, or lexicalized). The classifier can be used for NLP tasks such as coreference resolution or paraphrase detection. To our knowledge, the ADN-Classifier is the first effort in acquisition of denotations for...

متن کامل

NomLex-PT: A Lexicon of Portuguese Nominalizations

This paper presents NomLex-PT, a lexical resource describing Portuguese nominalizations. NomLex-PT connects verbs to their nominalizations, thereby enabling NLP systems to observe the potential semantic relationships between the two words when analysing a text. NomLex-PT is freely available and encoded in RDF for easy integration with other resources. Most notably, we have integrated NomLex-PT ...

متن کامل

Nominalizaciones deverbales: denotación y estructura argumental

Spanish deverbal nominalizations are linguistic constructions characterized by presenting properties of common nouns but also by inheriting the argument structure of the verbs from which they derive. This duality aroused considerable interest in deverbal nominalizations in Linguistics. On the one hand, they can denote both the state or the result of the action expressed by the corresponding bas...

متن کامل

The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser

We describe the automatic acquisition of a lexicon of verb subcategorisations from a domain-specific corpus, and an evaluation of the impact this lexicon has on the performance of a “deep”, HPSG parser of English. We conducted two experiments to determine whether the empirically extracted verb stems would enhance the lexical coverage of the grammar and to see whether the automatically extracted...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations

نویسندگان

چکیده

منابع مشابه

AnCora-Nom: A Spanish Lexicon of Deverbal Nominalizations

AD -Classifier: Automatically Assigning Denotation Types to ominalizations

NomLex-PT: A Lexicon of Portuguese Nominalizations

Nominalizaciones deverbales: denotación y estructura argumental

The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser

عنوان ژورنال:

اشتراک گذاری