Acquiring Predicate-Argument Mapping Information From Multilingual Texts
نویسندگان
چکیده
This paper discusses automatic acquisition of predicate-argument mapping information from multilingual texts. The lexicon of our NLP system abstracts the language-dependent portion of predicate-argument mapping information from the core meaning of verb senses (i.e. semantic concepts as defined in the knowledge base). We represent this mapping information in terms of cross-linguistically generalized mapping types called situation types and word sense-specific idiosyncrasies. This representation has enabled us to automatically acquire predicate-argument mapping information, specifically situation types and idiosyncrasies, for verbs in English, Spanish, and Japanese texts.
منابع مشابه
Three-Level Knowledge Representation of Predicate-Argument Mapping for Multilingual Lexicons
Constructing a semantic concept hierarchy covers only part of the problem in building any knowledgebased natural language processing (NLP) system: Language understanding requires mapping from syntactic structures into conceptual (i.e. interlingua) representation (henceforth verb-argument mapping), while language generation requires the inverse mapping. Jacobs [13] proposed a solution in which h...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملAcquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
We present a method for acquiring reliable predicate-argument structures from raw corpora for automatic compilation of case frames. Such lexicon compilation requires highly reliable predicate-argument structures to practically contribute to Natural Language Processing (NLP) applications, such as paraphrasing, text entailment, and machine translation. We first apply chunking to raw corpora and t...
متن کاملAcquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Case Frames
This paper proposes a method for automatically acquiring strongly-related events from a large corpus using predicateargument co-occurring statistics and case frames. The strongly-related events are acquired in the form of strongly-related two predicates with their relevant arguments. First, strongly-related events are acquired from predicate-argument cooccurring statistics. Then, the remaining ...
متن کاملA multilingual semantic dependency parser for texts of legislation in the domain of Value Added Tax (VAT)
Semantic parsing is a rapidly growing subeld of computational linguistics, driven by the seemingly endless potential of natural language processing (NLP) in combination with an increasing demand for applications guiding and aiding humans in a variety of language related tasks. As a pilot study, this project investigated whether it was possible to create a classi er that correctly detected and l...
متن کامل