Possissive Pronominal Anaphor Resolution in Portuguese Written Texts
نویسندگان
چکیده
This paper describes a proposal for Portuguese possessive pronominal anaphor (PPA) resolution, a problem little considered so far. Particularly, we address the problem of Portuguese 3rd person intrasentential PPAs seu/sua/seus/suas (his/her/their/its, for human and non-human subjects in English), which constitute 30% of pronominal occurrences in our corpus (Brazilian laws about environment protection). Considering some differences between PPAs and other kinds of anaphors, such as personal or demonstrative pronouns, we define three knowledge sources (KSs) for PPA resolution: surface patterns (taking in account factors such as syntactic parallelism), possessive relationship rules and sentence centering. These knowledge sources are organized in a blackboard architecture for PPA resolution, which provides both knowledge and procedure distribution among autonomous entities (reflexive agents), each of them specialized in a particular aspect of the problem solving. The proposal has been implemented and its results are discussed at the end of the work.
منابع مشابه
Anaphor Resolution in Unrestricted Texts with Partial Parsing
In this paper we deal with several kinds of anaphora in unrestricted texts. These kinds of anaphora are pronominal references, surfacecount anaphora and one-anaphora. In order to solve these anaphors we work on the output of a part-of-speech tagger, on which we automatically apply a partial parsing from the formalism: Slot Unification Grammar, which has been implemented in Prolog. We only use t...
متن کاملExtending DRT With A Focusing Mechanism For Pronominal Anaphora And Ellipsis Resolution
Cormack (1992) proposed a framework for pronominal anaphora resolution. Her proposal integrates focusing theory (Sidner et al.) and DRT (Kamp and Reyle). We analyzed this methodology and adjusted it to the processing of Portuguese texts. The scope of the framework was widened to cover sentences containing restrictive relative clauses and subject ellipsis. Tests were conceived and applied to pro...
متن کاملA Machine Learning Approach to Portuguese Pronoun Resolution
Anaphora resolution is an essential component of most NLP applications, from text understanding to Machine Translation. In this work we discuss a supervised machine learning approach to the problem, focusing on instances of anaphora ubiquitously found in a corpus of Brazilian Portuguese texts, namely, third-person pronominal references. Although still limited to a subset of the more general co-...
متن کاملConditional Random Fields based Pronominal Resolution in Tamil
This paper deals with Tamil pronominal resolution using Conditional Random Fields a machine learning approach. A detailed linguistic analysis of Tamil pronominals and its antecedence occurring in various syntactic constructs is done, which led to the selection of appropriate features for CRF approach. The syntactic features thus identified made the system learn most frequently occurring pronoun...
متن کاملZAC: Zero Anaphora Corpus A Corpus for Zero Anaphora Resolution in Portuguese
This paper describes a corpus of Brazilian Portuguese texts built in view of the construction of an Anaphora Resolution system, which is part of a fully-fledged Natural Language Processing system (STRING). The ZAC corpus is aimed at the resolution of the so-called zero-anaphora, that is, an anaphora relation where the anaphoric expression (or anaphor) has been zeroed The paper briefly discusses...
متن کامل