A Generic Anaphora Resolution Engine for Indian Languages
نویسندگان
چکیده
In this paper, we present a generic anaphora engine for Indian languages, which are mostly resource poor languages. We have analysed the similarit ies and variations between pronouns and their agreement with antecedents in Indian languages. The generic algorithm developed uses the morphological richness of Indian languages. The machine learn ing approach uses the features which can handle major Indian languages. We have tested the system with Indo-Aryan and Dravidian languages namely Bengali, Hindi and Tamil. The results are encouraging.
منابع مشابه
A Survey of Anaphora Resolution Approaches for Different Indian Languages
— Anaphora Resolution is a key problem in the field of natural language processing. It is the problem of identifying the referents in the discourse. Pronominal anaphora resolution is a subpart of anaphora resolution where pronouns are referred to noun antecedents. This paper describes the research done in the development of anaphora resolution systems for various Indian Languages.The process of...
متن کاملEvent Anaphora Resolution in Natural Language Processing for Hindi text
This paper presents a comprehensive study about the anaphora resolution. Anaphora resolution can be of any type such as Entity Anaphora resolution and Event Anaphora resolution etc. Event Anaphora resolution is the main focus of this review paper. This resolution of Event anaphora can be different for different languages as the structure of the language changes. Here the literature about anapho...
متن کاملDARE: Deep Anaphora Resolution in Dialogue based Intelligent Tutoring Systems
Anaphora resolution is a central topic in dialogue and discourse processing that deals with finding the referents of pronouns. There are no studies, to the best of our knowledge, that focus on anaphora resolution in the context of tutorial dialogues. In this paper, we present the first version of DARE (Deep Anaphora Resolution Engine), an anaphora resolution engine for dialogue-based Intelligen...
متن کاملConditional Random Fields based Pronominal Resolution in Tamil
This paper deals with Tamil pronominal resolution using Conditional Random Fields a machine learning approach. A detailed linguistic analysis of Tamil pronominals and its antecedence occurring in various syntactic constructs is done, which led to the selection of appropriate features for CRF approach. The syntactic features thus identified made the system learn most frequently occurring pronoun...
متن کاملAR-Engine - a framework for unrestricted co-reference resolution
The paper presents a framework that allows the design, realisation and validation of different anaphora resolution models on real texts. The type of processing implemented by the engine is an incremental one, simulating the reading of texts by humans. Advanced behaviour like postponed resolution and accumulation of values for features of the discourse entities during reading is implemented. Fou...
متن کامل