Oriya Multiword Chunking using Lexical knowledge base of verbs
نویسندگان
چکیده
The multiword chunking is otherwise thought of as the shallow parsing technique which identifies the multiword chunks and their interdependencies. The paper presents the proposed solution to the problem. Here we have designed the model of the proposed syntactic processor which uses lexical knowledge base of verbs for identifying intra chunk boundaries and finally forming the inter dependencies graph. The lexical knowledge base of verbs is used as there is basic assumption that verbs determine the semantics of a sentence and its surface realization is based on this fact. Verbs impose syntactic restriction over the grammatical representation of the sentence. Further NLP tasks are based on the relations that verbs established with the other words. Several hands on experiments has been conducted to establish the fact that the performance of present scheme exceeds the rule based scheme Key w ords: MWC analyzer, inflection, Tagging, Dependency graphs, free word order, semantic role labeling
منابع مشابه
Automatic Rule Acquisition for Chinese Intra-chunk Relations
Multiword chunking is defined as a task to automatically analyze the external function and internal structure of the multiword chunk(MWC) in a sentence. To deal with this problem, we proposed a rule acquisition algorithm to automatically learn a chunk rule base, under the support of a large scale annotated corpus and a lexical knowledge base. We also proposed an expectation precision index to o...
متن کاملAccounting for Contiguous Multiword Expressions in Shallow Parsing
In this paper, we focus on chunking including contiguous multiword expression recognition, namely super-chunking. In particular, we present different strategies to improve a superchunker based on Conditional Random Fields by combining it with a finite-state symbolic super-chunker driven by lexical and grammatical resources. We display a substantial gain of 7.6 points in terms of overall accuracy.
متن کاملShallow morphology based complex predicates extraction in Oriya
This paper presents the extraction of Complex Predicates (CPs) in Oriya based on shallow morphology and available seed lists of verbs. Generally Oriya language is a free word order language. Free word order languages have relatively unrestricted local word group or phrase structures that make the problem of complex predicates extraction quite challenging. The complex predicates are generally th...
متن کاملOn multiword lexical units and their role in maritime dictionaries
Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...
متن کاملMULTILINGUAL MULTIWORD EXPRESSIONS Literature Survey
Multiword Expressions are idiosyncratic word usages of a language which often have noncompositional meaning. The knowledge of multiword expressions is necessary for many NLP tasks like, machine translation, natural language generation, named entity recognition, sentiment analysis etc. In order for other NLP applications to benefit from the knowledge of multiword expressions, they need to be ide...
متن کامل