Transformation-Based Information Extraction Using Learned Meta-rules
نویسنده
چکیده
Information extraction (IE) is a form of shallow text understanding that locates specific pieces of data in natural language documents. Although automated IE systems began to be developed using machine learning techniques recently, the performances of those IE systems still need to be improved. This paper describes an information extraction system based on transformation-based learning, which uses learned meta-rules on patterns for slots. We plan to empirically show these techniques improve the performance of the underlying information extraction system by running experiments on a corpus of IT resumé documents collected from Internet newsgroups.
منابع مشابه
META-DARE: Monitoring the Minimally Supervised ML of Relation Extraction Rules
This paper demonstrates a web-based online system, called META-DARE1. META-DARE is built to assist researchers to obtain insights into seed-based minimally supervised machine learning for relation extraction. META-DARE allows researchers and students to conduct experiments with an existing machine learning system called DARE (Xu et al., 2007). Users can run their own learning experiments by con...
متن کاملLearning Transformation Rules for Semantic Parsing
This paper presents an approach for inducing transformation rules that map natural-language sentences into a formal semantic representation language. The approach assumes a formal grammar for the target representation language and learns transformation rules that exploit the non-terminal symbols in this grammar. Patterns for the transformation rules are learned using an induction algorithm base...
متن کاملPredicate Argument Structure Analysis Using Transformation Based Learning
Maintaining high annotation consistency in large corpora is crucial for statistical learning; however, such work is hard, especially for tasks containing semantic elements. This paper describes predicate argument structure analysis using transformation-based learning. An advantage of transformation-based learning is the readability of learned rules. A disadvantage is that the rule extraction pr...
متن کاملDefinition of a Computing Independent Model and Rules for Transformation Focused on the Model-View-Controller Architecture
This paper presents a model-oriented development approach to software development in the Model-View-Controller (MVC) architectural standard. This approach aims to expose a process of extractions of information from the models, in which through rules and syntax defined in this work, assists in the design of the initial model and its future conversions. The proposed paper presents a syntax based ...
متن کاملTransformation-based correction of rule-based MT
We present a pilot study for using transformation-based learning for automatic correction of rule-based machine translation. Correction rules are learned based on a parallel corpus of machine translations from a commercial machine translation system and a human-corrected version of these translations. The correction rules exploit information on word forms and part of speech. The experiment resu...
متن کامل