Wrap - Up : a Trainable DiscourseModule for Information
نویسندگان
چکیده
The vast amounts of on-line text now available have led to renewed interest in information extraction (IE) systems that analyze unrestricted text, producing a structured representation of selected information from the text. This paper presents a novel approach that uses machine learning to acquire knowledge for some of the higher level IE processing. Wrap-Up is a trainable IE discourse component that makes intersentential inferences and identiies logical relations among information extracted from the text. Previous corpus-based approaches were limited to lower level processing such as part-of-speech tagging, lexical disambiguation, and dictionary construction. Wrap-Up is fully trainable, and not only automatically decides what classiiers are needed, but even derives the feature set for each classiier automatically. Performance equals that of a partially trainable discourse module requiring manual customization for each domain.
منابع مشابه
Wrap - Up : a Trainable DiscourseModule
The vast amounts of on-line text now available have led to renewed interest in Information Extraction (IE) systems that analyze unrestricted text, producing a structured representation of selected information from the text. This paper presents a novel approach that uses machine learning to acquire knowledge for some of the higher level IE processing. Wrap-Up is a trainable IE discourse componen...
متن کاملWrap-Up: a Trainable Discourse Module for Information Extraction
The vast amounts of on-line text now available have led to renewed interest in information extraction (IE) systems that analyze unrestricted text, producing a structured representation of selected information from the text. This paper presents a novel approach that uses machine learning to acquire knowledge for some of the higher level IE processing. Wrap-Up is a trainable IE discourse componen...
متن کاملAAAI 1995 Spring Symposium on Empirical Methods in Discourse Interpretation and Generation Learning Domain-Speci c Discourse Rules for Information Extraction
This paper describes a system that learns discourse rules for domain-speci c analysis of unrestricted text. The goal of discourse analysis in this context is to transform locally identi ed references to relevant information in the text into a coherent representation of the entire text. This involves a complex series of decisions about merging coreferential objects, ltering out irrelevant inform...
متن کاملAxial Compressive Strength of Reinforced Concrete Columns Wrapped with Fiber Reinforced Polymers (FRP)
This paper presents the results of a study on the axial compressive strength of columnsstrengthened with FRP wrap. The experimental part of the study included testing 6 reinforcedconcrete columns in two series. The first series comprised three similar circular reinforced concretecolumns strengthened with FRP wrap. The second series consisted of three similar square columns,two with sharp corner...
متن کاملTools and techniques for rapid porting
Charlie Dolan, from Hughes Research Laboratories, discussed some of the difficulties in using trainable components in an information extraction system. The UMass/Hughes system used six different trainable components in their MUC5 system ; portability between the EJV and EME domains was achieved partl y through retraining these components. One of these components, the Trainable Template Generato...
متن کامل