Explainie -explaining Information Extraction
نویسندگان
چکیده
Business Intelligence (BI) over unstructured text is under intense scrutiny both in the industry and research. Recent work in this field includes automatic integrating of unstructured text into business analytics, model recognition, and probabilistic databases to handle uncertainty of Information Extraction (IE). However, still an open issue is how to handle IE quality, which is a part of ETL like process for the BI. Precision of IE is still too low for BI and, according to Sunita Sarawagi in recent survey on IE, we are still far from a comprehensive quality model for IE. Currently the BI user has neither methodology nor tools, which would help him to discover if the result is an unexpected fact or an error in IE. In this work we present preliminary results on developing methodology and tool (ExplainIE), which helps users to debug unexpected results. ExplainIE presents results within BI tool and auxiliary view on low level detail (e.g., entity graph). We consider two kinds of users: BI and IE expert.
منابع مشابه
On the provenance of non-answers to queries over extracted data
In information extraction, uncertainty is ubiquitous. For this reason, it is useful to provide users querying extracted data with explanations for the answers they receive. Providing the provenance for tuples in a query result partially addresses this problem, in that provenance can explain why a tuple is in the result of a query. However, in some cases explaining why a tuple is not in the resu...
متن کاملPortable Extraction of Partially Structured Facts from the Web
A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract useful partially structured facts about different kinds of entities in a broad domain, i.e. all kinds of places depicted in tourist images. Importantly the approach does not rely on existing linguistic resources (gazettee...
متن کاملPortable Extraction of Partially Structured Facts from the Web
A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract useful partially structured facts about different kinds of entities in a broad domain, i.e. all kinds of places depicted in tourist images. Importantly the approach does not rely on existing linguistic resources (gazettee...
متن کاملPotential and Limitations of Information Extraction on the Terrestrial Biosphere from Satellite Remote Sensing
T h e extraction of infornmtion on terrestrial environments from satellite observations requires the use of quantitative models to interpret the radiation data collected in space. Several approaches are feasible, ranging from the development of models capable of explaining the nature of the measured physical signal or of characterizing the state of the system under observation, to the establish...
متن کاملExplaining Semantic Web Applications
In this chapter, we introduce the concept of explanation for Semantic Web applications by providing motivation, description, and examples. We describe the Inference Web explanation toolkit that provides support for a broad range of explanation tasks ranging from explaining deductive reasoning, to information extraction, to hybrid integrated learning systems. We argue that an explanation solutio...
متن کامل