ExplainIE - Explaining Information Extraction

Authors

  • Wojciech Barczynski
  • Falk Brauer
  • Adrian Mocan
Abstract

Business Intelligence (BI) over unstructured text is under intense scrutiny in both industry and research. Recent work in this field includes the automatic integration of unstructured text into business analytics, model recognition, and probabilistic databases that handle the uncertainty of Information Extraction (IE). However, how to handle IE quality, which is part of the ETL-like process for BI, remains an open issue. The precision of IE is still too low for BI and, according to Sunita Sarawagi's recent survey on IE, we are still far from a comprehensive quality model for IE. Currently, the BI user has neither a methodology nor tools to help determine whether a result is an unexpected fact or an IE error. In this work, we present preliminary results on developing a methodology and a tool (ExplainIE) that help users debug unexpected results. ExplainIE presents results within a BI tool, together with an auxiliary view of low-level details (e.g., an entity graph). We consider two kinds of users: BI and IE experts.

Similar resources

On the provenance of non-answers to queries over extracted data

In information extraction, uncertainty is ubiquitous. For this reason, it is useful to provide users querying extracted data with explanations for the answers they receive. Providing the provenance for tuples in a query result partially addresses this problem, in that provenance can explain why a tuple is in the result of a query. However, in some cases explaining why a tuple is not in the resu...

Portable Extraction of Partially Structured Facts from the Web

A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract useful partially structured facts about different kinds of entities in a broad domain, i.e. all kinds of places depicted in tourist images. Importantly the approach does not rely on existing linguistic resources (gazettee...

Potential and Limitations of Information Extraction on the Terrestrial Biosphere from Satellite Remote Sensing

The extraction of information on terrestrial environments from satellite observations requires the use of quantitative models to interpret the radiation data collected in space. Several approaches are feasible, ranging from the development of models capable of explaining the nature of the measured physical signal or of characterizing the state of the system under observation, to the establish...

Explaining Semantic Web Applications

In this chapter, we introduce the concept of explanation for Semantic Web applications by providing motivation, description, and examples. We describe the Inference Web explanation toolkit that provides support for a broad range of explanation tasks ranging from explaining deductive reasoning, to information extraction, to hybrid integrated learning systems. We argue that an explanation solutio...

Publication date: 2009