A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model


Abstract:

Information extraction (IE) is the process of automatically producing a structured representation from unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP), intensified by the growing volume, heterogeneity, and unstructured form of available information. One of the core IE tasks is relation extraction, which aims to extract semantic relations among entities from natural language text. Traditional relation extraction techniques were relation-specific, producing new instances of relations determined a priori. While effective, this model is not applicable when the relations are not defined a priori or when the number of relations is high. Open Relation Extraction (ORE) methods were developed to elicit instances of arbitrary relations while requiring fewer training examples. Since ORE systems are employed by applications that depend on large-scale relation extraction, high performance and low computational cost are major requirements for ORE methods. This is particularly important at large scales such as the Web. Many open information extraction (OIE) systems have been proposed in recent years. These approaches range from shallow (such as part-of-speech tagging) to deep (such as semantic role labeling), and therefore differ in their performance and computational cost. In this paper, we use state-of-the-art shallow NLP tools to extract instances of relations. We present a supervised log-linear model for OIE that exploits the advantages of shallow NLP tools, which are fast and keep computation time low. The extractor, which is the core of the proposed approach, integrates a high-performance subset of the shallow NLP tools with the strength of the deep NLP tools by means of a supervised log-linear model, yielding a high-performance method that is scalable. This makes efficient use of time, thereby reducing computational cost and increasing precision. The proposed approach achieves higher precision and recall than ReVerb, one of the most successful shallow OIE systems.
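As a rough illustration of what a supervised log-linear model over shallow features can look like, the sketch below scores candidate relation phrases with a binary log-linear (logistic) classifier. It is not the paper's extractor: the feature set, the `rel_pos` field, the toy training data, and the hyperparameters are all assumptions made for this example.

```python
import math

def features(candidate):
    """Hypothetical shallow features for a candidate relation phrase."""
    rel_tags = candidate["rel_pos"]  # POS tags of the relation phrase (assumed input)
    return [
        1.0,                                                        # bias term
        1.0 if rel_tags and rel_tags[0].startswith("VB") else 0.0,  # starts with a verb
        1.0 if rel_tags and rel_tags[-1] == "IN" else 0.0,          # ends with a preposition
        min(len(rel_tags), 10) / 10.0,                              # normalised phrase length
    ]

def score(weights, feats):
    """Log-linear score: P(correct | x) = sigmoid(w . f(x))."""
    z = sum(w * f for w, f in zip(weights, feats))
    return 1.0 / (1.0 + math.exp(-z))

def train(data, epochs=200, lr=0.5):
    """Fit the weights by stochastic gradient ascent on the log-likelihood."""
    weights = [0.0] * len(features(data[0][0]))
    for _ in range(epochs):
        for candidate, label in data:
            feats = features(candidate)
            error = label - score(weights, feats)
            weights = [w + lr * error * f for w, f in zip(weights, feats)]
    return weights

# Toy labelled candidates (invented for illustration): 1 = valid relation phrase.
toy_data = [
    ({"rel_pos": ["VBD", "IN"]}, 1),
    ({"rel_pos": ["VBZ"]}, 1),
    ({"rel_pos": ["NN", "NN", "NN", "NN", "NN", "NN"]}, 0),
    ({"rel_pos": ["JJ", "NN", "CC", "NN"]}, 0),
]

weights = train(toy_data)
good = score(weights, features({"rel_pos": ["VBD", "IN"]}))
bad = score(weights, features({"rel_pos": ["NN", "NN", "NN", "NN", "NN", "NN"]}))
```

Because every feature here comes from part-of-speech tags alone, scoring a candidate costs a handful of multiplications, which is the kind of low per-sentence cost the abstract attributes to shallow tools.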


Similar resources

A New Type-II Fuzzy Logic Based Controller for Non-Linear Dynamical Systems with Application to 3-PSP Parallel Robot

Abstract: Type-II fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. Type-II fuzzy logic controllers are, however, newer and more promising approaches that have recently been applied to various fields due to their significant contribution, especially when noise (an important instance of uncertainty) emerges. During the design of type-I fuz...


Improving Open Information Extraction using Domain Knowledge

Open Information Extraction (OIE) aims to identify all the possible assertions within a sentence. Recent and thus the most efficient OIE-tools use the grammatical dependencies or the syntactic tree of the sentence to perform extraction. When they provide a wrong extraction it is mainly due to parsing errors. In this paper, we propose to handle these parsing errors before doing OIE itself. To ac...


A New Cost Model for Estimation of Open Pit Copper Mine Capital Expenditure

One of the most important issues in all stages of a mining study is capital cost estimation. Determining capital expenditure is a challenging issue for mine designers. In the recent decade, quite a few studies have focused on proposing estimation models to predict mining capital cost. However, these efforts have not achieved a predictor model with a reliable range of error. Both of ov...


A METHOD FOR SOLVING FUZZY LINEAR SYSTEMS

In this paper we present a method for solving fuzzy linear systems by two crisp linear systems. Also, necessary and sufficient conditions for the existence of a solution are given. Some numerical examples illustrate the efficiency of the method.


Improving 3-D Imaging Breast Cancer Diagnosis Systems Using a New Method for Placement of Near-Infrared Sources and Detectors

The objective of this research was to improve a 3-D imaging system that uses near-infrared light emission in breast tissue, in order to achieve more accurate tumor diagnosis. Repeated experiments in this research have shown that, with this imaging system, more accurate diagnosis of an abnormal area depends on the location of the sources and detectors. Therefore, an optimal location model has bee...




Journal title

Volume 16, Issue 1

Pages 3-20

Publication date 2019-06


