Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning

نویسندگان

  • Meng Qu
  • Xiang Ren
  • Yu Zhang
  • Jiawei Han
چکیده

Extracting relations from text corpora is an important task in text mining. It becomes particularly challenging when focusing on weakly-supervised relation extraction, that is, utilizing a few relation instances (i.e., a pair of entities and their relation) as seeds to extract more instances from corpora. Existing distributional approaches leverage the corpus-level co-occurrence statistics of entities to predict their relations, and require large number of labeled instances to learn e‚ective relation classi€ers. Alternatively, paŠern-based approaches perform bootstrapping or apply neural networks to model the local contexts, but still rely on large number of labeled instances to build reliable models. In this paper, we study integrating the distributional and paŠern-based methods in a weakly-supervised seŠing, such that the two types of methods can provide complementary supervision for each other to build an e‚ective, uni€ed model. We propose a novel co-training framework with a distributional module and a paŠern module. During training, the distributional module helps the paŠern module discriminate between the informative paŠerns and other paŠerns, and the paŠern module generates some highly-con€dent instances to improve the distributional module. Œe whole framework can be e‚ectively optimized by iterating between improving the paŠern module and updating the distributional module. We conduct experiments on two tasks: knowledge base completion with text corpora and corpus-level relation extraction. Experimental results prove the e‚ectiveness of our framework in the weakly-supervised seŠing. ACM Reference format: Meng ‹1, Xiang Ren2, Yu Zhang1, Jiawei Han1. 2016. Weakly-supervised Relation Extraction by PaŠern-enhanced Embedding Learning. In Proceedings of , , , 10 pages. DOI: 10.1145/nnnnnnn.nnnnnnn

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Fine-grained Relations from Chinese User Generated Categories

User generated categories (UGCs) are short texts that reflect how people describe and organize entities, expressing rich semantic relations implicitly. While most methods on UGC relation extraction are based on pattern matching in English circumstances, learning relations from Chinese UGCs poses different challenges due to the flexibility of expressions. In this paper, we present a weakly super...

متن کامل

Chinese Hypernym-Hyponym Extraction from User Generated Categories

Hypernym-hyponym (“is-a”) relations are key components in taxonomies, object hierarchies and knowledge graphs. While there is abundant research on is-a relation extraction in English, it still remains a challenge to identify such relations from Chinese knowledge sources accurately due to the flexibility of language expression. In this paper, we introduce a weakly supervised framework to extract...

متن کامل

A convex relaxation for weakly supervised relation extraction

A promising approach to relation extraction, called weak or distant supervision, exploits an existing database of facts as training data, by aligning it to an unlabeled collection of text documents. Using this approach, the task of relation extraction can easily be scaled to hundreds of different relationships. However, distant supervision leads to a challenging multiple instance, multiple labe...

متن کامل

Multi-Task Transfer Learning for Weakly-Supervised Relation Extraction

Creating labeled training data for relation extraction is expensive. In this paper, we study relation extraction in a special weakly-supervised setting when we have only a few seed instances of the target relation type we want to extract but we also have a large amount of labeled instances of other relation types. Observing that different relation types can share certain common structures, we p...

متن کامل

Relation Extraction with Weak Supervision and Distributional Semantics

Relation Extraction aims at detecting and categorizing semantic relations between pairs of entities in unstructured text. It benefits an enormous number of applications such as Web search and Question Answering. Traditional approaches for relation extraction either rely on learning from a large number of accurate human-labeled examples or pattern matching with hand-crafted rules. These resource...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017