Unimodal and Multimodal Representation Training for Relation Extraction

نویسندگان

چکیده

Abstract Multimodal integration of text, layout and visual information has achieved SOTA results in visually rich document understanding (VrDU) tasks, including relation extraction (RE). However, despite its importance, evaluation the relative predictive capacity these modalities is less prevalent. Here, we demonstrate value shared representations for RE tasks by conducting experiments which each data type iteratively excluded during training. In addition, text are evaluated isolation. While a bimodal approach performs best (F1 = 0.684), show that most important single predictor entity relations. Additionally, geometry highly may even be feasible unimodal approach. Despite being effective, highlight circumstances where can bolster performance. total, our efficacy training joint RE.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adversarial Training for Relation Extraction

Adversarial training is a mean of regularizing classification algorithms by generating adversarial noise to the training data. We apply adversarial training in relation extraction within the multi-instance multi-label learning framework. We evaluate various neural network architectures on two different datasets. Experimental results demonstrate that adversarial training is generally effective f...

متن کامل

Multimodal Versus Unimodal Instructions

This module provides an overview of multimodal perception, including information Your nose might even be stimulated by the smell of burning rubber or gasoline. In other words, how does the perceptual system determine which unimodal between the two balls that then bounce off each other in opposite directions. Principles and heuristics for designing minimalist instruction. H Van der Multimodal ve...

متن کامل

Self-Crowdsourcing Training for Relation Extraction

One expensive step when defining crowdsourcing tasks is to define the examples and control questions for instructing the crowd workers. In this paper, we introduce a self-training strategy for crowdsourcing. The main idea is to use an automatic classifier, trained on weakly supervised data, to select examples associated with high confidence. These are used by our automatic agent to explain the ...

متن کامل

Cortical plasticity induced by short-term unimodal and multimodal musical training.

Learning to play a musical instrument requires complex multimodal skills involving simultaneous perception of several sensory modalities: auditory, visual, somatosensory, as well as the motor system. Therefore, musical training provides a good and adequate neuroscientific model to study multimodal brain plasticity effects in humans. Here, we investigated the impact of short-term unimodal and mu...

متن کامل

Unimodal & Multimodal Biometric Recognition Techniques A Survey

Biometric recognition refers to an automatic recognition of individuals based on a feature vector(s) derived from their physiological and/or behavioral characteristic. Biometric recognition systems should provide a reliable personal recognition schemes to either confirm or determine the identity of an individual. These features are used to provide an authentication for computer based security s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Communications in computer and information science

سال: 2023

ISSN: ['1865-0937', '1865-0929']

DOI: https://doi.org/10.1007/978-3-031-26438-2_35