Idiomatic Expression Identification using Semantic Compatibility


Abstract

Idiomatic expressions are an integral part of natural language and are constantly being added to a language. Owing to their non-compositionality and their ability to take on a figurative or literal meaning depending on the sentential context, they have been a classical challenge for NLP systems. To address this challenge, we study the task of detecting whether a sentence has an idiomatic expression and localizing it when it occurs in a figurative sense. Prior research on this task has studied specific classes of idiomatic expressions, offering limited views of their generalizability to new idioms. We propose a multi-stage neural architecture with attention flow as a solution. The network effectively fuses contextual and lexical information at different levels using word and sub-word representations. Empirical evaluations on three of the largest benchmark datasets, with idiomatic expressions of varied syntactic patterns and degrees of non-compositionality, show that our proposed model achieves state-of-the-art results. A salient feature of the model is its ability to identify idioms unseen during training, with gains from 1.4% to 30.8% over competitive baselines on the largest dataset.


Similar resources

Type-based Search of Idiomatic Expression

This paper presents an evaluation of different approaches to extracting verb-noun idiomatic expressions in Czech. These approaches are based on the structure of the idiom and its behavior in language. PMI and syntactic and lexical fixedness, modified using VerbaLex and a generated thesaurus, provide a useful tool for choosing the best idiomatic candidates for manual annotation and evaluation. Moreover, we focus...


Improving Pronoun Resolution Using Statistics-Based Semantic Compatibility Information

In this paper we focus on how to improve pronoun resolution using statistics-based semantic compatibility information. We investigate two unexplored issues that influence the effectiveness of such information: statistics source and learning framework. Specifically, we propose for the first time to utilize the web and the twin-candidate model, in addition to the previous combination of the co...


Clitic incorporation and abstract semantic objects in idiomatic constructions

This paper analyses inherent clitics of idiomatic constructions as verbal arguments (Jelinek 1984, Baker 1996, Hale 2003) that are translated as free variables (Delfitto 2002): they are anaphoric to a (hidden) non-referential discourse topic or (right) dislocated constituent. Furthermore, since they denote abstract semantic objects (Asher 1993), they are assumed to be semantically incorporated ...


Unsupervised Type and Token Identification of Idiomatic Expressions

Idiomatic expressions are plentiful in everyday language, yet they remain mysterious, as it is not clear exactly how people learn and understand them. They are of special interest to linguists, psycholinguists, and lexicographers, mainly because of their syntactic and semantic idiosyncrasies as well as their unclear lexical status. Despite a great deal of research on the properties of idioms in...


Study of Compatibility Relationships Among Some Almond Cultivars and Genotypes Using of S-Alleles Identification

Almond (Prunus dulcis L.) is one of the most important nut crops in Iran. Most almond cultivars and genotypes are self-incompatible. However, research on S-alleles indicates that it is very efficient in cultivar selection. Self-incompatibility in almond is gametophytic and controlled by a single S-locus with multiple codominant alleles. In this study, compatibility relationships among cultivars,...



Journal

Journal title: Transactions of the Association for Computational Linguistics

Year: 2021

ISSN: 2307-387X

DOI: https://doi.org/10.1162/tacl_a_00442