Idiomatic Expression Identification using Semantic Compatibility
Authors
Abstract
Idiomatic expressions are an integral part of natural language and are constantly being added to a language. Owing to their non-compositionality and their ability to take on a figurative or literal meaning depending on the sentential context, they have been a classical challenge for NLP systems. To address this challenge, we study the task of detecting whether a sentence has an idiomatic expression and localizing it when it occurs in a figurative sense. Prior research on this task has studied specific classes of idiomatic expressions, offering limited views of their generalizability to new idioms. We propose a multi-stage neural architecture with attention flow as a solution. The network effectively fuses contextual and lexical information at different levels using word and sub-word representations. Empirical evaluations on three of the largest benchmark datasets, with idiomatic expressions of varied syntactic patterns and degrees of non-compositionality, show that our proposed model achieves state-of-the-art results. A salient feature of the model is its ability to identify idioms unseen during training, with gains from 1.4% to 30.8% over competitive baselines on the largest dataset.
Similar resources
Type-based Search of Idiomatic Expression
This paper presents an evaluation of different approaches to extract verb-noun idiomatic expressions in Czech. These approaches are based on the structure of the idiom and its behavior in language. PMI and syntactic and lexical fixedness, modified using VerbaLex and a generated thesaurus, provide a useful tool for choosing the best idiomatic candidates for manual annotation and evaluation. Moreover we focus...
Improving Pronoun Resolution Using Statistics-Based Semantic Compatibility Information
In this paper we focus on how to improve pronoun resolution using the statistics-based semantic compatibility information. We investigate two unexplored issues that influence the effectiveness of such information: statistics source and learning framework. Specifically, we for the first time propose to utilize the web and the twin-candidate model, in addition to the previous combination of the co...
Clitic incorporation and abstract semantic objects in idiomatic constructions
This paper analyses inherent clitics of idiomatic constructions as verbal arguments (Jelinek 1984, Baker 1996, Hale 2003) that are translated as free variables (Delfitto 2002): they are anaphoric to a (hidden) non-referential discourse topic or (right) dislocated constituent. Furthermore, since they denote abstract semantic objects (Asher 1993), they are assumed to be semantically incorporated ...
Unsupervised Type and Token Identification of Idiomatic Expressions
Idiomatic expressions are plentiful in everyday language, yet they remain mysterious, as it is not clear exactly how people learn and understand them. They are of special interest to linguists, psycholinguists, and lexicographers, mainly because of their syntactic and semantic idiosyncrasies as well as their unclear lexical status. Despite a great deal of research on the properties of idioms in...
Study of Compatibility Relationships Among Some Almond Cultivars and Genotypes Using of S-Alleles Identification
Almond (Prunus dulcis L.) is one of the most important nut crops in Iran. Most almond cultivars and genotypes are self-incompatible. However, research on S-alleles indicates that it is very efficient in cultivar selection. Self-incompatibility in almond is gametophytic and controlled by a single S-locus with multiple codominant alleles. In this study, compatibility relationships among cultivars,...
Journal
Journal title: Transactions of the Association for Computational Linguistics
Year: 2021
ISSN: 2307-387X
DOI: https://doi.org/10.1162/tacl_a_00442