Adposition Supersenses v2

نویسندگان

  • Nathan Schneider
  • Jena D. Hwang
  • Archna Bhatia
  • Na-Rae Han
  • Vivek Srikumar
  • Tim O'Gorman
  • Omri Abend
چکیده

This document describes in detail an inventory of 50 semantic labels designed to characterize the use of adpositions and case markers at a somewhat coarse level of granularity. Version 2 is a revision of the supersense inventory proposed for English by Schneider et al. (2015, 2016) and documented in PrepWiki1 (henceforth “v1”), which in turn was based on previous schemes. The present inventory was developed after extensive review of the v1 corpus annotations for English, plus previously unanalyzed ’s (genitive case) possessives (Blodgett and Schneider, 2018), as well as consideration of adposition and case phenomena in Hebrew, Hindi, and Korean. Hwang et al. (2017) present the theoretical underpinnings of the v2 scheme. Though the v2 inventory aspires to be universal, this document is specific to English; documentation for other languages will be published separately. The STREUSLE 4.0 corpus containing English annotations according to this scheme will be released at https://github.com/nert-gu/streusle/. http://tiny.cc/prepwiki 1 ar X iv :1 70 4. 02 13 4v 2 [ cs .C L ] 1 6 Ja n 20 18

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Discovery of Adposition Typology

Natural languages (NL) can be classified as prepositional or postpositional based on the order of the noun phrase and the adposition. Categorizing a language by its adposition typology helps in addressing several challenges in linguistics and natural language processing (NLP). Understanding the adposition typologies for less-studied languages by manual analysis of large text corpora can be quit...

متن کامل

A Corpus and Model Integrating Multiword Expressions and Supersenses

This paper introduces a task of identifying and semantically classifying lexical expressions in running text. We investigate the online reviews genre, adding semantic supersense annotations to a 55,000 word English corpus that was previously annotated for multiword expressions. The noun and verb supersenses apply to full lexical expressions, whether singleor multiword. We then present a sequenc...

متن کامل

Supersense Embeddings: A Unified Model for Supersense Interpretation, Prediction, and Utilization

Coarse-grained semantic categories such as supersenses have proven useful for a range of downstream tasks such as question answering or machine translation. To date, no effort has been put into integrating the supersenses into distributional word representations. We present a novel joint embedding model of words and supersenses, providing insights into the relationship between words and superse...

متن کامل

A corpus of preposition supersenses in English web reviews

We present the first corpus annotated with preposition supersenses, unlexicalized categories for semantic functions that can be marked by English prepositions (Schneider et al., 2015). That scheme improves upon its predecessors to better facilitate comprehensive manual annotation. Moreover, unlike the previous schemes, the preposition supersenses are organized hierarchically. Our data will be p...

متن کامل

Supersense Tagging for Arabic: the MT-in-the-Middle Attack

We consider the task of tagging Arabic nouns with WordNet supersenses. Three approaches are evaluated. The first uses an expertcrafted but limited-coverage lexicon, Arabic WordNet, and heuristics. The second uses unsupervised sequence modeling. The third and most successful approach uses machine translation to translate the Arabic into English, which is automatically tagged with English superse...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1704.02134  شماره 

صفحات  -

تاریخ انتشار 2017