Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications

نویسندگان

چکیده

Most unsupervised NLP models represent each word with a single point or region in semantic space, while the existing multi-sense embeddings cannot longer sequences like phrases sentences. We propose novel embedding method for text sequence (a phrase sentence) where is represented by distinct set of multi-mode codebook to capture different facets its meaning. The can be viewed as cluster centers which summarize distribution possibly co-occurring words pre-trained space. introduce an end-to-end trainable neural model that directly predicts from input during test time. Our experiments show per-sentence significantly improve performances sentence similarity and extractive summarization benchmarks. In experiments, we discover multi-facet provide interpretable representation but do not outperform single-facet baseline.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The semantic analysis of ^/-phrases for word sense disambiguation

The aim of this paper is to show that the construction of semantic resources is a sine qua non if one wishes to tackle such complex and ambitious tasks as word sense disambiguation or (machine) translation selection, which are notorious stumbling blocks in most natural language processing systems. More specifically, I will illustrate my contention with examples featuring o/phrases to show that ...

متن کامل

Word Sense Disambiguation for Semantic Applications

Natural language processing (NLP) has become the most significant obstacle that has been restricting the applications via the web. Today, very little of the content on the web can be understood by the machines, although vast amount of electronic information has been kept on them. Word sense disambiguation (WSD) is an important intermediate step in many language processing applications. It is ba...

متن کامل

Sense Embedding Learning for Word Sense Induction

Conventional word sense induction (WSI) methods usually represent each instance with discrete linguistic features or cooccurrence features, and train a model for each polysemous word individually. In this work, we propose to learn sense embeddings for the WSI task. In the training stage, our method induces several sense centroids (embedding) for each polysemous word. In the testing stage, our m...

متن کامل

UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness

In this paper we describe an unsupervised WordNet-based Word Sense Disambiguation system, which participated (as UMND1) in the SemEval-2007 Coarsegrained English Lexical Sample task. The system disambiguates a target word by using WordNet-based measures of semantic relatedness to find the sense of the word that is semantically most strongly related to the senses of the words in the context of t...

متن کامل

Unsupervised word sense disambiguation in dynamic semantic spaces

In this paper, we are mainly concerned with the ability to quickly and automa cally dis nguish word senses in dynamic seman c spaces in which new terms and new senses appear frequently. Such spaces are built “on the fly” from constantly evolving data sets such as Wikipedia, repositories of patent grants and applica ons, or large sets of legal documents for Technology Assisted Review and e-disco...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i8.16857