Compositionality in (high-dimensional) space

نویسنده

  • Marco Baroni
چکیده

Formal semanticists have developed sophisticated compositional theories of sentential meaning, paying a lot of attention to those grammatical words (determiners, logical connectives, etc.) that constitute the functional scaffolding of sentences. Corpus-based computational linguists, on the other hand, have developed powerful distributional methods to induce representations of the meaning of contentrich words (nouns, verbs, etc.), typically discarding the functional scaffolding as "stop words". Since we do not communicate by logical formulas, nor, Tarzan-style, by flat concatenation of content words, a satisfactory model of the semantics of natural language should strike a balance between the two approaches. In this talk, I will present some recent proposals that try to get the best of both worlds by adapting the classic view of compositionality as function application developed by formal semanticists to distributional models of meaning. I will present preliminary evidence of the effectiveness of these methods in scaling up to the phrasal and sentential domains, and discuss to what extent the representations of phrases and sentences we get out of compositional distributional semantics are related to what formal semanticists are trying to capture.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-Linear Similarity Learning for Compositionality

Many NLP applications rely on the existence of similarity measures over text data. Although word vector space models provide good similarity measures between words, phrasal and sentential similarities derived from composition of individual words remain as a difficult problem. In this paper, we propose a new method of of non-linear similarity learning for semantic compositionality. In this metho...

متن کامل

یک روش مبتنی بر خوشه‌بندی سلسله‌مراتبی تقسیم‌کننده جهت شاخص‌گذاری اطلاعات تصویری

It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...

متن کامل

Inter-observer agreement between 2-dimensional CT versus 3-dimensional I-Space model in the Diagnosis of Occult Scaphoid Fractures

  Background: The I-Space is a radiological imaging system in which Computed Tomography (CT)-scans can be evaluated as a three dimensional hologram. The aim of this study is to analyze the value of virtual reality (I-Space) in diagnosing acute occult scaphoid fractures.   Methods: A convenient cohort of 24 patients with a CT-scan from prior studies, without a scaphoid fracture on radiograph, ye...

متن کامل

Constructing Two-Dimensional Multi-Wavelet for Solving Two-Dimensional Fredholm Integral Equations

In this paper, a two-dimensional multi-wavelet is constructed in terms of Chebyshev polynomials. The constructed multi-wavelet is an orthonormal basis for space. By discretizing two-dimensional Fredholm integral equation reduce to a algebraic system. The obtained system is solved by the Galerkin method in the subspace of by using two-dimensional multi-wavelet bases. Because the bases of subs...

متن کامل

Supervised Feature Extraction of Face Images for Improvement of Recognition Accuracy

Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...

متن کامل

Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models

We present a novel unsupervised approach to detecting the compositionality of multi-word expressions. We compute the compositionality of a phrase through substituting the constituent words with their “neighbours” in a semantic vector space and averaging over the distance between the original phrase and the substituted neighbour phrases. Several methods of obtaining neighbours are presented. The...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012