Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
نویسندگان
چکیده
We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. The motivation for this work comes from the fact that some ambiguities in language simply cannot be resolved without simultaneously reasoning about an associated image. If we consider the sentence “I shot an elephant in my pajamas”, looking at the language alone (and not reasoning about common sense), it is unclear if it is the person or the elephant that is wearing the pajamas or both. Our approach involves producing a diverse set of plausible hypotheses for both semantic segmentation and prepositional phrase attachment resolution that are then jointly re-ranked to select the most consistent pair. We show that our semantic segmentation and prepositional phrase attachment resolution modules have complementary strengths, and that joint reasoning produces more accurate results than any module operating in isolation. We also show that multiple hypotheses are crucial to improved multiple-module reasoning. Our vision and language approach significantly outperforms a state-of-the-art NLP system (Stanford Parser [18, 30]) by 17.91% (28.69% relative) in one experiment, and by 12.83% (25.28% relative) in another. We also make small improvements over a state-of-the-art vision system (DeepLab-CRF [15]).
منابع مشابه
Prepositional Phrase Attachment Problem Revisited: how Verbnet can Help
Resolving attachment ambiguities is a pervasive problem in syntactic analysis. We propose and investigate an approach to resolving prepositional phrase attachment that centers around the ways of incorporating semantic knowledge derived from the lexico-semantic ontologies such as VERBNET and WORDNET.
متن کاملResolving attachment ambiguities with multiple constraints.
Different theories of syntactic ambiguity resolution argue for different sources of information determining initial parsing decisions (e.g., structurally defined parsing principles, lexically specific biases, or referential pragmatics). However, a "constraint-based" approach to syntactic ambiguity resolution proposes that both lexically specific biases and referential pragmatics are used in par...
متن کاملA Weighted Polynomial Information Gain Kernel for Resolving Prepositional Phrase Attachment Ambiguities with Support Vector Machines
We introduce a new kernel for Support Vector Machine learning in a natural language setting. As a case study to incorporate domain knowledge into a kernel, we consider the problem of resolving Prepositional Phrase attachment ambiguities. The new kernel is derived from a distance function that proved to be succesful in memory-based learning. We start with the Simple Overlap Metric from which we ...
متن کاملIntegrating Symbolic and Statistical Methods for Prepositional Phrase Attachment
This paper’ presents a novel methodology of resolving prepositional phrase attachment ambiguities. The approach consists of three phases. First, we rely on a publicly available database to classify a large corpus of prepositional attachments extracted from the Treebank parses. As a by-product, the arguments of every prepositional relation are semantically disambiguated. In the second phase, the...
متن کاملResolving prepositional phrase attachment ambiguities in Spanish with a classifier
In this paper we present a classifier that solves a certain kind of ambiguities in syntactic structure for Spanish, namely, ambiguities as to the point of adjunction of a prepositional phrase in the syntactic structure of a sentence (PP attachment). As a starting point, we used EsTxala dependency grammar for Spanish, integrated within FreeLing, with an accuracy score of 61% on PP adjunction. Ou...
متن کامل