Expanding a dictionary of marker words for uncertainty and negation using distributional semantics
نویسندگان
چکیده
Approaches to determining the factuality of diagnoses and findings in clinical text tend to rely on dictionaries of marker words for uncertainty and negation. Here, a method for semi-automatically expanding a dictionary of marker words using distributional semantics is presented and evaluated. It is shown that ranking candidates for inclusion according to their proximity to cluster centroids of semantically similar seed words is more successful than ranking them according to proximity to each individual seed word.
منابع مشابه
Expanding HiErarcHical contExts for constructing a sEmantic Word nEtWork
A semantic word network is a network that represents the semantic relations between individual words or their lexical senses. This paper proposes Watlink, an unsupervised method for inducing a semantic word network (SWN) by constructing and expanding the hierarchical contexts using both the available dictionary resources and distributional semantics’ methods for is-a relations. It has three ste...
متن کاملImplementing a Reverse Dictionary, based on word definitions, using a Node-Graph Architecture
In this paper, we outline an approach to build graph-based reverse dictionaries using word definitions. A reverse dictionary takes a phrase as an input and outputs a list of words semantically similar to that phrase. It is a solution to the Tip-of-the-Tongue problem. We use a distance-based similarity measure, computed on a graph, to assess the similarity between a word and the input phrase. We...
متن کاملThere Is No Logical Negation Here, But There Are Alternatives: Modeling Conversational Negation with Distributional Semantics
Logical negation is a challenge for distributional semantics, because predicates and their negations tend to occur in very similar contexts, and consequently their distributional vectors are very similar. Indeed, it is not even clear what properties a “negated” distributional vector should possess. However, when linguistic negation is considered in its actual discourse usage, it often performs ...
متن کاملSame Referent, Different Words: Unsupervised Mining of Opaque Coreferent Mentions
Coreference resolution systems rely heavily on string overlap (e.g., Google Inc. and Google), performing badly on mentions with very different words (opaque mentions) like Google and the search giant. Yet prior attempts to resolve opaque pairs using ontologies or distributional semantics hurt precision more than improved recall. We present a new unsupervised method for mining opaque pairs. Our ...
متن کاملQuantifier Scope in Categorical Compositional Distributional Semantics
Categorical Compositional Distributional semantics (CCDS) adds compositionality to distributional semantics via a functorial passage from the syntax to the semantics of natural language [4]. Both the syntax and the semantics are represented by compact closed categories. The claim is that regardless of how complex the structure of a sentence can be and what bizarre forms the words therein can ta...
متن کامل