Errata for SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses
نویسندگان
چکیده
SemEval-2013 Task 13 measures the performance of Word Sense Induction (WSI) and unsupervised Word Sense Disambiguation (WSD) systems. Given a word in context, the systems must label the word with one or more senses, indicating valid interpretations of the word, and where each sense annotation may be accompanied by a weight indicating how likely that interpretation is. Before performing the annotation task, WSI systems first induce the different meanings of a word by examining usages of the word in text; in contrast, the WSD systems were asked to use the WordNet 3.1 sense inventory. After the completion of Task 13 and the publication of the task description paper (Jurgens and Klapaftis, 2013), a software bug was discovered in the evaluation program that affected the scores in a limited set of circumstances.1 Specifically, the bug resulted in an incorrect calculation of Recall for a WSI or WSD system when not all instances were labeled with senses. In most cases, a system does label all instances with senses and thus, the bug does not occur. However, the task report includes a follow-up experiment that tested systems using only those instances that were labeled with multiple senses; in this setting, many WSI systems ultimately reported fewer instances and, due to the bug, had incorrect scores in the task report.2
منابع مشابه
SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses
Most work on word sense disambiguation has assumed that word usages are best labeled with a single sense. However, contextual ambiguity or fine-grained senses can potentially enable multiple sense interpretations of a usage. We present a new SemEval task for evaluating Word Sense Induction and Disambiguation systems in a setting where instances may be labeled with multiple senses, weighted by t...
متن کاملUoS: A Graph-Based System for Graded Word Sense Induction
This paper presents UoS, a graph-based Word Sense Induction system which attempts to find all applicable senses of a target word given its context, grading each sense according to its suitability to the context. Senses of a target word are induced through use of a non-parameterised, linear-time clustering algorithm that returns maximal quasi-strongly connected components of a target word graph ...
متن کاملunimelb: Topic Modelling-based Word Sense Induction
This paper describes our system for shared task 13 “Word Sense Induction for Graded and Non-Graded Senses” of SemEval-2013. The task is on word sense induction (WSI), and builds on earlier SemEval WSI tasks in exploring the possibility of multiple senses being compatible to varying degrees with a single contextual instance: participants are asked to grade senses rather than selecting a single s...
متن کاملAI-KU: Using Substitute Vectors and Co-Occurrence Modeling For Word Sense Induction and Disambiguation
Word sense induction aims to discover different senses of a word from a corpus by using unsupervised learning approaches. Once a sense inventory is obtained for an ambiguous word, word sense discrimination approaches choose the best-fitting single sense for a given context from the induced sense inventory. However, there may not be a clear distinction between one sense and another, although for...
متن کاملAn Evaluation of Graded Sense Disambiguation using Word Sense Induction
Word Sense Disambiguation aims to label the sense of a word that best applies in a given context. Graded word sense disambiguation relaxes the single label assumption, allowing for multiple sense labels with varying degrees of applicability. Training multi-label classifiers for such a task requires substantial amounts of annotated data, which is currently not available. We consider an alternate...
متن کامل