Domain Specific Sense Disambiguation with Unsupervised Methods

نویسندگان

  • Diana Steffen
  • Bogdan Sacaleanu
  • Paul Buitelaar
چکیده

Most approaches in sense disambiguation have been restricted to supervised training over manually annotated, non-technical, English corpora. Application to a new language or technical domain requires extensive manual annotation of appropriate training corpora. As this is both expensive and inefficient, unsupervised methods are to be preferred, specifically in technical domains such as medicine. In the context of a project in the medical domain, we developed and evaluated two unsupervised methods for sense disambiguation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Resolving Ambiguities in Biomedical Text With Unsupervised Clustering Approaches

This paper explores the effectiveness of unsupervised clustering techniques developed for general English in resolving semantic ambiguities in the biomedical domain. Methods that use first and second order representations of context are evaluated on the National Library of Medicine Word Sense Disambiguation Corpus. We show that the method of clustering second order contexts in similarity space ...

متن کامل

TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain

Word sense disambiguation (WSD) is one of the main challenges in Computational Linguistics. TreeMatch is a WSD system originally developed using data from SemEval 2007 Task 7 (Coarse-grained English Allwords Task) that has been adapted for use in SemEval 2010 Task 17 (All-words Word Sense Disambiguation on a Specific Domain). The system is based on a fully unsupervised method using dependency k...

متن کامل

Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge- poor unsupervised methods

To cite: Chasin R, Rumshisky A, Uzuner O, et al. J Am Med Inform Assoc 2014;21:842–849. ABSTRACT Objective To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graphbased approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to ...

متن کامل

Research and applications: Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods

OBJECTIVE To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graph-based approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to the topic-modeling techniques that use domain-specific knowledge sources. MATERIALS AND METHOD...

متن کامل

Unsupervised Domain Relevance Estimation for Word Sense Disambiguation

This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization technique based on the statistical estimation of the relevance of a text with respect to a certain category. We use a pre-defined set of categories (we call them domains) which have been previously associated to WORDNET word senses. Given a certain domain, DRE distinguishes between relevant and non-r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • LDV Forum

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2004