Using a Similarity Measurement to Partition a Vocabulary of Medical Concepts

Authors

  • Huanying Gu
  • James Geller
  • Li-min Liu
  • Michael Halper
Abstract

Controlled medical vocabularies have become increasingly important in a range of medical informatics applications. However, the extensive size of most vocabularies often makes it difficult for users to gain an understanding of their contents. In previous work, we have investigated the partitioning of a large semantic-network-based medical vocabulary into smaller units, for the purpose of easier graphical display and comprehension. The partitioning process relied heavily on a domain expert. In this paper, we propose a structural method for automating the partitioning of a vocabulary. The structural method is based on a definition of the similarity of a pair consisting of a child concept and its parent concept in the semantic network. A distribution over these similarities for all pairs in the semantic network is then computed. Based on this distribution, the semantic network can be partitioned into more manageable pieces. The approach has been applied to the InterMED and a complex portion of the MED, two large medical vocabularies.
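The abstract does not state the similarity definition itself, so the following is only a minimal sketch of the general idea under loud assumptions: concepts carry sets of properties, child-parent similarity is taken to be the Jaccard overlap of those sets (a stand-in, not the authors' definition), and the network is cut at every link whose similarity falls below a threshold read off the distribution. All concept names, property names, and the function names `jaccard`, `similarity_histogram`, and `partition` are hypothetical.

```python
from collections import Counter, defaultdict


def jaccard(a, b):
    """Overlap of two property sets (assumed stand-in similarity)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def similarity_histogram(sims, bins=10):
    """Count child-parent pairs per similarity bin (floor binning)."""
    return Counter(int(s * bins) / bins for s in sims.values())


def partition(edges, properties, threshold):
    """Keep links with similarity >= threshold; return (sims, groups)."""
    root = {c: c for c in properties}  # union-find forest

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]  # path halving
            x = root[x]
        return x

    sims = {}
    for child, parent in edges:
        s = jaccard(properties[child], properties[parent])
        sims[(child, parent)] = s
        if s >= threshold:  # similar enough: same partition
            root[find(child)] = find(parent)

    groups = defaultdict(set)
    for concept in properties:
        groups[find(concept)].add(concept)
    return sims, sorted(groups.values(), key=sorted)


# Hypothetical toy network, not taken from the InterMED or the MED.
properties = {
    "Event": {"name"},
    "Lab Test": {"name", "specimen", "units"},
    "Chemistry Test": {"name", "specimen", "units", "substance"},
    "Drug": {"name", "dose", "route"},
    "Antibiotic": {"name", "dose", "route", "target"},
}
edges = [
    ("Lab Test", "Event"),
    ("Chemistry Test", "Lab Test"),
    ("Drug", "Event"),
    ("Antibiotic", "Drug"),
]

sims, parts = partition(edges, properties, threshold=0.5)
hist = similarity_histogram(sims)
```

In this toy case the histogram has two clusters of similarity values, and a threshold placed between them separates the lab-test concepts and the drug concepts from the generic root. In a real vocabulary the threshold would have to be chosen by inspecting the actual distribution.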


Similar resources

Similarity measurement for describe user images in social media

Online social networks like Instagram are places for communication. Also, these media produce rich metadata which are useful for further analysis in many fields including health and cognitive science. Many researchers are using these metadata like hashtags, images, etc. to detect patterns of user activities. However, there are several serious ambiguities like how much reliable are these informa...


Building Folk UMLS: An Approach to Finding Meaning of Folk Terms in Medical Domain

As a medical domain knowledge base, the Unified Medical Language System (UMLS) focuses on formal and professional medical terms; online health forums contain user-generated “folk terms”, which can be used to complement the UMLS vocabulary. In this paper, we propose an approach to detecting folk terms from online discussions and matching their meanings to UMLS concepts. This approach makes conne...


Document Clustering Based on Ontology and a Fuzzy Approach

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...


Similarity Reasoning over Semantic Context–graphs

Similarity is a central cognitive mechanism for humans which enables a broad range of perceptual and abstraction processes, including recognizing and categorizing objects, drawing parallelism, and predicting outcomes. It has been studied computationally through models designed to replicate human judgment. The work presented in this dissertation leverages general purpose semantic networks to der...


Text Simplification Using Consumer Health Vocabulary to Generate Patient-Centered Radiology Reporting: Translation and Evaluation

BACKGROUND Radiology reporting is a clinically oriented form of documentation that reflects critical information for patients about their health care processes. Realizing its importance, many medical institutions have started providing radiology reports in patient portals. The gain, however, can be limited because of medical language barriers, which require a way for customizing these reports f...



Publication date: 1999