Using Biomedical Terminologies to extract Noun Phrases for managing knowledge evolution

نویسندگان

  • Adila Merabti
  • Lina Fatima Soualmia
  • Stéfan Jacques Darmoni
چکیده

In order to identify variations between two or several versions of Clinical Practice Guidelines, we propose a method based on the detection of noun phrases. Currently, we are developing a comparison approach to extract similar and different elements between medical documents in French in order to identify any significant changes such as new medical terms or concepts, new treatments etc. In this paper, we describe a basic initial step for this comparison approach i.e. detecting noun phrases. This step is based on patterns constructed from six main medical terminologies used in document indexing. The patterns are constructed by using a Tree Tagger. To avoid a great number of generated patterns, the most relevant ones are selected by choosing those that identify more than 80% of the six terminologies used in this study. These steps allowed us to obtain a manageable list of 262 patterns which have been evaluated. Using this list of patterns, 708 maximal noun phrases were found, among them, 364 are correct which represent a 51.41% precision. However by detecting these phrases manually, 602 maximal noun phrases were found which represent a 60.47% recall and by consequence a 55.57% F-measure. We tried to improve these results by increasing a number of patterns from 262 to 493. We obtained a total of 729 maximal noun phrases, with 365 which were correct, which corresponding to a 50.07% precision, 60.63% recall and 54.85% F-measure.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Conceptual Terms from Medical Documents

Automated biomedical concept recognition is important for biomedical document retrieval and text mining research. In this paper, we describe a two-step concept extraction technique for documents in biomedical domain. Step one includes noun phrase extraction, which can automatically extract noun phrases from medical documents. Extracted noun phrases are used as concept term candidates which beco...

متن کامل

A Hybrid Approach to Extract and Classify Relation from Biomedical Text

Unstructured biomedical text is a key source of knowledge. Information extraction in biomedical is a complex task due to the high volume of data. Manual efforts produce the best results; however, it is a near impossible task for such a large amount of data. Thus, there is a need of tools and techniques in biomedical text to extract the information automatically. Biomedical text contains relatio...

متن کامل

Use of Articles in Learning English as a Foreign Language: A Study of Iranian English Undergraduates

The significance of error analysis for the learner, the teacher and the researcher is now widely recognized. Earlier studies of error analysis concentrated on intersystematic comparison of the “native language” and the “target language” and drew the required data largely from intuitions and impressionistic observations. This study was conducted on the basis of the following observations: (1) to...

متن کامل

Method Mention Extraction from Scientific Research Papers

Scientific publications contain many references to method terminologies used during scientific experiments. New terms are constantly created within the research community, especially in the biomedical domain where thousands of papers are published each week. In this study we report our attempt to automatically extract such method terminologies from scientific research papers, using rule-based a...

متن کامل

Extracting noun phrases for all of MEDLINE

A natural language parser that could extract noun phrases for all medical texts would be of great utility in analyzing content for information retrieval. We discuss the extraction of noun phrases from MEDLINE, using a general parser not tuned specifically for any medical domain. The noun phrase extractor is made up of three modules: tokenization; part-of-speech tagging; noun phrase identificati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014