Using Biomedical Terminologies to extract Noun Phrases for managing knowledge evolution
نویسندگان
چکیده
In order to identify variations between two or several versions of Clinical Practice Guidelines, we propose a method based on the detection of noun phrases. Currently, we are developing a comparison approach to extract similar and different elements between medical documents in French in order to identify any significant changes such as new medical terms or concepts, new treatments etc. In this paper, we describe a basic initial step for this comparison approach i.e. detecting noun phrases. This step is based on patterns constructed from six main medical terminologies used in document indexing. The patterns are constructed by using a Tree Tagger. To avoid a great number of generated patterns, the most relevant ones are selected by choosing those that identify more than 80% of the six terminologies used in this study. These steps allowed us to obtain a manageable list of 262 patterns which have been evaluated. Using this list of patterns, 708 maximal noun phrases were found, among them, 364 are correct which represent a 51.41% precision. However by detecting these phrases manually, 602 maximal noun phrases were found which represent a 60.47% recall and by consequence a 55.57% F-measure. We tried to improve these results by increasing a number of patterns from 262 to 493. We obtained a total of 729 maximal noun phrases, with 365 which were correct, which corresponding to a 50.07% precision, 60.63% recall and 54.85% F-measure.
منابع مشابه
Extracting Conceptual Terms from Medical Documents
Automated biomedical concept recognition is important for biomedical document retrieval and text mining research. In this paper, we describe a two-step concept extraction technique for documents in biomedical domain. Step one includes noun phrase extraction, which can automatically extract noun phrases from medical documents. Extracted noun phrases are used as concept term candidates which beco...
متن کاملA Hybrid Approach to Extract and Classify Relation from Biomedical Text
Unstructured biomedical text is a key source of knowledge. Information extraction in biomedical is a complex task due to the high volume of data. Manual efforts produce the best results; however, it is a near impossible task for such a large amount of data. Thus, there is a need of tools and techniques in biomedical text to extract the information automatically. Biomedical text contains relatio...
متن کاملUse of Articles in Learning English as a Foreign Language: A Study of Iranian English Undergraduates
The significance of error analysis for the learner, the teacher and the researcher is now widely recognized. Earlier studies of error analysis concentrated on intersystematic comparison of the “native language” and the “target language” and drew the required data largely from intuitions and impressionistic observations. This study was conducted on the basis of the following observations: (1) to...
متن کاملMethod Mention Extraction from Scientific Research Papers
Scientific publications contain many references to method terminologies used during scientific experiments. New terms are constantly created within the research community, especially in the biomedical domain where thousands of papers are published each week. In this study we report our attempt to automatically extract such method terminologies from scientific research papers, using rule-based a...
متن کاملExtracting noun phrases for all of MEDLINE
A natural language parser that could extract noun phrases for all medical texts would be of great utility in analyzing content for information retrieval. We discuss the extraction of noun phrases from MEDLINE, using a general parser not tuned specifically for any medical domain. The noun phrase extractor is made up of three modules: tokenization; part-of-speech tagging; noun phrase identificati...
متن کامل