Fine-Grained Certainty Level Annotations Used for Coarser-Grained E-Health Scenarios - Certainty Classification of Diagnostic Statements in Swedish Clinical Text
نویسندگان
چکیده
An important task in information access methods is distinguishing factual information from speculative or negated information. Fine-grained certainty levels of diagnostic statements in Swedish clinical text are annotated in a corpus from a medical university hospital. The annotation model has two polarities (positive and negative) and three certainty levels. However, there are many e-health scenarios where such fine-grained certainty levels are not practical for information extraction. Instead, more coarse-grained groups are needed. We present three scenarios: adverse event surveillance, decision support alerts and automatic summaries and collapse the fine-grained certainty level classifications into coarser-grained groups. We build automatic classifiers for each scenario and analyze the results quantitatively. Annotation discrepancies are analyzed qualitatively through manual corpus analysis. Our main findings are that it is feasible to use a corpus of fine-grained certainty level annotations to build classifiers for coarser-grained real-world scenarios: 0.89, 0.91 and 0.8 F-score (overall average).
منابع مشابه
SHADES OF CERTAINTY Annotation and Classification of Swedish Medical Records
Access to information is fundamental in health care. Today, with electronic documentation possibilities, techniques for automatic extraction of information from written documentation are used daily in many areas. However, in the clinical setting, written documentation is still unattainable for improving health care from many perspectives. For Swedish, research for improving automatic informatio...
متن کاملA Compositional Interpretation of Biomedical Event Factuality
We propose a compositional method to assess the factuality of biomedical events extracted from the literature. The composition procedure relies on the notion of semantic embedding and a fine-grained classification of extrapropositional phenomena, including modality and valence shifting, and a dictionary based on this classification. The event factuality is computed as a product of the extra-pro...
متن کاملExploring Fine-Grained Emotion Detection in Tweets
We examine if common machine learning techniques known to perform well in coarsegrained emotion and sentiment classification can also be applied successfully on a set of fine-grained emotion categories. We first describe the grounded theory approach used to develop a corpus of 5,553 tweets manually annotated with 28 emotion categories. From our preliminary experiments, we have identified two ma...
متن کاملTowards a better understanding of uncertainties and speculations in Swedish clinical text – Analysis of an initial annotation trial
Electronic Health Records (EHRs) contain a large amount of free text documentation which is potentially very useful for Information Retrieval and Text Mining applications. We have, in an initial annotation trial, annotated 6 739 sentences randomly extracted from a corpus of Swedish EHRs for sentence level (un)certainty, and token level speculative keywords and negations. This set is split into ...
متن کاملTERMINOLOGY AND THE CLASSIFICATION OF FINE GRAINED SEDIMENTARY ROCKS – is there a difference between a claystone, a mudstone and a shale?
Fine grained sedimentary rocks, both clastic and carbonate, are believed to be the most abundant rock type on the Earth‟s surface (Picard, 1971; Blatt, 1982). Fine grained rocks appear to constitute somewhere in the region of 70% (Holmes, 1937) and 80% (Clarke, 1924) of all the sediment ever produced. In sedimentology the size grade scale most commonly used is that which was introduced by Udden...
متن کامل