Biosurveillance for Invasive Fungal Infections via Text Mining
نویسندگان
چکیده
Invasive fungal diseases (IFDs) cause more than 1,000 deaths in hospitals and cost the health system more than AUD100m in Australia each year. The most common life-threatening IFD is aspergillosis and a patient with this IFD typically has 12 days prolonged in-patient time in hospital and an 8% mortality rate. Surveillance and detection of IFDs irrespective of the stage of diagnosis (i.e., early or late in disease) is important. We describe an application of text mining techniques, using machine learning over a range of features, to automatically detect cases of patients with IFD from the text in the reports of CT scans performed on them. We focus on detecting the presence of aspergillosis; however, we anticipate the approach to be transferable to other diseases or conditions by training the text mining component over appropriate reports. Previous systems based on language technology have been deployed for processing radiology reports and for detecting hospitalacquired infection using language-processing technology, with significant success. Our approach differs by using a purely statistical/machine-learning approach to the language technology, and by being trained and tested on data collected from a number of hospitals. We collected reports for 288 IFD and 291control patients from three different hospitals in Melbourne, Australia: Alfred Health, Melbourne Health, and Peter MacCallum Cancer Centre. We extracted a sample of 69 IFD and 49 control patients to perform detailed analysis of the text with regard to IFD; each patient had possibly multiple scans (and associated reports), resulting in a total of 398 scan reports from IFD-positive patients and 83 scan reports from control patients. We had medical experts annotate the patient-level classification on all scan reports at both sentence and report level: The annotators had to decide, for each sentence and report, whether it was positive, neutral, or negative with regards to IFD. We classify reports and patients as IFD-positive if they contain at least one positive sentence, and as negative otherwise. We used the Weka SVM implementation and employed a variety of textand concept-based features, including bag-of-words, punctuation, UMLS concepts and negated contexts extracted using MetaMap. We also automatically extracted high-value terms (as measured using log-likelihood ratio) and formulated multi-word concept descriptions. Our system showed Sensitivity of 0.94 and Specificity of 0.76 for classifying individual reports as being indicative of aspergillus, and 1.0 and 0.51 for classifying patients as having contracted the in-
منابع مشابه
Invasive Aspergillosis in COVID-19: A Review Study and Recommendations for Diagnostic Approaches
Background and purpose: COVID-19 is an emerging viral disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Bacterial or fungal superinfections of the lung may cause complications in clinical manifestations, treatment, and increase the mortality rate. In this article, we reviewed previous studies on invasive aspergillosis (IA) in viral infections and in adjustment with...
متن کاملRNAi technology: A Novel approaches against fungal infections
Despite the introduction of new antifungal agents, resistances to antifungal therapy continue to increase and outcome of invasive fungal infections treatment is frequently suboptimal. A large amount of the recent effort in antifungal drug discovery has focused on a limited set of targets with functions known or expected to be important for fungal viability and virulence. A variety of techniques...
متن کاملA Review of Antifungals and their Mono- and Combination- therapy in the Treatment of Invasive Fungal Infections
Invasive fungal infections with high mortality rate are a growing health concern in hospitals and medical centers. The infection usually occurs in people with compromised immune systems. The purpose of this paper is a review of the most commonly prescribed antifungal drugs for invasive fungal diseases. Antifungals consist of the four main groups; polyenes, azoles, echinocandins and DNA an...
متن کاملFungal infections of the Respiratory Tract-Section 1
Fungal infections of the Respiratory Tract are important causes of morbidity and mortality in immunocompromised hospitalized patients. Among these patients are those receiving cytotoxic chemotherapy for neoplastic diseases, those undergoing bone marrow stem cell transplantation or organ transplantation, and those affected with the acquired immune deficiency syndrome. Invasive Aspergillosis has ...
متن کاملOntology-Based Text Mining for Predicting Disease Outbreaks
We have implemented an ontology-based text-mining tool for predicting disease outbreaks. This tool is designed to be used as a free and open-source plug-in for InSTEDD’s interactive biosurveillance system Riff. Availability. This tool, in its source code, is freely available from http://code.google.com/p/e-dop/.
متن کامل