The effects of natural language processing on cross-institutional portability of influenza case detection for disease surveillance.

Authors

  • Jeffrey P Ferraro
  • Ye Ye
  • Per H Gesteland
  • Peter J Haug
  • Fuchiang Rich Tsui
  • Gregory F Cooper
  • Rudy Van Bree
  • Thomas Ginter
  • Andrew J Nowalk
  • Michael Wagner
Abstract

OBJECTIVES: This study evaluates the accuracy and portability of a natural language processing (NLP) tool for extracting clinical findings of influenza from clinical notes across two large healthcare systems. Effectiveness is judged by how well NLP supports downstream influenza case detection for disease surveillance.

METHODS: We independently developed two NLP parsers, one at Intermountain Healthcare (IH) in Utah and the other at the University of Pittsburgh Medical Center (UPMC), each using local clinical notes from emergency department (ED) encounters of influenza. We measured NLP parser performance for the presence and absence of 70 clinical findings indicative of influenza. We then developed Bayesian network models from NLP-processed reports and tested their ability to discriminate among cases of (1) influenza, (2) non-influenza influenza-like illness (NI-ILI), and (3) 'other' diagnosis.

RESULTS: On IH reports, recall and precision were 0.71 and 0.75 for the IH NLP parser and 0.67 and 0.79 for the UPMC NLP parser. On UPMC reports, recall and precision were 0.73 and 0.80 for the UPMC NLP parser and 0.53 and 0.80 for the IH NLP parser. Bayesian case-detection performance, measured by AUROC for influenza versus non-influenza, was 0.93 (IH NLP parser) and 0.93 (UPMC NLP parser) on IH cases, and 0.95 (UPMC NLP parser) and 0.83 (IH NLP parser) on UPMC cases. For influenza versus NI-ILI, AUROC was 0.70 (IH NLP parser) and 0.76 (UPMC NLP parser) on IH cases, and 0.76 (UPMC NLP parser) and 0.65 (IH NLP parser) on UPMC cases.

CONCLUSION: In all but one instance (influenza versus NI-ILI using IH cases), local parsers were more effective at supporting case detection, although the non-local parsers performed reasonably.
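For readers unfamiliar with the metrics reported above, the following is a minimal Python sketch of how recall, precision, and AUROC can be computed. It is not the authors' evaluation code. The function names, the "finding:status" string encoding, and the toy data are all assumptions made for illustration, and AUROC is computed here with the standard pairwise-ranking definition rather than any method stated in the abstract.

```python
from typing import Iterable, Sequence, Tuple


def precision_recall(gold: Iterable[str], predicted: Iterable[str]) -> Tuple[float, float]:
    """Precision and recall of predicted findings against a gold annotation.

    Each finding is a hypothetical "name:status" string, e.g. "fever:present".
    """
    gold_set, pred_set = set(gold), set(predicted)
    true_positives = len(gold_set & pred_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    return precision, recall


def auroc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUROC as the probability that a random positive case outranks a
    random negative case, counting ties as half a win."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Toy data (invented for illustration, not taken from the study).
gold_findings = ["fever:present", "cough:present", "myalgia:absent", "headache:present"]
parser_findings = ["fever:present", "cough:present", "chills:present"]
precision, recall = precision_recall(gold_findings, parser_findings)

# Hypothetical influenza probabilities with 1 = influenza, 0 = not influenza.
case_scores = [0.9, 0.8, 0.4, 0.3, 0.2]
case_labels = [1, 0, 1, 0, 0]
area = auroc(case_scores, case_labels)
```

In this toy example the parser recovers two of the four gold findings and reports three in total, so recall is 2/4 and precision is 2/3. The pairwise AUROC is 5/6, because five of the six positive-negative pairs are ranked correctly.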


Related articles

A Review of Influenza Surveillance System in the Islamic Republic of Iran: History, Structures and Processes

Background and Objectives: Iran, like most other countries in the world, is always threatened by global epidemics and pandemics of influenza. The purpose of this study was to review the influenza surveillance system in Iran. Methods: Data for this study were obtained from the surveillance system of the Center for Communicable Disease Control, the review of records, documents, books and pub...


Applying conserved peptides of NS1 Protein of avian influenza virus to differentiate infected from vaccinated chickens

Avian influenza (AI) is a highly contagious disease in poultry and outbreaks can have dramatic economic and health implications. For effective disease surveillance, rapid and sensitive assays are needed to detect antibodies against AI virus (AIV) proteins. In order to support eradication efforts of avian influenza (AI) infections in poultry, the implementation of “DIVA” vaccination strategies, ...


Feasibility of Using Clinical and Non-Clinical Data Sources in the Influenza Syndromic Surveillance System: Applying a Correlation Analysis Approach

Background and Objectives: Syndromic surveillance systems are used for early detection of outbreaks. The purpose of this study was to determine the feasibility of clinical and non-clinical data sources used in influenza syndromic surveillance in Zanjan. Methods: In this time series study, clinical and non-clinical data related to influenza-like illness (ILI) as a potential data source of synd...


Almost-Unsupervised Cross-Language Opinion Analysis at NTCIR-7

We describe the Sussex NLCL System entered in the NTCIR-7 Multilingual Opinion Analysis Task (MOAT). Our main focus is on the problem of portability of natural language processing systems across languages. Our system was the only one entered for all four of the MOAT languages: Japanese, English, and Simplified and Traditional Chinese. The system uses an almost-unsupervised approach applied to tw...


Molecular Surveillance of Avian Influenza in Bird Parks of Tehran, Iran

BACKGROUND: Avian influenza (AI) viruses have been isolated from a wide diversity of free-living avian species representing several orders. Since 1998, H9N2 AI outbreaks have been one of the major problems in the Iranian poultry industry. In 2006, H5N1 was first reported in swans in the north of Iran, but until now there has been no official report from commercial flocks in Iran. OBJECTIVES: The...



Journal:
  • Applied clinical informatics

Volume 8, Issue 2

Pages: -

Publication date: 2017