Comparing ICD9-Encoded Diagnoses and NLP-Processed Discharge Summaries for Clinical Trials Pre-Screening: A Case Study

Authors

  • Li Li
  • Herbert S. Chase
  • Chintan Patel
  • Carol Friedman
  • Chunhua Weng
Abstract

The prevalence of electronic medical record (EMR) systems has made mass-screening for clinical trials viable through secondary uses of clinical data, which often exist in both structured and free text formats. The tradeoffs of using information in either data format for clinical trials screening are understudied. This paper compares the results of clinical trial eligibility queries over ICD9-encoded diagnoses and NLP-processed textual discharge summaries. The strengths and weaknesses of both data sources are summarized along the following dimensions: information completeness, expressiveness, code granularity, and accuracy of temporal information. We conclude that NLP-processed patient reports supplement important information for eligibility screening and should be used in combination with structured data.

Similar resources

Automating ICD-9-CM Encoding Using Medical Language Processing: A Feasibility Study

Objective. To provide a qualitative evaluation of Natural Language Processing (NLP)-based ICD-9-CM encoding of narrative discharge summaries. Background. MedLEE is an NLP system that structures the information in textual medical reports. It was shown to be effective for decision support applications associated with narrative chest X-rays, mammograms, and Discharge Summaries (DS). Significance. ...

Administrative data underestimate acute ischemic stroke events and thrombolysis treatments: Data from a multicenter validation survey in Italy

BACKGROUND: Informing health systems and monitoring hospital performance using administrative data sets, mainly hospital discharge data coded according to the International Classification of Diseases, 9th edition, Clinical Modification (ICD9-CM), is now commonplace in several countries, but the reliability of diagnostic coding of acute ischemic stroke in routine practice is uncertain. This study aimed at ...

Toward the Automatic Generation of the Entry Level CDA Documents

Objective: CDA (Clinical Document Architecture) is a markup standard for clinical document exchange. To increase the semantic interoperability of document exchange, the clinical statements in the narrative blocks should be encoded with code values. Natural language processing (NLP) is required to transform the narrative blocks into the coded elements in the level 3 CDA docume...

Validity of Principal Diagnoses in Discharge Summaries and ICD-10 Coding Assessments Based on National Health Data of Thailand

Objectives: This study examined the validity of the principal diagnoses on discharge summaries and coding assessments. Methods: Data were collected from the National Health Security Office (NHSO) of Thailand in 2015. In total, 118,971 medical records were audited. The sample was drawn from government hospitals and private hospitals covered by the Universal Coverage Scheme in Thailand. Hospitals...

ICD9 codes cannot reliably identify hemorrhagic transformation of ischemic stroke.

A major objective of inpatient stroke care is the prevention of medical and neurological complications. Although rare, hemorrhagic transformation (HT) of an ischemic stroke (IS) can cause neurological deterioration and is associated with an increased risk of death [1]. If HT can be reliably identified in administrative data, it could become a component of hospital quality benchmarks. Previous stud...

Journal title:
  • AMIA ... Annual Symposium proceedings. AMIA Symposium

Publication date: 2008