Automated annotation and classification of BI-RADS assessment from radiology reports

Authors

  • Sergio M. Castro
  • Eugene Tseytlin
  • Olga Medvedeva
  • Kevin J. Mitchell
  • Shyam Visweswaran
  • Tanja Bekhuis
  • Rebecca S. Jacobson
Abstract

The Breast Imaging Reporting and Data System (BI-RADS) was developed to reduce variation in the descriptions of findings. Manual analysis of breast radiology report data is challenging but necessary for clinical and healthcare quality assurance activities. The objective of this study is to develop a natural language processing (NLP) system for automated extraction of BI-RADS categories from breast radiology reports. We evaluated an existing rule-based NLP algorithm, and then developed and evaluated our own method using a supervised machine learning approach. We divided the BI-RADS category extraction task into two specific tasks: (1) annotation of all BI-RADS category values within a report, and (2) classification of the laterality of each BI-RADS category value. We used one algorithm for task 1 and evaluated three algorithms for task 2. Across all evaluations and model training, we used a total of 2159 radiology reports from 18 hospitals, dated from 2003 to 2015. Performance with the existing rule-based algorithm was not satisfactory. Conditional random fields performed well on task 1, with an F-1 measure of 0.95. The rules from partial decision trees (PART) algorithm showed the best performance across classes for task 2, with a weighted F-1 measure of 0.91 for BI-RADS 0-6 and 0.93 for BI-RADS 3-5. Classification performance by class improved for all classes from Naïve Bayes to Support Vector Machine (SVM), and again from SVM to PART. Our system is able to annotate and classify all BI-RADS mentions present in a single radiology report and can serve as the foundation for future studies that will leverage automated BI-RADS annotation to provide feedback to radiologists as part of a learning health system loop.
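To make the "weighted F-1 measure" reported for task 2 concrete, the following is a minimal sketch of how such a score is commonly computed: the F-1 of each class is averaged with weights proportional to the number of true instances of that class. It is not the authors' evaluation code. The laterality labels (`left`, `right`, `bilateral`) and the toy predictions are assumptions made purely for illustration, and the function names are hypothetical.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return a dict mapping each label to its F-1 score."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = {}
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores[label] = 2 * precision * recall / (precision + recall)
        else:
            scores[label] = 0.0
    return scores


def weighted_f1(y_true, y_pred):
    """Average the per-class F-1 scores, weighted by class support."""
    support = Counter(y_true)  # number of true instances per class
    scores = per_class_f1(y_true, y_pred)
    total = len(y_true)
    return sum(scores[label] * count / total for label, count in support.items())


# Hypothetical laterality labels for six BI-RADS mentions (illustration only).
gold = ["left", "left", "right", "right", "bilateral", "left"]
pred = ["left", "right", "right", "right", "bilateral", "left"]

print(per_class_f1(gold, pred))
print(round(weighted_f1(gold, pred), 4))
```

Weighting by support means frequent classes dominate the score, which is why per-class results, as the abstract also reports, are worth reading alongside the weighted figure.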


Similar resources

Automated detection of ambiguity in BI-RADS assessment categories in mammography reports.

An unsolved challenge in biomedical natural language processing (NLP) is detecting ambiguities in the reports that can help physicians to improve report clarity. Our goal was to develop NLP methods to tackle the challenges of identifying ambiguous descriptions of the laterality of BI-RADS Final Assessment Categories in mammography radiology reports. We developed a text processing system that us...


Using automatically extracted information from mammography reports for decision-support

OBJECTIVE To evaluate a system we developed that connects natural language processing (NLP) for information extraction from narrative text mammography reports with a Bayesian network for decision-support about breast cancer diagnosis. The ultimate goal of this system is to provide decision support as part of the workflow of producing the radiology report. MATERIALS AND METHODS We built a syst...


Machine Learning Approaches to Automatic BI-RADS Classification of Mammography Reports

The average American radiologist interprets at least 1,777 mammogram reports each year, or approximately one new mammogram every 70 minutes [1]. Because radiologists interpret so many mammograms and because the proper interpretation of a screening mammogram is often a matter of life or death for the woman involved, various attempts have been made to streamline the mammography reporting process ...


Automated Indexing of Mammography Reports Using Linear Least Squares Fit

Radiologists routinely document mammography results in free text dictations. In the last decade, there has been an increase in the volume of mammography performed in the U.S. As a result, the American College of Radiology has standardized the practice of screening mammography by introducing a controlled vocabulary and practice standards tracked by audits. Extracting data from these free text re...


Presenting a Simplified Assistant Tool for Breast Cancer Diagnosis in Mammography to Radiologists

This paper proposes a method to simplify a computational model from logistic regression for clinical use without a computer. The model was built using human-interpreted features, including some BI-RADS standardized features, for diagnosing malignant masses. It was compared with diagnosis using only the assessment categorization from BI-RADS. The research aims at assisting radiologists to diagno...



Journal:
  • Journal of biomedical informatics

Volume 69  Issue 

Pages  -

Publication date: 2017