DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data

نویسندگان

  • Gustavo Arango-Argoty
  • Emily Garner
  • Amy Pruden
  • Lenwood S. Heath
  • Peter Vikesland
  • Liqing Zhang
چکیده

BACKGROUND Growing concerns about increasing rates of antibiotic resistance call for expanded and comprehensive global monitoring. Advancing methods for monitoring of environmental media (e.g., wastewater, agricultural waste, food, and water) is especially needed for identifying potential resources of novel antibiotic resistance genes (ARGs), hot spots for gene exchange, and as pathways for the spread of ARGs and human exposure. Next-generation sequencing now enables direct access and profiling of the total metagenomic DNA pool, where ARGs are typically identified or predicted based on the "best hits" of sequence searches against existing databases. Unfortunately, this approach produces a high rate of false negatives. To address such limitations, we propose here a deep learning approach, taking into account a dissimilarity matrix created using all known categories of ARGs. Two deep learning models, DeepARG-SS and DeepARG-LS, were constructed for short read sequences and full gene length sequences, respectively. RESULTS Evaluation of the deep learning models over 30 antibiotic resistance categories demonstrates that the DeepARG models can predict ARGs with both high precision (> 0.97) and recall (> 0.90). The models displayed an advantage over the typical best hit approach, yielding consistently lower false negative rates and thus higher overall recall (> 0.9). As more data become available for under-represented ARG categories, the DeepARG models' performance can be expected to be further enhanced due to the nature of the underlying neural networks. Our newly developed ARG database, DeepARG-DB, encompasses ARGs predicted with a high degree of confidence and extensive manual inspection, greatly expanding current ARG repositories. CONCLUSIONS The deep learning models developed here offer more accurate antimicrobial resistance annotation relative to current bioinformatics practice. DeepARG does not require strict cutoffs, which enables identification of a much broader diversity of ARGs. The DeepARG models and database are available as a command line version and as a Web service at http://bench.cs.vt.edu/deeparg .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploratory Metagenomic Analysis of Antibiotic Resistance Genes in Bacterial Communities A statistical approach for classification of the bacterial resitome

The increasing prevalence of antibiotic-resistant bacteria has become a notorious threat to human health. Bacteria become resistant through resistance genes that can move between cells using horizontal gene transfer. Antibiotics are naturally produced by microorganisms in the environment and therefore bacterial communities maintain a large collection of resistance genes (the resistome). The div...

متن کامل

Virulence-associated and antibiotic resistance genes of microbial populations in cattle feces analyzed using a metagenomic approach.

The bovine fecal microbiota impacts human food safety as well as animal health. Although the bacteria of cattle feces have been well characterized using culture-based and culture-independent methods, techniques have been lacking to correlate total community composition with community function. We used high throughput sequencing of total DNA extracted from fecal material to characterize general ...

متن کامل

The Human Gut Microbiome as a Transporter of Antibiotic Resistance Genes between Continents.

Previous studies of antibiotic resistance dissemination by travel have, by targeting only a select number of cultivable bacterial species, omitted most of the human microbiome. Here, we used explorative shotgun metagenomic sequencing to address the abundance of >300 antibiotic resistance genes in fecal specimens from 35 Swedish students taken before and after exchange programs on the Indian pen...

متن کامل

Deep Learning for Metagenomic Data: using 2D Embeddings and Convolutional Neural Networks

Deep learning (DL) techniques have had unprecedented success when applied to images, waveforms, and texts to cite a few. In general, when the sample size (N ) is much greater than the number of features (d), DL outperforms previous machine learning (ML) techniques, often through the use of convolution neural networks (CNNs). However, in many bioinformatics ML tasks, we encounter the opposite si...

متن کامل

Detection of antibiotic resistance genes in some Lactococcus garvieae strains isolated from infected rainbow trout

The present study was done to evaluate the presence of antibiotic resistance genes in Lactococcus garvieae isolated from cultured rainbow trout, West Iran.The isolates were examined for antimicrobial resistance using disc diffusion method. Of the 24 strains tested, 21 were resistant to ampicillin (87.5%), 9 to erythromycin (37.5%) and 19 to tetracycline (79.1%). Fourteen strains were resistant ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2018