Machine learning solutions to fight the Antibiotic Resistance

Antibiotic Resistance

Antibiotic resistance is turning into a global healthcare problem. The exponential growth of metagenomics data has contributed to creation of more accurate and fastest algorithms.

What is DeepARG?

DeepARG is a machine learning solution that uses deep learning to characterize and annotate antibiotic resistance genes in metagenomes. It is composed of two models for two types of input: short sequence reads and gene-like sequences.

Short NGS reads

DeepARG is able to annotate short sequence reads from Next Generation Sequencing (NGS) technologies such as Ilummina. This model has been trained with simulated antibiotic resistance reads to perform better with metagenomic samples.

Gene-Like Sequences

DeepARG is able to predict antibiotic resistance in long gene-like sequences. This model is suitable for annotating full sequence genes and to discover novel antibiotic resistance genes from assembled samples.


Antibiotic resistance Database

Please visit our repository to download deepARG-DB

Important! DeepARG-DB has been under continuous inspection in order to improve the quality of the models. As result, we created a new online resource called ARGminer that allows the manual inspection of ARGs.

For more information please visit the ARGminer website

Train Your deep Learning Model

We have included the neccesary steps to re-train the deepARGs models or to create your own deeep learning model using the architecture of deepARG.
Please look at the Bitbucket README file for details in the DeepARG repository.

Figure 1: Automatic annotation of highly homologous ARGs

Deep Learning Models

The deepARG models have been designed for computational analysis of next generation sequencing data such as Metagenomes. The main contribution of the deepARG models are their low false negative rate during predictions. Also, the gene-like sequences model is designed to find novel ARGs based on the sequence homology.

DeepARG pipeline

The pipeline can be used as an stand alone program. It was developed in python 2.7 and requires (optional) DIAMOND for making the alignments. The source code can be downloaded from this Git Repository hosted in BitBucket.


DeepARG requires the next python modules (all can be installed via pip):

  • Nolearn lasagne deep learning library.
  • Sklearn machine learning routines.
  • Theano for fast computation. For GPU usage (see theano documentation)


Open a terminal and clone the source code:

git clone


  • Go to the directory where the program was saved and open the file

    Replace path = '/home/gustavo1/tmp/deeparg-ss/'; with the current directory (deepARG path).

    For instance, deepARG was cloned at /home/user/deeparg-ss/
    The file should looks like
    path = '/home/user/deeparg-ss/';

  • Go to ./bin under deeparg-ss and run chmod +x diamond (only for LINUX)


Please visit our repository for details

Direct Annotation Service

New! (9/6/2017) We are excited to release a new version of our deepARG online analysis.

The DeepARG webservice (beta) is a tool with fully automated data analysis pipeline for Antibiotic Resistance annotation of raw metagenomics samples using the deepARG algorithm and our developed database (deepARG-DB). You just need to upload your raw sequence reads (*.fastq.gz) and our service will take care of everything else.

Our pipeline first removes low quality reads using TRIMMOMATIC, then, reads are merged into one big file (VSEARCH) and are submited for classification to the deepARG algorithm. Results are normalized to the 16s rRNA abundance in the sample.

DeepARG webservice has a very simple web user interface for the annotation of ARGs from metagenomes. It runs under the ARC (advanced research computing) center at Virginia Tech which guarantees a stable computing environment.

Meet our projects on metagenomics analysis

ARG-miner is a web service dedicated to the inspection of ARGs by using experts and crowdsourcing workers. It is designed with the aim to improve the quality of the ARG annotations. Indeed, the DeepARG-DB was the starting point for this database. Currently, few gene annotations have been inspected and accepted by our team. In the future, this database will be the used to update our deepARG models.

Visit Website
MetaStorm is a WebService developed for Functional and Taxonomic analysis of Metagenomes from Next Generation sequencing reads.
MetaStorm incorporates several functional databases such as ACLAME, COG, UNIPROT, CARD, ARDB, BACMET among others.

Visit Website
ARGPore (Antibiotic Resistance annotation from MinION nanopore sequencing reads) is a web platform developed for the analysis of antibiotic resistance from environmental metagenomics samples obtained with the MinION nanopore sequencer (under testing).

Visit Website


NSF Partnership in International Research and Education (PIRE).

HEARD: NSF Halting Environmental Antimicrobial Resistance Dissemination.
Effective Mitigation Strategies for Antimicrobial Resistance program.
The Virginia Tech Institute for Critical Technology and Applied Science Center for the Science and Engineering of the Exposome (SEE).

The Virginia Tech Sustainable Nanotechnology Interdisciplinary Graduate Education Program (IGEP).

Our team Who is behind this project

Contact us Keep in touch

The easiest and preferred way to get in touch with our team is by posting an issue, question or comment to our ARGminer blog

For instance if you are interested on knowing how to run deepARG using the docker contanier version, you can find a tutorial here: DeepARG docker execution

Or you can post any question regarding the deepARG usage as input for deepARG local version

We strongly recommend to contact us via our blog as other users may find your questions, issues or comments useful too.

You can still send us an email using the form below

Your message was successfully sent!

  • Address
  • Virginia Tech, VA 24061
  • Phone number
  • (+1) 202 717 5300
  • Email