Skip to content

Indexing and Visualization for AMRs and Virulence Genes in Genomes and Metagenomes

License

Notifications You must be signed in to change notification settings

NCBI-Hackathons/Virulence_AMR_Data_Index_Resource

Repository files navigation

Virulence_AMR_Data_Index

Indexing and Visualization for AMRs and Virulence Genes in Genomes and Metagenomes

Purpose

Develop a tool capable of evaluating metadata quality in the Sequence Read Archive (SRA), together with the contigs assembled from each of the runs (SRRs).

Provide visualization capabilities to assist in downstream interpretation.

Use cases:

  1. Find accession/contig by taxonomy
  2. Find accessions/contig by gene (drug resistance, virulence factor)
  3. Find accession/contig by genes cluster
  4. Find accession/contig by domain
  5. Find accession by host
  6. Find similar SRR/contigs
  7. Find accession/contig by sequence

Workflow

The overarching framework of our approach is to enable the integration of various metadata types in a large data structure that consists of multiple tables. An example of such a framework and the various data types that can be supported can be seen below. Metadata Pipeline

Q: Should similarity/difference in sequence content correlate with similarity/difference in metadata?

A: Let's find out what it looks like currently! Metadata Pipeline

Implementation

Please see the README files for replicating the analysis described in workflow #2. Note that Python3, R, and access to the Google Cloud Platform will be required.

Authors

Eric Holloway, Sergey Nurk, Brett E. Pickett, Ryan Connor

About

Indexing and Visualization for AMRs and Virulence Genes in Genomes and Metagenomes

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages