MicrobeDB Overview

      Morgan Langille
morgan.gi.langille@gmail.com
Main Features
   Centralized storage of, and access to, completed archaeal and
      bacterial genomes
       Genomes are obtained from NCBI RefSeq:
             http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi
       Genome flat files are stored in one central location
                 Includes .gbk, .gff, .fna, .faa, and other file types
       Unpublished genomes can be added as well

   Information at the genome project, chromosome, and gene levels
       is parsed and stored in a MySQL database

   A Perl MicrobeDB API provides an interface to the database
       that requires no SQL
Main MicrobeDB Tables
   Version
          Each download of genomes from NCBI is given a new version number
          Data will not change as long as you use the same version number of
              MicrobeDB
          The version date can be cited in any methods publication
          Users can save a version so that it is not automatically deleted
   Genome Project
          Contains information about the genome project and the organism that was
              sequenced
          Each genome project contains one or more replicons
   Replicon
          A chromosome, plasmid, or contig
          Each replicon contains one or more genes
   Gene
          Contains gene annotations, plus the DNA sequence and (for
              protein-coding genes) the protein sequence
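
The four tables above nest inside one another: a version holds genome projects, which hold replicons, which hold genes. As a rough illustration only, the sketch below models that nesting with plain Perl hashes. It is not the MicrobeDB schema or API; every field name and value is a made-up placeholder.

```perl
use strict;
use warnings;

# Hypothetical, hand-made data: one version holding one genome project,
# which holds two replicons, each holding one or more genes.
my %version = (
    version_id      => 1,
    genome_projects => [
        {
            org_name  => 'Example organism',
            replicons => [
                {
                    rep_type => 'chromosome',
                    genes    => [
                        { gid => 'g1', gene_type => 'CDS' },
                        { gid => 'g2', gene_type => 'tRNA' },
                    ],
                },
                {
                    rep_type => 'plasmid',
                    genes    => [ { gid => 'g3', gene_type => 'CDS' } ],
                },
            ],
        },
    ],
);

# Walk version -> genome project -> replicon -> gene and count the genes.
sub count_genes {
    my ($version) = @_;
    my $count = 0;
    for my $gp ( @{ $version->{genome_projects} } ) {
        for my $rep ( @{ $gp->{replicons} } ) {
            $count += scalar @{ $rep->{genes} };
        }
    }
    return $count;
}

print "Version $version{version_id} holds ", count_genes( \%version ), " genes\n";
```

Because every record hangs off a single version, pinning the version number pins everything beneath it.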
MicrobeDB Annotations
Accessing MicrobeDB
• Any traditional MySQL program
   – phpMyAdmin
      – Web-based
      – http://phpmyadmin.net
   – MySQL Workbench
      – Local desktop client
      – http://www.mysql.com/products/workbench/
• MicrobeDB Perl API
   – Allows interaction with the database directly from within a Perl script
   – Requires no knowledge of SQL
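
For comparison with the Perl API, here is a minimal sketch of the direct SQL route from a script, using the standard Perl DBI module. The table name `genomeproject`, its column names, and the `MICROBEDB_*` environment variables are assumptions made for illustration, not confirmed parts of the MicrobeDB schema. The database is contacted only if a DSN is supplied.

```perl
use strict;
use warnings;

# Build a parameterised query for pathogen genome projects in one version.
# ASSUMPTION: the table "genomeproject" and its columns are guesses based on
# the API field names used in these slides; check the real schema first.
sub pathogen_query {
    my ($version_id) = @_;
    my $sql = 'SELECT org_name FROM genomeproject '
            . 'WHERE version_id = ? AND patho_status = ?';
    return ( $sql, $version_id, 'pathogen' );
}

my ( $sql, @bind ) = pathogen_query(1);
print "$sql\n";

# Only touch the database when connection details are supplied
# (hypothetical environment variables, e.g. MICROBEDB_DSN="DBI:mysql:database=...").
if ( $ENV{MICROBEDB_DSN} ) {
    require DBI;
    my $dbh = DBI->connect( $ENV{MICROBEDB_DSN}, $ENV{MICROBEDB_USER},
        $ENV{MICROBEDB_PASS}, { RaiseError => 1 } );
    my $sth = $dbh->prepare($sql);
    $sth->execute(@bind);
    while ( my ($org_name) = $sth->fetchrow_array ) {
        print "$org_name\n";
    }
    $dbh->disconnect;
}
```

The `?` placeholders let DBI quote the values, which avoids hand-built SQL strings.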
MySQL Workbench
phpMyAdmin
MicrobeDB API Example
# Load the MicrobeDB search and genome project classes
# (GenomeProject is assumed to live in the MicrobeDB namespace, like Search)
use strict;
use warnings;
use MicrobeDB::Search;
use MicrobeDB::GenomeProject;

# Create the search object
my $search_obj = MicrobeDB::Search->new();

# Create an object with the features we want (e.g. only pathogens)
my $obj = MicrobeDB::GenomeProject->new( version_id => '1', patho_status => 'pathogen' );

# Run the search; returns all genome projects that match the search parameters
my @result_objs = $search_obj->object_search($obj);

# Iterate through each genome project
foreach my $gp_obj (@result_objs) {

    # Get the name of the genome
    my $org_name = $gp_obj->org_name();

    foreach my $gene_obj ( $gp_obj->genes() ) {
        if ( $gene_obj->gene_type() eq 'tRNA' ) {
            # Write the gene in FASTA format with gid as the identifier
            print '>', $gene_obj->gid(), "\n", $gene_obj->gene_seq(), "\n";
        }
    }
}
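
The example above writes each sequence on a single line. FASTA files are often wrapped at a fixed width, so a small helper can take care of the formatting. This is a stand-alone sketch, not part of the MicrobeDB API: `fasta_record` is a hypothetical name and the identifier and sequence below are placeholders.

```perl
use strict;
use warnings;

# Format one FASTA record, wrapping the sequence at $width characters
# (defaults to 60 when no width is given).
sub fasta_record {
    my ( $id, $seq, $width ) = @_;
    $width ||= 60;
    my $out = ">$id\n";
    for ( my $i = 0 ; $i < length $seq ; $i += $width ) {
        $out .= substr( $seq, $i, $width ) . "\n";
    }
    return $out;
}

# Placeholder identifier and sequence, wrapped at 4 characters per line.
print fasta_record( 'gene_1', 'ACGTACGTAC', 4 );
```

Inside the loop over genes, the print statement could then pass the gene's `gid` and `gene_seq` values to this helper instead of assembling the record by hand.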
