SlideShare a Scribd company logo
João André Carriço,
Microbiology Institute and Instituto de Medicina Molecular,
Faculty of Medicine, University of Lisbon
jcarrico@fm.ul.pt twitter: @jacarrico
ME081 – Meet-The-Expert Session
26th ECCMID, Amsterdam, Netherlands
7-12 April 2016
 This presentation is not intended to cover all available
software or databases (we would need several weeks or
months to do that)
 I’ll present what I use or intend to use in a near future
 I gladly accept any suggestions to included on similar
presentations in the future.
 It is supposed to be interactive so ask away during the
presentation.
 Available Databases
 Virulence Factors and AMR DBs
 Sequence-based typing databases: Pubmlst.org / Enterobase
 HighThroughput Sequencing data analysis (freeware)
 Prokka
 Roary
 Nullabor
 Microreact.org
 PHYLOViZ
 Commercial Solutions
 Bionumerics 7.5
 CLC GenomicsWorkbench (CLC Bio)
 Ridom Seqsphere+
Virulence Factor Databases
 VFDB (http://www.mgc.ac.cn/VFs/main.htm)
 Pathosystems Resource Integration Center (PATRIC)
VF (https)://www.patricbrc.org/)
 Victors (http://www.phidias.us/victors/)
 PHI-Base (http://www.phi-base.org/)
 MvirDB (http://mvirdb.llnl.gov/ )
To know more:
- Presentation on the Controversies in interpreting whole genome sequence data session :
http://eccmidlive.org/#resources/how-can-we-design-actionable-virulome-databases
 Comprehensive Antibiotic Resistance Database
(CARD) (https://card.mcmaster.ca/)
 Repository of Antibiotic resistanceCassetes
(RAC)(http://rac.aihi.mq.edu.au/rac/)
 Integrall :The integron database
(http://integrall.bio.ua.pt/)
(…)
To know more :
http://www.slideshare.net/nickloman/eccmid-2015-so-i-have-sequenced-my-genome-what-now
Reads
(fastq files)
contigs
(fasta files)
Annotated contigs
(gbk/gff files)
Roary :PanGenome Analysis
Enterobase
BIGSdb
Nullabor
PHYLOViZ:
Tree + metada
visualization
Microreact.org:
Tree +metadata
+vizualization
Prokka
De novo assembler
http://www.pubmlst.org
http://bigsdb.web.pasteur.fr/
slide by @happy_khan
Martin Sergeant
Mark Achtman
Nabil-Fareed Alikhan
Zhemin Zhou
 Genome annotation made easy byTorsten
Seemann (slides byTorsten)
 Genome annotation: adding biological
information to the sequence, by describing
features
To know more :
http://www.slideshare.net/torstenseemann/prokka-rapid-bacterial-genome-annotation-abphm-2013
Available at: https://github.com/tseemann/prokka
 Pan genome analysis by Andrew Page
 Available at: https://sangerpathogens.github.io/Roary/
Core
genome
Accessory
genome
Pan-genome
 Inputs:Annotated de novo assemblies (GFF files)
• Typically from the annotation pipeline
 Outputs:
• Spreadsheet with presence and absence of genes
• Multi-FASTA alignment of core genes so you can build a tree
without a reference
• Multi-FASTA alignments for each gene
• Plots for the open/closed genome, unique genes
• Integrates with iCANDY so you can visualise all structural variation
• QC report from Kraken to help identify suspect samples
(Slide by Andrew Page)
Core (n or n-1 strains)
Soft-Core
(n-2 or n-3 strains)
Shell
( 8(?) to n-3 strains)
Cloud
( <8 (?) strains)
Core genome:
Core + Soft-Core
Accessory genome:
Shell + Cloud
iCANDY output of presence and
absence of genes in accessory
genome.
S. Weltevreden & public S. enterica
genomes
(Slide by Andrew Page)
 Complete pipeline from reads to reports byTorsten
Seemann
 Objective is automate analysis for everyday use on
public health labs /research settings
 Uses and distills outputs by a lot of software
 Avaliable at: https://github.com/tseemann/nullarbor
Slide byTorsten Seeman
From: https://github.com/tseemann/nullarbor
Slides byTorsten Seeman
www.phyloviz.net
Inputs:
- Tab separated txt (profiles)
- Fasta files
- Automatic database retrieval
(MLST)
Outputs:
• goeBURST and goeBURST
MST
• Link quality assessment
• High quality images
Can be easily applied to:
- MLST/ cgMLST/wgMLST
- MLVA
- SNP data*
- Gene Presence/absence
New features:
• Hierarchical clustering
• Neighbor-Joining
• Project Saving
 Available at http://online.phyloviz.net
 Web based version of PHYLOViZ
 Allows users to create their own datasets, save them and share their data
(privately or publicly)
 REST API available
 Scalable to thousands of nodes
 Tree Analysis tools:
 Interactive distance matrix
 NLV graph
Slide by @happy_khan
Computational Resources In Infectious Disease
NLV Graph
Tree cut-off
Full MST
Computational Resources In Infectious Disease
Computational Resources In Infectious Disease
Create Selections
Change tree options
 Available at http://microreact.org/
 Presentation on session Harnessing whole genome sequence data
for public health applications : Novel open access tools forWGS-
based pathogen surveillance and the identification of high-risk
clones
 http://eccmidlive.org/#resources/novel-open-access-tools-for-
wgs-based-pathogen-surveillance-and-the-identification-of-high-
risk-clones
Computational Resources In Infectious Disease
• Ridom Seqsphere+ : http://www.ridom.de/seqsphere/
• Applied Maths Bionumerics 7.6: http://www.applied-maths.com/bionumerics
• CLCBioGenomicWorkbench : http://www.clcbio.com/blog/clc-genomics-workbench-7-5/
• Huge variety of software and database solutions
• There is no single One-Size-Fits-All solution (job
security for bioinformaticians)
• Different questions require different approaches
• Always questions the results and data provenance
 ECCMID2015 Meet-the-expert session on “What bioinformatic tools
should I use for analysis of HighThroughput Sequencing data for
molecular diagnostics? ”
 Nick Loman: http://www.slideshare.net/nickloman/eccmid-2015-
meettheexpert-bioinformatics-tools
 João André Carriço:
http://www.slideshare.net/joaoandrecarrico/eccmid-meet-
theexpert2015
 UMMI Members
 Bruno Gonçalves
 Mário Ramirez
 José Melo-Cristino
 INESC-ID
 Alexandre Francisco
 Cátia Vaz
 Marta Nascimento
 EFSA INNUENDO Project (https://sites.google.com/site/innuendocon/)
 Mirko Rossi
 FP7 PathoNGenTrace (http://www.patho-ngen-trace.eu/):
 Dag Harmsen (Univ. Muenster)
 Stefan Niemann (Research Center Borstel)
 Keith Jolley, James Bray and Martin Maiden (Univ. Oxford)
 Joerg Rothganger (RIDOM)
 Hannes Pouseele (Applied Maths)
 Genome Canada IRIDA project (www.irida.ca)
 Franklin Bristow, Thomas Matthews, Aaron Petkau, Morag Graham and Gary Van Domselaar(NLM , PHAC)
 Ed Taboada and Peter Kruczkiewicz (LabFoodborne Zoonoses, PHAC)
 Fiona Brinkman (SFU)
 William Hsiao (BCCDC)
INTEGRATED RAPID INFECTIOUS DISEASE ANALYSIS

More Related Content

PPT
Integrating phylogenetic inference and metadata visualization for NGS data
João André Carriço
 
PPTX
ECCMID 2016 - How to build actionable virulome databases
João André Carriço
 
PPTX
Eccmid meet the expert 2015
João André Carriço
 
PPTX
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
João André Carriço
 
PPTX
Making Use of NGS Data: From Reads to Trees and Annotations
João André Carriço
 
PPTX
Software Pipelines: The Good, The Bad and The Ugly
João André Carriço
 
PPTX
Common languages in genomic epidemiology: from ontologies to algorithms
João André Carriço
 
PDF
Introduction to 16S Microbiome Analysis
Bioinformatics and Computational Biosciences Branch
 
Integrating phylogenetic inference and metadata visualization for NGS data
João André Carriço
 
ECCMID 2016 - How to build actionable virulome databases
João André Carriço
 
Eccmid meet the expert 2015
João André Carriço
 
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
João André Carriço
 
Making Use of NGS Data: From Reads to Trees and Annotations
João André Carriço
 
Software Pipelines: The Good, The Bad and The Ugly
João André Carriço
 
Common languages in genomic epidemiology: from ontologies to algorithms
João André Carriço
 
Introduction to 16S Microbiome Analysis
Bioinformatics and Computational Biosciences Branch
 

What's hot (20)

PDF
A peek inside the bioinformatics black box - DCAMG Symposium - mon 20 july 2015
Torsten Seemann
 
PDF
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Nathan Olson
 
PDF
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
Torsten Seemann
 
PDF
16S rRNA Analysis using Mothur Pipeline
Eman Abdelrazik
 
PDF
Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...
Torsten Seemann
 
PPTX
Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014
Torsten Seemann
 
PDF
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
VHIR Vall d’Hebron Institut de Recerca
 
PPTX
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
nist-spin
 
PPTX
Rapid outbreak characterisation - UK Genome Sciences 2014 - wed 3 sep 2014
Torsten Seemann
 
PDF
Errors and Limitaions of Next Generation Sequencing
Nixon Mendez
 
PDF
T-bioinfo overview
Jaclyn Williams
 
PPTX
Haendel clingenetics.3.14.14
mhaendel
 
PPT
20170209 ngs for_cancer_genomics_101
Ino de Bruijn
 
PPTX
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
jennomics
 
PDF
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
VHIR Vall d’Hebron Institut de Recerca
 
PPTX
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Luca Cozzuto
 
PPT
Metagenomics sequencing
cdgenomics525
 
PDF
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
VHIR Vall d’Hebron Institut de Recerca
 
PDF
Next Generation Sequencing Informatics - Challenges and Opportunities
Chung-Tsai Su
 
PDF
SPIN Workshop Microbial Genomics @NIST
Nathan Olson
 
A peek inside the bioinformatics black box - DCAMG Symposium - mon 20 july 2015
Torsten Seemann
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
Nathan Olson
 
WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015
Torsten Seemann
 
16S rRNA Analysis using Mothur Pipeline
Eman Abdelrazik
 
Approaches to analysing 1000s of bacterial isolates - ICEID 2015 Atlanta, USA...
Torsten Seemann
 
Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014
Torsten Seemann
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
VHIR Vall d’Hebron Institut de Recerca
 
Next Generation Sequencing for Identification and Subtyping of Foodborne Pat...
nist-spin
 
Rapid outbreak characterisation - UK Genome Sciences 2014 - wed 3 sep 2014
Torsten Seemann
 
Errors and Limitaions of Next Generation Sequencing
Nixon Mendez
 
T-bioinfo overview
Jaclyn Williams
 
Haendel clingenetics.3.14.14
mhaendel
 
20170209 ngs for_cancer_genomics_101
Ino de Bruijn
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
jennomics
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
VHIR Vall d’Hebron Institut de Recerca
 
Benchmarking 16S rRNA gene sequencing and bioinformatics tools for identifica...
Luca Cozzuto
 
Metagenomics sequencing
cdgenomics525
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
VHIR Vall d’Hebron Institut de Recerca
 
Next Generation Sequencing Informatics - Challenges and Opportunities
Chung-Tsai Su
 
SPIN Workshop Microbial Genomics @NIST
Nathan Olson
 
Ad

Similar to Computational Resources In Infectious Disease (20)

PPTX
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
PDF
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
Bonnie Hurwitz
 
PPT
Reproducible bioinformatics pipelines with Docker and Anduril
Christian Frech
 
PPTX
Reproducibility: 10 Simple Rules
Annika Eriksson
 
PPTX
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
Chunlei Wu
 
PPTX
Production Bioinformatics, emphasis on Production
Chris Dwan
 
PDF
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Functional Genomics Data Society
 
PPTX
2015 genome-center
c.titus.brown
 
PPTX
How to be a bioinformatician
Christian Frech
 
PDF
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Yaoyu Wang
 
PPTX
Enabling Large Scale Sequencing Studies through Science as a Service
Justin Johnson
 
PDF
Open PHACTS April 2017 Science webinar Workflow tools
open_phacts
 
PDF
Reproducible Research and the Cloud
Microsoft Azure for Research
 
PPTX
Software Sustainability: Better Software Better Science
Carole Goble
 
PDF
Overview of Next Gen Sequencing Data Analysis
Bioinformatics and Computational Biosciences Branch
 
PPTX
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Golden Helix Inc
 
PPTX
CLC bio presentation at 5th SFAF 6/3/2010
Saul Kravitz
 
PDF
Initial steps towards a production platform for DNA sequence analysis on the ...
Barbera van Schaik
 
PDF
Ontologies Ontop Databases
Martín Rezk
 
PPTX
Enhancing the Quality of ImmPort Data
Barry Smith
 
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
Bonnie Hurwitz
 
Reproducible bioinformatics pipelines with Docker and Anduril
Christian Frech
 
Reproducibility: 10 Simple Rules
Annika Eriksson
 
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
Chunlei Wu
 
Production Bioinformatics, emphasis on Production
Chris Dwan
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Functional Genomics Data Society
 
2015 genome-center
c.titus.brown
 
How to be a bioinformatician
Christian Frech
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Yaoyu Wang
 
Enabling Large Scale Sequencing Studies through Science as a Service
Justin Johnson
 
Open PHACTS April 2017 Science webinar Workflow tools
open_phacts
 
Reproducible Research and the Cloud
Microsoft Azure for Research
 
Software Sustainability: Better Software Better Science
Carole Goble
 
Overview of Next Gen Sequencing Data Analysis
Bioinformatics and Computational Biosciences Branch
 
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Golden Helix Inc
 
CLC bio presentation at 5th SFAF 6/3/2010
Saul Kravitz
 
Initial steps towards a production platform for DNA sequence analysis on the ...
Barbera van Schaik
 
Ontologies Ontop Databases
Martín Rezk
 
Enhancing the Quality of ImmPort Data
Barry Smith
 
Ad

Recently uploaded (20)

PDF
Paleoseismic activity in the moon’s Taurus-Littrowvalley inferred from boulde...
Sérgio Sacani
 
PDF
Evaluating Benchmark Quality: a Mutation-Testing- Based Methodology
ESUG
 
PPTX
ANTIANGINAL DRUGS.pptx m pharm pharmacology
46JaybhayAshwiniHari
 
PPTX
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
PPTX
first COT (MATH).pptxCSAsCNKHPHCouAGSCAUO:GC/ZKVHxsacba
DitaSIdnay
 
PPTX
The Obesity Paradox. Friend or Foe ?pptx
drdgd1972
 
PPTX
Embark on a journey of cell division and it's stages
sakyierhianmontero
 
PPTX
Pharmacognosy: ppt :pdf :pharmacognosy :
Vishnukanchi darade
 
PPT
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
PPTX
RED ROT DISEASE OF SUGARCANE.pptx
BikramjitDeuri
 
PDF
Challenges of Transpiling Smalltalk to JavaScript
ESUG
 
PPTX
Modifications in RuBisCO system to enhance photosynthesis .pptx
raghumolbiotech
 
PPTX
Hericium erinaceus, also known as lion's mane mushroom
TinaDadkhah1
 
PPTX
Quality control test for plastic & metal.pptx
shrutipandit17
 
PDF
Sujay Rao Mandavilli Multi-barreled appraoch to educational reform FINAL FINA...
Sujay Rao Mandavilli
 
PPTX
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
PPTX
INTRO-TO-CRIM-THEORIES-OF-CRIME-2023 (1).pptx
ChrisFlickIII
 
PPTX
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
PPTX
Laboratory design and safe microbiological practices
Akanksha Divkar
 
PPTX
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 
Paleoseismic activity in the moon’s Taurus-Littrowvalley inferred from boulde...
Sérgio Sacani
 
Evaluating Benchmark Quality: a Mutation-Testing- Based Methodology
ESUG
 
ANTIANGINAL DRUGS.pptx m pharm pharmacology
46JaybhayAshwiniHari
 
Sleep_pysilogy_types_REM_NREM_duration_Sleep center
muralinath2
 
first COT (MATH).pptxCSAsCNKHPHCouAGSCAUO:GC/ZKVHxsacba
DitaSIdnay
 
The Obesity Paradox. Friend or Foe ?pptx
drdgd1972
 
Embark on a journey of cell division and it's stages
sakyierhianmontero
 
Pharmacognosy: ppt :pdf :pharmacognosy :
Vishnukanchi darade
 
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
RED ROT DISEASE OF SUGARCANE.pptx
BikramjitDeuri
 
Challenges of Transpiling Smalltalk to JavaScript
ESUG
 
Modifications in RuBisCO system to enhance photosynthesis .pptx
raghumolbiotech
 
Hericium erinaceus, also known as lion's mane mushroom
TinaDadkhah1
 
Quality control test for plastic & metal.pptx
shrutipandit17
 
Sujay Rao Mandavilli Multi-barreled appraoch to educational reform FINAL FINA...
Sujay Rao Mandavilli
 
Qualification of.UV visible spectrophotometer pptx
shrutipandit17
 
INTRO-TO-CRIM-THEORIES-OF-CRIME-2023 (1).pptx
ChrisFlickIII
 
Nanofertilizer: Its potential benefits and associated challenges.pptx
BikramjitDeuri
 
Laboratory design and safe microbiological practices
Akanksha Divkar
 
Brain_stem_Medulla oblongata_functions of pons_mid brain
muralinath2
 

Computational Resources In Infectious Disease

  • 1. João André Carriço, Microbiology Institute and Instituto de Medicina Molecular, Faculty of Medicine, University of Lisbon [email protected] twitter: @jacarrico ME081 – Meet-The-Expert Session 26th ECCMID, Amsterdam, Netherlands 7-12 April 2016
  • 2.  This presentation is not intended to cover all available software or databases (we would need several weeks or months to do that)  I’ll present what I use or intend to use in a near future  I gladly accept any suggestions to included on similar presentations in the future.  It is supposed to be interactive so ask away during the presentation.
  • 3.  Available Databases  Virulence Factors and AMR DBs  Sequence-based typing databases: Pubmlst.org / Enterobase  HighThroughput Sequencing data analysis (freeware)  Prokka  Roary  Nullabor  Microreact.org  PHYLOViZ  Commercial Solutions  Bionumerics 7.5  CLC GenomicsWorkbench (CLC Bio)  Ridom Seqsphere+
  • 4. Virulence Factor Databases  VFDB (http://www.mgc.ac.cn/VFs/main.htm)  Pathosystems Resource Integration Center (PATRIC) VF (https)://www.patricbrc.org/)  Victors (http://www.phidias.us/victors/)  PHI-Base (http://www.phi-base.org/)  MvirDB (http://mvirdb.llnl.gov/ ) To know more: - Presentation on the Controversies in interpreting whole genome sequence data session : http://eccmidlive.org/#resources/how-can-we-design-actionable-virulome-databases
  • 5.  Comprehensive Antibiotic Resistance Database (CARD) (https://card.mcmaster.ca/)  Repository of Antibiotic resistanceCassetes (RAC)(http://rac.aihi.mq.edu.au/rac/)  Integrall :The integron database (http://integrall.bio.ua.pt/) (…)
  • 6. To know more : http://www.slideshare.net/nickloman/eccmid-2015-so-i-have-sequenced-my-genome-what-now Reads (fastq files) contigs (fasta files) Annotated contigs (gbk/gff files) Roary :PanGenome Analysis Enterobase BIGSdb Nullabor PHYLOViZ: Tree + metada visualization Microreact.org: Tree +metadata +vizualization Prokka De novo assembler
  • 8. slide by @happy_khan Martin Sergeant Mark Achtman Nabil-Fareed Alikhan Zhemin Zhou
  • 9.  Genome annotation made easy byTorsten Seemann (slides byTorsten)  Genome annotation: adding biological information to the sequence, by describing features To know more : http://www.slideshare.net/torstenseemann/prokka-rapid-bacterial-genome-annotation-abphm-2013 Available at: https://github.com/tseemann/prokka
  • 10.  Pan genome analysis by Andrew Page  Available at: https://sangerpathogens.github.io/Roary/ Core genome Accessory genome Pan-genome
  • 11.  Inputs:Annotated de novo assemblies (GFF files) • Typically from the annotation pipeline  Outputs: • Spreadsheet with presence and absence of genes • Multi-FASTA alignment of core genes so you can build a tree without a reference • Multi-FASTA alignments for each gene • Plots for the open/closed genome, unique genes • Integrates with iCANDY so you can visualise all structural variation • QC report from Kraken to help identify suspect samples (Slide by Andrew Page)
  • 12. Core (n or n-1 strains) Soft-Core (n-2 or n-3 strains) Shell ( 8(?) to n-3 strains) Cloud ( <8 (?) strains) Core genome: Core + Soft-Core Accessory genome: Shell + Cloud
  • 13. iCANDY output of presence and absence of genes in accessory genome. S. Weltevreden & public S. enterica genomes (Slide by Andrew Page)
  • 14.  Complete pipeline from reads to reports byTorsten Seemann  Objective is automate analysis for everyday use on public health labs /research settings  Uses and distills outputs by a lot of software  Avaliable at: https://github.com/tseemann/nullarbor
  • 19. Inputs: - Tab separated txt (profiles) - Fasta files - Automatic database retrieval (MLST) Outputs: • goeBURST and goeBURST MST • Link quality assessment • High quality images Can be easily applied to: - MLST/ cgMLST/wgMLST - MLVA - SNP data* - Gene Presence/absence
  • 20. New features: • Hierarchical clustering • Neighbor-Joining • Project Saving
  • 21.  Available at http://online.phyloviz.net  Web based version of PHYLOViZ  Allows users to create their own datasets, save them and share their data (privately or publicly)  REST API available  Scalable to thousands of nodes  Tree Analysis tools:  Interactive distance matrix  NLV graph
  • 28.  Available at http://microreact.org/  Presentation on session Harnessing whole genome sequence data for public health applications : Novel open access tools forWGS- based pathogen surveillance and the identification of high-risk clones  http://eccmidlive.org/#resources/novel-open-access-tools-for- wgs-based-pathogen-surveillance-and-the-identification-of-high- risk-clones
  • 30. • Ridom Seqsphere+ : http://www.ridom.de/seqsphere/ • Applied Maths Bionumerics 7.6: http://www.applied-maths.com/bionumerics • CLCBioGenomicWorkbench : http://www.clcbio.com/blog/clc-genomics-workbench-7-5/
  • 31. • Huge variety of software and database solutions • There is no single One-Size-Fits-All solution (job security for bioinformaticians) • Different questions require different approaches • Always questions the results and data provenance
  • 32.  ECCMID2015 Meet-the-expert session on “What bioinformatic tools should I use for analysis of HighThroughput Sequencing data for molecular diagnostics? ”  Nick Loman: http://www.slideshare.net/nickloman/eccmid-2015- meettheexpert-bioinformatics-tools  João André Carriço: http://www.slideshare.net/joaoandrecarrico/eccmid-meet- theexpert2015
  • 33.  UMMI Members  Bruno Gonçalves  Mário Ramirez  José Melo-Cristino  INESC-ID  Alexandre Francisco  Cátia Vaz  Marta Nascimento  EFSA INNUENDO Project (https://sites.google.com/site/innuendocon/)  Mirko Rossi  FP7 PathoNGenTrace (http://www.patho-ngen-trace.eu/):  Dag Harmsen (Univ. Muenster)  Stefan Niemann (Research Center Borstel)  Keith Jolley, James Bray and Martin Maiden (Univ. Oxford)  Joerg Rothganger (RIDOM)  Hannes Pouseele (Applied Maths)  Genome Canada IRIDA project (www.irida.ca)  Franklin Bristow, Thomas Matthews, Aaron Petkau, Morag Graham and Gary Van Domselaar(NLM , PHAC)  Ed Taboada and Peter Kruczkiewicz (LabFoodborne Zoonoses, PHAC)  Fiona Brinkman (SFU)  William Hsiao (BCCDC) INTEGRATED RAPID INFECTIOUS DISEASE ANALYSIS