The next terminal – Jupyter
With examples from Bioinformatics
@lynnlangit
“
”
How often do you use
the terminal?
@lynnlangit
Terminal Customizations
Prompt Output Aesthetics Code Comments Graphics
@lynnlangit
Terminal improved
Terminal improved
What does this Code do?
@lynnlangit
“
”
But it’s not good enough
Why not?
@lynnlangit
Machine Learning
Too much data to process? Or too much code? Can you ‘see’ what is happening?
@lynnlangit
What does this Code do?
Which algorithm?
@lynnlangit
Visualizing Data Processing ML Code
Which algorithm?
@lynnlangit
Now – more data, much more…
IoT increases data volume and complexity exponentially
@lynnlangit
“
”
Inspired by
Mathematica
Thanks Stephen Wolfram
If you can SEE it (your data and code), you can work with it better
@lynnlangit
Next terminal -> a better Python REPL
• Fernando Pérez started IPython (interactive Python) in 2001
• Notebook interface modeled on Mathematica notebooks
• IP[y]: Notebook -> runs in a browser
• 2014: IPython Notebook -> Jupyter Notebook
@lynnlangit
Enter Jupyter Notebooks
@lynnlangit
Jupyter Notebooks support the ML lifecycle
1. Collect Data: retrieve files, query SQL databases, call web services, “scrape” web pages
2. Prepare Data: explore data, validate data, clean data, features / data
3. Train Model: prepare training set, experiment, test model, visualize
4. Evaluate Model: test performance, compare models, validate model, visualize
5. Deploy Model: export model file, prepare job, deploy container, re-package model
Execute code blocks:
- Python, R… code
- SQL queries
- Shell commands
Write documentation:
- Markdown language
Visualize data:
- Viz tools…
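The "execute code blocks" idea above can be sketched in a single Python cell that runs a SQL query and then an external command. This is only an illustration: the `samples` table and its rows are made up, an in-memory SQLite database stands in for a real data source, and the subprocess simply calls the Python interpreter so the sketch does not depend on a particular shell.

```python
import sqlite3
import subprocess
import sys

# Hypothetical in-memory table standing in for a real SQL data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE samples (id INTEGER, label TEXT)")
conn.executemany(
    "INSERT INTO samples VALUES (?, ?)",
    [(1, "case"), (2, "control"), (3, "case")],
)

# "SQL queries": count rows per label and collect the result as a dict.
label_counts = dict(
    conn.execute("SELECT label, COUNT(*) FROM samples GROUP BY label")
)
conn.close()

# "Shell commands": run an external process and capture its output.
result = subprocess.run(
    [sys.executable, "-c", "print('hello from a subprocess')"],
    capture_output=True,
    text=True,
    check=True,
)
shell_output = result.stdout.strip()

print(label_counts)
print(shell_output)
```

Inside a notebook the same steps are often split across several cells, so each intermediate result stays visible next to the code that produced it.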
Jupyter Visualizations –
so many possibilities
Notebook Customizations
• Multiple runtimes / languages
• Share output
• Code or equations: LaTeX math
• Comments: Markdown, wiki-like
• Graphics: visualizations, charting results
• Live documentation
• Reproducible research
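To make the "live documentation" idea concrete, here is a minimal sketch that builds a notebook file by hand, with one Markdown cell containing a LaTeX formula and one code cell. It assumes the nbformat 4 JSON layout; the file name `example.ipynb`, the helper `build_notebook`, and the cell contents are made up for illustration.

```python
import json
import os
import tempfile


def build_notebook(markdown_text, code_text):
    """Return a minimal nbformat-4 notebook as a plain dict."""
    return {
        "nbformat": 4,
        "nbformat_minor": 4,
        "metadata": {},
        "cells": [
            # Markdown cell: prose and LaTeX math live next to the code.
            {"cell_type": "markdown", "metadata": {}, "source": markdown_text},
            # Code cell: not yet executed, so no outputs and no execution count.
            {
                "cell_type": "code",
                "execution_count": None,
                "metadata": {},
                "outputs": [],
                "source": code_text,
            },
        ],
    }


notebook = build_notebook(
    "# Gini impurity\n\n$G = 1 - \\sum_k p_k^2$",
    "p = [0.5, 0.5]\nprint(1 - sum(x * x for x in p))",
)

# Write the notebook to a temporary directory as ordinary JSON.
path = os.path.join(tempfile.mkdtemp(), "example.ipynb")
with open(path, "w", encoding="utf-8") as handle:
    json.dump(notebook, handle, indent=1)
```

Because the file is plain JSON, it can be versioned and shared like any other text file.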
@lynnlangit
Example
Jupyter locally
@lynnlangit
Mathematica evolved…
Jupyter Notebook
• Market leader
• Started for single-user use
• Academic community
• GitHub integration
• Added JupyterHub for collaboration
Zeppelin Notebook
• Started for collaboration
• Enterprise security
Vendor Notebook
• Databricks for Apache Spark
• Jupyter-like, but proprietary format
@lynnlangit
Running Notebooks
• Desktop: install and run
• Local server: can use JupyterHub for groups
• Cloud: large number of options
@lynnlangit
Extending, Refactoring Open Notebooks
• Write functions in one notebook
• Link to another notebook
• Write extensions (nbextensions.com)
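One way to reuse functions written in another notebook can be sketched with the standard library alone: read the other notebook's JSON, execute its code cells, and pick the functions out of the resulting namespace. This is an illustrative approach, not a Jupyter feature; `load_notebook_namespace`, `helpers.ipynb`, and `gc_content` are hypothetical names, and cells that contain IPython magics (lines starting with `%` or `!`) would not run this way.

```python
import json
import os
import tempfile


def load_notebook_namespace(path):
    """Execute the code cells of a notebook file and return their namespace."""
    with open(path, encoding="utf-8") as handle:
        notebook = json.load(handle)
    namespace = {}
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # nbformat allows the source to be a string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        exec(source, namespace)
    return namespace


# Hypothetical helper notebook with one Markdown cell and one code cell.
helpers = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {}, "source": "Shared helpers"},
        {
            "cell_type": "code",
            "execution_count": None,
            "metadata": {},
            "outputs": [],
            "source": [
                "def gc_content(seq):\n",
                "    seq = seq.upper()\n",
                "    return (seq.count('G') + seq.count('C')) / len(seq)\n",
            ],
        },
    ],
}
helpers_path = os.path.join(tempfile.mkdtemp(), "helpers.ipynb")
with open(helpers_path, "w", encoding="utf-8") as handle:
    json.dump(helpers, handle)

# "Link" to the helper notebook and reuse its function here.
shared = load_notebook_namespace(helpers_path)
gc_content = shared["gc_content"]
print(gc_content("GATTACA"))
```

Only run notebooks you trust this way, since every code cell is executed as is.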
Up the bar
Personalized medicine via genomic analysis
@lynnlangit
Reproducible Research – Experiments as Code
@lynnlangit
Bioinformatics | Denis C. Bauer | @allPowerde
Two Examples: Genomic Research Tools
• GT-Scan2: How can genome engineering be made more effective?
• VariantSpark: How to find disease genes in population-size cohorts?
Transformational Bioinformatics | Denis C. Bauer | @allPowerde
Machine learning…
on 1.7 Trillion data points
https://www.projectmine.com/about/
Bioinformatics | Denis C. Bauer | @allPowerde|
VariantSpark - Parallelize Random Forest for scalability
• Spark ML’s RF was designed for ‘big’ low-dimensional data.
• The full genome-wide profile does NOT fit into an executor’s memory.
“Cursed” big data, e.g. genomics:
• Moderate number of samples with many features
• Feature set too large to be handled by a single executor
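A rough back-of-the-envelope calculation shows why such a data set does not fit into one executor. The 1.7 trillion data points come from the slides; the storage cost of one byte per data point and the 32 GiB executor size are assumptions chosen only for illustration.

```python
import math

DATA_POINTS = 1.7e12       # figure quoted in the slides
BYTES_PER_POINT = 1        # assumption: one byte per stored value
EXECUTOR_MEMORY_GIB = 32   # assumption: hypothetical executor size

# Total raw size in GiB, ignoring any object or serialization overhead.
total_gib = DATA_POINTS * BYTES_PER_POINT / 2**30

# Number of executors whose combined memory would be needed just to hold the data.
executors_needed = math.ceil(total_gib / EXECUTOR_MEMORY_GIB)

print(f"about {total_gib:,.0f} GiB of raw data")
print(f"at least {executors_needed} executors of {EXECUTOR_MEMORY_GIB} GiB each")
```

Real memory use would be higher, since the estimate leaves out working space for the algorithm itself.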
Bioinformatics | Denis C. Bauer | @allPowerde|
Firas Abuzaid (Spark Summit 2016), Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark
Flip the matrix: partition by column
VariantSpark - Parallelize RF to scale with features
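The "partition by column" idea can be sketched in plain Python: transpose the sample-by-feature matrix, deal the feature columns out to partitions, let each partition find its best split feature independently, and then keep the overall winner. This is a toy illustration under stated assumptions (binary features, Gini impurity, lists standing in for Spark executors); it is not the VariantSpark or Yggdrasil implementation, and all function names are made up.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def partition_by_column(rows, n_partitions):
    """Transpose sample rows into feature columns and deal them out round-robin."""
    columns = list(zip(*rows))
    partitions = [dict() for _ in range(n_partitions)]
    for index, column in enumerate(columns):
        partitions[index % n_partitions][index] = column
    return partitions


def best_feature_in_partition(partition, labels):
    """Return (impurity decrease, feature index) for the best feature held locally."""
    parent = gini(labels)
    scored = []
    for index, column in partition.items():
        left = [y for x, y in zip(column, labels) if x == 0]
        right = [y for x, y in zip(column, labels) if x != 0]
        weighted = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        scored.append((parent - weighted, index))
    return max(scored)


# Four samples (rows) with four binary features (columns); made-up data.
rows = [
    [0, 1, 0, 1],
    [0, 0, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
]
labels = ["control", "control", "case", "case"]

partitions = partition_by_column(rows, n_partitions=2)

# Each partition only needs its own columns plus the (small) label vector.
local_best = [best_feature_in_partition(p, labels) for p in partitions if p]

# Combining the per-partition winners gives the global best split.
best_gain, best_index = max(local_best)
print(best_gain, best_index)
```

The point of the layout is that no partition ever has to hold all features at once, which is what makes wide data sets tractable.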
Bioinformatics | Denis C. Bauer | @allPowerde|
Wide RF scalable with features and samples
New -- Python API for VariantSpark
# imports (assumption: the VariantSpark Python package is importable as `varspark`)
from pyspark.sql import SparkSession
from varspark import VariantsContext

# set up context and input parameters
# `sc` is the SparkContext that a PySpark notebook session provides
spark = SparkSession(sc)
vc = VariantsContext(spark)
label = vc.load_label('dius/data/chr22-labels.csv', 'col_name')
features = vc.import_vcf('dius/data/chr22_1000.vcf')

# instantiate analysis (parameters are type-checked)
imp_analysis = features.importance_analysis(label)

# get significant factors as both a tuple list and a dataframe
imp_vars = imp_analysis.important_variables(20)  # top 20 variables
most_imp_var = imp_vars[0][0]                    # first element of the top-ranked tuple
imp_df = imp_analysis.variable_importance()
oob_error = imp_analysis.oob_error()             # out-of-bag error

# convert to work with common Python tools
pandas_imp_df = imp_df.toPandas()
Demo VariantSpark
Jupyter for Genomics Research
@lynnlangit
Cloud-based Jupyter
PaaS
• AWS SageMaker
• Azure Notebooks
• Others…
@lynnlangit
Example - GT-Scan2
Jupyter for Genomics Research
@lynnlangit
Tools for Jupyter
• Binder for GitHub
• Point it to your GitHub repo containing:
- Jupyter notebooks
- requirements.txt
• It builds a Docker image
• You can run your notebooks
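Pointing Binder at a repository comes down to opening a launch URL. The sketch below builds one, assuming the public mybinder.org service and its `v2/gh/<owner>/<repo>/<ref>` URL layout; the helper `binder_launch_url` and the repository names are hypothetical.

```python
from urllib.parse import quote


def binder_launch_url(owner, repo, ref):
    """Build a mybinder.org launch URL for a public GitHub repository."""
    # Percent-encode each path segment so unusual characters stay inside it.
    parts = [quote(part, safe="") for part in (owner, repo, ref)]
    return "https://mybinder.org/v2/gh/" + "/".join(parts)


url = binder_launch_url("example-user", "example-notebooks", "main")
print(url)
```

Opening such a URL asks Binder to build the image from the repository (for example from its requirements.txt) and then start a notebook server on it.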
@lynnlangit
Example
Binder
@lynnlangit
Future of Jupyter for Research
Academic Institutions and Research Labs
UC Berkeley, Davis, San Diego
Cal Poly San Luis Obispo
Clemson University
CU Boulder
U of Illinois, Minnesota, Missouri, Rochester, Texas
MIT
Michigan State U
Texas A & M
@lynnlangit


Understanding Jupyter notebooks using bioinformatics examples


Editor's Notes

  • #5: http://www.omgubuntu.co.uk/2017/06/terminus-modern-highly-configurable-terminal-app-windows-mac-linux
  • #6: telnet towel.blinkenlights.nl
  • #11: Left-skewed, negative distribution
  • #14: History talk from Cristian Prieto (NDC Oslo 2016) -- https://vimeo.com/223984769 http://blog.fperez.org/2012/01/ipython-notebook-historical.html
  • #20: Local install: pip install ipython[all], or use Anaconda, which installs Jupyter notebooks by default; also pip install jupyter[all], and you can also install R as a runtime. You can use Docker (the 2.1 GB image contains all libraries), or you can use Azure Notebooks or AWS SageMaker notebooks. Only Python 2 is installed by default; you can install other runtimes. Start and run in a local browser (no database, uses local .json files): IPython notebook -> localhost:8888/tree. Uses GitHub-flavored Markdown by default. https://dwhsys.com/2017/03/25/apache-zeppelin-vs-jupyter-notebook/
  • #22: https://github.com/ipython-contrib/jupyter_contrib_nbextensions pip install jupyter_contrib_nbextensions -OR- conda install -c conda-forge jupyter_contrib_nbextensions
  • #24: https://github.com/Microsoft/Elevation/blob/master/notebooks/aggregation.ipynb https://www.microsoft.com/en-us/research/project/crispr/
  • #26: Using this instead?
  • #27: Less conclusion, more implementation
  • #32: https://www.gt-scan.net/ -AND- AMA with Dr. Bauer -- https://www.reddit.com/r/science/comments/5fiicm/science_ama_series_im_denis_bauer_a_team_leader/
  • #33: https://medium.com/@lynnlangit/aws-sagemaker-for-bioinformatics-b8e8a96479d8 Jupyter on GCE VM -- https://towardsdatascience.com/running-jupyter-notebook-in-google-cloud-platform-in-15-min-61e16da34d52
  • #36: https://mybinder.org/ allows you to run notebooks stored in GitHub -ALSO- https://nbviewer.jupyter.org/ renders notebooks stored in GitHub as static pages
  • #38: http://jupyterhub-tutorial.readthedocs.io/en/latest/ https://github.com/jupyterhub/jupyterhub-tutorial/blob/master/JupyterHub.pdf http://jupyterhub.readthedocs.io/en/latest/gallery-jhub-deployments.html