SlideShare a Scribd company logo
José Enrique Ruiz
Segundo Encuentro Python Granada
Centro de Enseñanzas Virtuales de la Universidad de Granada – CEVUG
26th June 2015
Astronomy research lifecycle is entirely digital
»  Observation proposals
»  Data reduction pipelines
»  Analysis of science ready data
»  Catalogs of objects and data archives
»  Publish process - ADS/arXiv
›  Materials and Methods
›  Results
›  Discussion
Reproducible research is still not
possible in a digital world
A rich infrastructure of data is not
efficiently used
A normalized preservation of
methodology is needed
Tools
Digital Astronomy
The Reproducibility Crisis
http://xkcd.com/242/
Optimize return on investments made on big facilities
»  Avoid duplication of efforts and reinvention
»  How to discover and not duplicate ?
»  How to re-use and not duplicate ?
»  How to make use of best practices ?
»  How to use the rich infrastructure of data ?
»  Intellectual contributions encoded in software
More data in archives do not imply more
knowledge
»  Expose complete scientific process, not the story
»  Allow easy discovery of methods and tools
Visibility, Efficiency and Reuse
The Reproducibility Crisis
More R’s than Pirates !
Prof. David de Roure
http://www.scilogs.com/eresearch/replacing-the-paper-the-twelve-rs-of-the-e-research-record/
http://www.scilogs.com/eresearch/more-rs-than-pirates/
Open Science is much more than Open Data
Repeatable
Tools and methodology available
I could repeat the experiment
Reproducible
Clear methodology and all resources available
I could reproduce the results
Reusable
I know how it could be useful for my needs
I could use all or some parts as it is
I could modify and adapt it even for other purposes
Story Capture
Going beyond automation
Organization
# CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag
1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383
2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157
3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457
4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685
5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128
72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735
Story Capture
# CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag
1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383
2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157
3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457
4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685
5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128
72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735
Capture
Actions, Tasks, Dependencies, Provenance
Improve Clarity / Documentation
Story Capture
Research Objects
Distributed
Technical Objects Social Objects
Expose experimental context in a structured way in order to be understood
http://www.researchobject.org/
The Social Dimension
Visibility and Discovery
The Executable Paper
Time has come to go beyond the PDF
IPython Notebooks
All started as a browser-wrapper for iPython
A working methodology
»  A Tool – web browser as the working desktop
»  Code, plots and data, living with rich-text documentation
»  Capture exploratory and analysis tasks
»  Multi-language in the very same script / recipe
»  Fully extensible and customizable
»  Desktop and server-side working environment
»  Cloud-based adaptive to scalable computing environment
»  Wiki-like scripts fully shareable, re-usable and executable
»  Training - executable tutorials that reduce learning curve
»  Published in social platforms with Git versioning
»  From scripts/ recipes towards bigger modular projects /books
IPython Jupyter Notebooks
Try it !
Local desktop: ipython notebook in the terminal
In the Cloud: http://try.jupyter.org in the browser
The Library
Visualization of static notebooks previously published as gists
shareable via URL link, may be downloaded and executed
locally
http://nbviewer.ipython.org
Cloud Solutions
http://lmgtfy.com/?q=ipython+notebook+examples#
17
The Library
Visualization of static notebooks previously published as gists
shareable via URL link, may be downloaded and executed
locally
http://nbviewer.ipython.org
Cloud Solutions
http://lmgtfy.com/?q=ipython+notebook+examples#
https://github.com/ipython/ipython/wiki/IPython-kernels-for-other-languages
https://ipython.org/ipython-doc/dev/interactive/magics.html
http://nbviewer.ipython.org/github/ipython/ipython/blob/1.x/examples/notebooks/Cell%20Magics.ipynb
https://ipython.org/ipython-doc/dev/interactive/magics.html
http://nbviewer.ipython.org/github/ipython/ipython/blob/1.x/examples/notebooks/Cell%20Magics.ipynb
https://github.com/ipython/ipython/wiki/Extensions-Index
Customizing a Notebook Server
https://github.com/ipython-contrib/IPython-notebook-extensions
Javascript extensions for added UI functionality
Customizing a Notebook Server
Customizing a Notebook Server
»  Access to common global functions and variables
»  Functions as wrappers to access CLI server-side software
»  Integrated access to server-side databases
»  Customized execution framework for a research group
»  Internal library of executable recipes
»  Access restricted content
Modular Complex Notebooks
Interactive Plotting Exploration
Seamless work with databases
Widgets
https://ipython.org/ipython-doc/dev/whatsnew/version2.0.html#interactive-widgets
https://github.com/margudo/LSSGALPY
The Executable Paper
Time has come to go beyond the PDF
The Executable Paper
http://www.nature.com/news/interactive-notebooks-sharing-the-code-1.16261
Publishing Research Objects
Publishing Research Objects
Digital Science - Towards the Executable Paper
http://youtu.be/1dXkmgkYuEg
http://youtu.be/FQ5FaeHcdwo
Collaborative Writing and Versioning
http://youtu.be/ObJG3i3jjPg

More Related Content

PDF
Digital Science: Reproducibility and Visibility in Astronomy
Jose Enrique Ruiz
 
PDF
Research Objects in Wf4Ever
Jose Enrique Ruiz
 
PDF
Digital Science: Towards the executable paper
Jose Enrique Ruiz
 
PDF
Open Science and Executable Papers
Jose Enrique Ruiz
 
PPTX
Virtual Science in the Cloud
thetfoot
 
PDF
Workflows to access and massage VOData
Jose Enrique Ruiz
 
PDF
Workflows in the Virtual Observatory
Jose Enrique Ruiz
 
PDF
Implementing a VO archive for datacubes of galaxies
Jose Enrique Ruiz
 
Digital Science: Reproducibility and Visibility in Astronomy
Jose Enrique Ruiz
 
Research Objects in Wf4Ever
Jose Enrique Ruiz
 
Digital Science: Towards the executable paper
Jose Enrique Ruiz
 
Open Science and Executable Papers
Jose Enrique Ruiz
 
Virtual Science in the Cloud
thetfoot
 
Workflows to access and massage VOData
Jose Enrique Ruiz
 
Workflows in the Virtual Observatory
Jose Enrique Ruiz
 
Implementing a VO archive for datacubes of galaxies
Jose Enrique Ruiz
 

What's hot (20)

PPTX
Big data at experimental facilities
Ian Foster
 
ODP
Introduction NL-HUG (April)
Evert Lammerts
 
PDF
Velocity cubes of galaxies
Jose Enrique Ruiz
 
PPTX
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
Ian Foster
 
PPTX
Empowering Transformational Science
Chelle Gentemann
 
PPTX
Data-intensive applications on cloud computing resources: Applications in lif...
Ola Spjuth
 
PDF
Big Data Modeling Challenges and Machine Learning with No Code
Liana Ye
 
PPT
A New Partnership for Cross-Scale, Cross-Domain eScience
University of Washington
 
PPTX
Scaling collaborative data science with Globus and Jupyter
Ian Foster
 
PPTX
XLDB South America Keynote: eScience Institute and Myria
University of Washington
 
PDF
Big Data Visualization
bigdataviz_bay
 
PDF
A Biological Internet?: Eywa
Eugene Siow
 
PDF
What's New in Cytoscape
Keiichiro Ono
 
PPTX
Accelerating Discovery via Science Services
Ian Foster
 
PPTX
Taming Big Data!
Ian Foster
 
PPTX
Virtual Appliances, Cloud Computing, and Reproducible Research
University of Washington
 
PDF
ieee cloud 2015 keynote talk
Microsoft Azure for Research
 
PPTX
What to Expect of the LSST Archive: The LSST Science Platform
Mario Juric
 
PDF
Big data visualization frameworks and applications at Kitware
bigdataviz_bay
 
PDF
Reproducible Research and the Cloud
Microsoft Azure for Research
 
Big data at experimental facilities
Ian Foster
 
Introduction NL-HUG (April)
Evert Lammerts
 
Velocity cubes of galaxies
Jose Enrique Ruiz
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
Ian Foster
 
Empowering Transformational Science
Chelle Gentemann
 
Data-intensive applications on cloud computing resources: Applications in lif...
Ola Spjuth
 
Big Data Modeling Challenges and Machine Learning with No Code
Liana Ye
 
A New Partnership for Cross-Scale, Cross-Domain eScience
University of Washington
 
Scaling collaborative data science with Globus and Jupyter
Ian Foster
 
XLDB South America Keynote: eScience Institute and Myria
University of Washington
 
Big Data Visualization
bigdataviz_bay
 
A Biological Internet?: Eywa
Eugene Siow
 
What's New in Cytoscape
Keiichiro Ono
 
Accelerating Discovery via Science Services
Ian Foster
 
Taming Big Data!
Ian Foster
 
Virtual Appliances, Cloud Computing, and Reproducible Research
University of Washington
 
ieee cloud 2015 keynote talk
Microsoft Azure for Research
 
What to Expect of the LSST Archive: The LSST Science Platform
Mario Juric
 
Big data visualization frameworks and applications at Kitware
bigdataviz_bay
 
Reproducible Research and the Cloud
Microsoft Azure for Research
 
Ad

Similar to IPython Notebooks - Hacia los papers ejecutables (20)

PDF
JupyterHub for Interactive Data Science Collaboration
Carol Willing
 
PDF
London level39
Travis Oliphant
 
PDF
Language-agnostic data analysis workflows and reproducible research
Andrew Lowe
 
PDF
Jupyter notebooks on steroids
Jose Enrique Ruiz
 
PDF
Jupyter, A Platform for Data Science at Scale
Matthias Bussonnier
 
PPTX
Research Objects for FAIRer Science
Carole Goble
 
PPTX
2014 nicta-reproducibility
c.titus.brown
 
PDF
USING JUPYTERHUB IN THE CLASSROOM: SETUP AND LESSONS LEARNED
ijseajournal
 
PDF
Travis Oliphant "Python for Speed, Scale, and Science"
Fwdays
 
PDF
Scientific Python
Eueung Mulyana
 
PDF
Jupyter: A Gateway for Scientific Collaboration and Education
Carol Willing
 
PPTX
2014 manchester-reproducibility
c.titus.brown
 
PPTX
Blastn plus jupyter on Docker
Lynn Langit
 
PDF
PyData Boston 2013
Travis Oliphant
 
PPT
Some examples of Open Source computational tools for researchers
Ashish Sharma
 
PDF
SciPy Latin America 2019
Travis Oliphant
 
PDF
PLOTCON NYC: The Architecture of Jupyter: Protocols for Interactive Data Expl...
Plotly
 
PDF
Python For Scientists
aeberspaecher
 
PPTX
What is Python? An overview of Python for science.
Nicholas Pringle
 
PDF
Central Iowa Linux Users group July 2019--Jupyter Notebook on a Raspberry Pi
Andrew Denner
 
JupyterHub for Interactive Data Science Collaboration
Carol Willing
 
London level39
Travis Oliphant
 
Language-agnostic data analysis workflows and reproducible research
Andrew Lowe
 
Jupyter notebooks on steroids
Jose Enrique Ruiz
 
Jupyter, A Platform for Data Science at Scale
Matthias Bussonnier
 
Research Objects for FAIRer Science
Carole Goble
 
2014 nicta-reproducibility
c.titus.brown
 
USING JUPYTERHUB IN THE CLASSROOM: SETUP AND LESSONS LEARNED
ijseajournal
 
Travis Oliphant "Python for Speed, Scale, and Science"
Fwdays
 
Scientific Python
Eueung Mulyana
 
Jupyter: A Gateway for Scientific Collaboration and Education
Carol Willing
 
2014 manchester-reproducibility
c.titus.brown
 
Blastn plus jupyter on Docker
Lynn Langit
 
PyData Boston 2013
Travis Oliphant
 
Some examples of Open Source computational tools for researchers
Ashish Sharma
 
SciPy Latin America 2019
Travis Oliphant
 
PLOTCON NYC: The Architecture of Jupyter: Protocols for Interactive Data Expl...
Plotly
 
Python For Scientists
aeberspaecher
 
What is Python? An overview of Python for science.
Nicholas Pringle
 
Central Iowa Linux Users group July 2019--Jupyter Notebook on a Raspberry Pi
Andrew Denner
 
Ad

More from Jose Enrique Ruiz (14)

PDF
Curation and Characterization of Web Services
Jose Enrique Ruiz
 
PDF
Digital Science
Jose Enrique Ruiz
 
PDF
Wf4Ever: Workflow Preservation
Jose Enrique Ruiz
 
PDF
Use of CharDM in an archive of velocity cubes
Jose Enrique Ruiz
 
PDF
Workflow Preservation
Jose Enrique Ruiz
 
PDF
VO web-services-based astronomy workflows
Jose Enrique Ruiz
 
PDF
Web services based workflows to deal with 3D data
Jose Enrique Ruiz
 
PDF
Curating and Preserving Collaborative Digital Experiments
Jose Enrique Ruiz
 
PDF
Collaborative Digital Experiments
Jose Enrique Ruiz
 
PDF
SVO Activities - SEA 2008
Jose Enrique Ruiz
 
PDF
El Observatorio Virtual - eCA
Jose Enrique Ruiz
 
PDF
Multidimensional Data in the VO
Jose Enrique Ruiz
 
PDF
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
Jose Enrique Ruiz
 
PDF
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Jose Enrique Ruiz
 
Curation and Characterization of Web Services
Jose Enrique Ruiz
 
Digital Science
Jose Enrique Ruiz
 
Wf4Ever: Workflow Preservation
Jose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Jose Enrique Ruiz
 
Workflow Preservation
Jose Enrique Ruiz
 
VO web-services-based astronomy workflows
Jose Enrique Ruiz
 
Web services based workflows to deal with 3D data
Jose Enrique Ruiz
 
Curating and Preserving Collaborative Digital Experiments
Jose Enrique Ruiz
 
Collaborative Digital Experiments
Jose Enrique Ruiz
 
SVO Activities - SEA 2008
Jose Enrique Ruiz
 
El Observatorio Virtual - eCA
Jose Enrique Ruiz
 
Multidimensional Data in the VO
Jose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
Jose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Jose Enrique Ruiz
 

Recently uploaded (20)

PDF
KIPER4D situs Exclusive Game dari server Star Gaming Asia
hokimamad0
 
PPTX
原版北不列颠哥伦比亚大学毕业证文凭UNBC成绩单2025年新版在线制作学位证书
e7nw4o4
 
PDF
Cybersecurity Awareness Presentation ppt.
banodhaharshita
 
PDF
Data Protection & Resilience in Focus.pdf
AmyPoblete3
 
PPTX
Google SGE SEO: 5 Critical Changes That Could Wreck Your Rankings in 2025
Reversed Out Creative
 
PPTX
The Internet of Things (IoT) refers to a vast network of interconnected devic...
chethana8182
 
PPTX
办理方法西班牙假毕业证蒙德拉贡大学成绩单MULetter文凭样本
xxxihn4u
 
PPTX
谢尔丹学院毕业证购买|Sheridan文凭不见了怎么办谢尔丹学院成绩单
mookxk3
 
PDF
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
APNIC
 
PDF
PDF document: World Game (s) Great Redesign.pdf
Steven McGee
 
PPTX
Slides Powerpoint: Eco Economic Epochs.pptx
Steven McGee
 
PPTX
Artificial-Intelligence-in-Daily-Life (2).pptx
nidhigoswami335
 
PDF
UI/UX Developer Guide: Tools, Trends, and Tips for 2025
Penguin peak
 
PPTX
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
PPTX
Different Generation Of Computers .pptx
divcoder9507
 
PPTX
LESSON-2-Roles-of-ICT-in-Teaching-for-learning_123922 (1).pptx
renavieramopiquero
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PPTX
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
PDF
LOGENVIDAD DANNYFGRETRRTTRRRTRRRRRRRRR.pdf
juan456ytpro
 
PPT
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 
KIPER4D situs Exclusive Game dari server Star Gaming Asia
hokimamad0
 
原版北不列颠哥伦比亚大学毕业证文凭UNBC成绩单2025年新版在线制作学位证书
e7nw4o4
 
Cybersecurity Awareness Presentation ppt.
banodhaharshita
 
Data Protection & Resilience in Focus.pdf
AmyPoblete3
 
Google SGE SEO: 5 Critical Changes That Could Wreck Your Rankings in 2025
Reversed Out Creative
 
The Internet of Things (IoT) refers to a vast network of interconnected devic...
chethana8182
 
办理方法西班牙假毕业证蒙德拉贡大学成绩单MULetter文凭样本
xxxihn4u
 
谢尔丹学院毕业证购买|Sheridan文凭不见了怎么办谢尔丹学院成绩单
mookxk3
 
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
APNIC
 
PDF document: World Game (s) Great Redesign.pdf
Steven McGee
 
Slides Powerpoint: Eco Economic Epochs.pptx
Steven McGee
 
Artificial-Intelligence-in-Daily-Life (2).pptx
nidhigoswami335
 
UI/UX Developer Guide: Tools, Trends, and Tips for 2025
Penguin peak
 
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
Different Generation Of Computers .pptx
divcoder9507
 
LESSON-2-Roles-of-ICT-in-Teaching-for-learning_123922 (1).pptx
renavieramopiquero
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
LOGENVIDAD DANNYFGRETRRTTRRRTRRRRRRRRR.pdf
juan456ytpro
 
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 

IPython Notebooks - Hacia los papers ejecutables