SlideShare a Scribd company logo
@dhianadeva
AGILE DATA SCIENCE
Agile Tour 2015 - Niterói
AGENDA
Goal:
Encourage you to be agile using data science on your project.
● About me
● Machine Learning
● Similarities
● Misconceptions
● Non-agile Stories
● Agile meets Data Science
● Brighter Days
ABOUT ME
Electronics Engineering, Software Development and Data
Science… Why not?
DHIANA DEVA
NEURALTB
CERN
NEURALRINGER
DJBRAZIL
DAILY SMALL DATA
HIGGS CHALLENGE
HIGGS CHALLENGE
MACHINE LEARNING
It's all about learning!
CLASSIFICATION
?
A
B
REGRESSION
? 8 15 7 1
11 13 6 3
CLUSTERING
DIMENSIONALITY
REDUCTION
SIMILARITIES
We're born to be <3
LEARNING
NEURAL NETWORKS
SELF-ORGANIZING
MAPS
OCCAM'S RAZOR
Among competing hypotheses, the
one with the fewest assumptions
should be selected.
NON-AGILE STORIES
Wish I knew Martin Fowler back then!
MEMORY LEAK AT CERN
CODE AT CERN
DJBRAZIL?
MISCONCEPTIONS
Agile is not for data science...
BIG UPFRONT
INVESTMENTS
IT TAKES TOO LONG
ONLY FOR PHDs
SILOS
VANITY METRICS
HIPPO
AGILE MEETS
DATA SCIENCE
Agile and data science: a match made in heaven!
COLLABORATION
CONTINUOUS
DEVELOPMENT
STRONG ENGINEERING
PRACTICES
EARLY INSIGHTS
https://www.thoughtworks.com/big-data-analytics
ACTIONABLE INSIGHTS
https://www.thoughtworks.com/big-data-analytics
VALUE DRIVEN
https://www.thoughtworks.com/live/2014/europe/different-approaches-to-agile-analytics-and-customer-engagement
DATA LAKE
http://martinfowler.com/bliki/DataLake.html
AGILE ANALYTICS
BRIGHTER DAYS
We're living it!
MASSIVE ONLINE
OPEN COURSES
OPEN SOURCE TOOLS
PAY-AS-YOU-GO
SERVICES
A/B TESTING TOOLS
ANALYTICS TOOLS
VISUALIZATION TOOLS
WANT MORE?
THANK YOU
Questions?
Dhiana Deva
ddeva@thoughtworks.com

More Related Content

PDF
Agile data science
Joel Horwitz
 
PDF
Agile Data Science
Volodymyr Kazantsev
 
PPTX
Agile Data Science
Alexander Bauer
 
PPT
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
The Hive
 
PPSX
Data Science 101
odsc
 
PPTX
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 
PPTX
Hadoop Meets Scrum
Rommel Garcia
 
Agile data science
Joel Horwitz
 
Agile Data Science
Volodymyr Kazantsev
 
Agile Data Science
Alexander Bauer
 
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
The Hive
 
Data Science 101
odsc
 
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 
Hadoop Meets Scrum
Rommel Garcia
 

What's hot (11)

PDF
From Data to Visualization, what happens in between?
Krist Wongsuphasawat
 
PDF
Agile Data Science 2.0
Russell Jurney
 
PDF
Intro to Python for Data Science
TJ Stalcup
 
PPTX
Fortune Teller API - Doing Data Science with Apache Spark
Bas Geerdink
 
PDF
Applied Data Science Course Part 1: Concepts & your first ML model
Dataiku
 
PPTX
Dataiku - From Big Data To Machine Learning
Dataiku
 
PDF
Dataiku productive application to production - pap is may 2015
Dataiku
 
PPTX
Python for Data Science with Anaconda
Travis Oliphant
 
PDF
Python for Data Science
Gabriel Moreira
 
PDF
Open Data Science Conference Agile Data
DataKitchen
 
PPTX
Introduction to Data Science
Caserta
 
From Data to Visualization, what happens in between?
Krist Wongsuphasawat
 
Agile Data Science 2.0
Russell Jurney
 
Intro to Python for Data Science
TJ Stalcup
 
Fortune Teller API - Doing Data Science with Apache Spark
Bas Geerdink
 
Applied Data Science Course Part 1: Concepts & your first ML model
Dataiku
 
Dataiku - From Big Data To Machine Learning
Dataiku
 
Dataiku productive application to production - pap is may 2015
Dataiku
 
Python for Data Science with Anaconda
Travis Oliphant
 
Python for Data Science
Gabriel Moreira
 
Open Data Science Conference Agile Data
DataKitchen
 
Introduction to Data Science
Caserta
 
Ad

Viewers also liked (7)

PDF
Agile data science: Distributed, Interactive, Integrated, Semantic, Micro Ser...
Andy Petrella
 
PDF
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Venveo
 
PDF
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
 
PDF
Lean Product Management for Enterprises: The Art of Known Unknowns
Thoughtworks
 
PDF
Agile Data Science 2.0 - Big Data Science Meetup
Russell Jurney
 
PDF
You Can't be Agile When you are Knee Deep in Mud
Thoughtworks
 
PDF
7 Dimensions of Agile Analytics by Ken Collier
Thoughtworks
 
Agile data science: Distributed, Interactive, Integrated, Semantic, Micro Ser...
Andy Petrella
 
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Venveo
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
 
Lean Product Management for Enterprises: The Art of Known Unknowns
Thoughtworks
 
Agile Data Science 2.0 - Big Data Science Meetup
Russell Jurney
 
You Can't be Agile When you are Knee Deep in Mud
Thoughtworks
 
7 Dimensions of Agile Analytics by Ken Collier
Thoughtworks
 
Ad

Similar to Agile Data Science (20)

PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Agile Impact Conference
 
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Agile Impact
 
PDF
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile ME
 
PPTX
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Vivian S. Zhang
 
PDF
Data kitchen 7 agile steps - big data fest 9-18-2015
DataKitchen
 
PPTX
Agile Analytics
Atif Shaikh
 
PPTX
DataOps: Nine steps to transform your data science impact Strata London May 18
Harvinder Atwal
 
PPTX
Data science for BE subject code is 2cs642
Sanjay Kumar
 
PPTX
Everything you wanted to know about data ops
Enov8
 
PPTX
This is Data Types of Python Programming Language
uf980966
 
PDF
Agile Data
odsc
 
PDF
Success Through an Actionable Data Science Stack
Domino Data Lab
 
PDF
Big Data LA 2016: Backstage to a Data Driven Culture
Pauline Chow
 
PDF
Data science hypes and reality
Helge Johannessen Bjorland
 
PDF
Data and data scientists are not equal to money david hoyle
Institute of Contemporary Sciences
 
PDF
GTU GeekDay Data Science and Applications
Kürşat İNCE
 
DOCX
What is Data Science?
Ahmed Banafa
 
PDF
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
PPTX
Göteborg university(condensed)
Zenodia Charpy
 
PDF
Where to study Data Science Course in Kerala
nitro1998arun
 
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Agile Impact Conference
 
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Agile Impact
 
Agile Analytics: Delivering on Promises by Atif Abdul Rahman
Agile ME
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Vivian S. Zhang
 
Data kitchen 7 agile steps - big data fest 9-18-2015
DataKitchen
 
Agile Analytics
Atif Shaikh
 
DataOps: Nine steps to transform your data science impact Strata London May 18
Harvinder Atwal
 
Data science for BE subject code is 2cs642
Sanjay Kumar
 
Everything you wanted to know about data ops
Enov8
 
This is Data Types of Python Programming Language
uf980966
 
Agile Data
odsc
 
Success Through an Actionable Data Science Stack
Domino Data Lab
 
Big Data LA 2016: Backstage to a Data Driven Culture
Pauline Chow
 
Data science hypes and reality
Helge Johannessen Bjorland
 
Data and data scientists are not equal to money david hoyle
Institute of Contemporary Sciences
 
GTU GeekDay Data Science and Applications
Kürşat İNCE
 
What is Data Science?
Ahmed Banafa
 
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
Göteborg university(condensed)
Zenodia Charpy
 
Where to study Data Science Course in Kerala
nitro1998arun
 

More from Dhiana Deva (10)

PDF
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Dhiana Deva
 
PDF
Machine Learning in Python - PyLadies Stockholm
Dhiana Deva
 
PDF
Machine Learning for Everyone
Dhiana Deva
 
PDF
Um Pouquinho Sobre Métodos Ágeis - Rails Girls SP
Dhiana Deva
 
PDF
QCon Rio - Machine Learning for Everyone
Dhiana Deva
 
PDF
We love NLTK
Dhiana Deva
 
PDF
My First Attempt on Kaggle - Higgs Machine Learning Challenge: 755st and Proud!
Dhiana Deva
 
PDF
AR Post-its @ CBSOFT
Dhiana Deva
 
PDF
Self-Organizing Maps 101 (Dhiana Deva)
Dhiana Deva
 
PDF
Sistemas de recomendação
Dhiana Deva
 
Machine Learning: Opening the Pandora's Box - Dhiana Deva @ QCon São Paulo 2019
Dhiana Deva
 
Machine Learning in Python - PyLadies Stockholm
Dhiana Deva
 
Machine Learning for Everyone
Dhiana Deva
 
Um Pouquinho Sobre Métodos Ágeis - Rails Girls SP
Dhiana Deva
 
QCon Rio - Machine Learning for Everyone
Dhiana Deva
 
We love NLTK
Dhiana Deva
 
My First Attempt on Kaggle - Higgs Machine Learning Challenge: 755st and Proud!
Dhiana Deva
 
AR Post-its @ CBSOFT
Dhiana Deva
 
Self-Organizing Maps 101 (Dhiana Deva)
Dhiana Deva
 
Sistemas de recomendação
Dhiana Deva
 

Recently uploaded (20)

PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PDF
oop_java (1) of ice or cse or eee ic.pdf
sabiquntoufiqlabonno
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PDF
Research about a FoodFolio app for personalized dietary tracking and health o...
AustinLiamAndres
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PDF
Chad Readey - An Independent Thinker
Chad Readey
 
PDF
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
oop_java (1) of ice or cse or eee ic.pdf
sabiquntoufiqlabonno
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Research about a FoodFolio app for personalized dietary tracking and health o...
AustinLiamAndres
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
Chad Readey - An Independent Thinker
Chad Readey
 
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
short term internship project on Data visualization
JMJCollegeComputerde
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 

Agile Data Science