SlideShare a Scribd company logo
Data + Data Scientists ≠ Money
Dr. David Hoyle
My background
PRODUCING DATA SCIENCE
• 20 yrs in academia
• dunnhumby
• dunnhumby
APPLYING DATA SCIENCE
• Lloyds Banking Group
• AutoTrader UK
• InfinityWorks
The challenges in applying Data Science are very different
Data
Science
Doesn’t
Work
How did we get here?
Your Company Inc.
Cultural and
organizational
challenges
are always
harder than
technical
challenges
• Which parts of the business?
• How should we organize?
• How should we work?
• How should we communicate?
• What support do we need?
• What data should we use?
• SVM vs Logistic Regression?
Where do
companies need
Data Science?
Data touchpoints
Bad guys Account
Manager
Marketing
Developers OEMs
Private seller
£, €, $
Finance
Car Dealer
Consumer
External Data
Internal Data
£ Trade£ Retail
‘Why?’ is a
powerful Data
Science tool How will you consume the outputs?
‘We need a neural network’
‘We want to predict if users will
click this link’
‘Not clicking indicates low user
engagement’
Why?
Why?
‘We can alter the content in
session if engagement is low’
Can you respond to the neural
network output fast enough?
‘Hmmm… No.’
Build cross-functional teams
Data Scientist ≠ Data Engineer
Data
Science Data Engineering
Product
10
Get close to the
business Analytics
Team
Product
Area C
Analyst
Product
Area A
Analyst
Product
Area B
Analyst
Business
Area 2
Analyst
Business
Area 1
Analyst
Data Scientists
+
Data Analysts
Does Agile
always
work for
Data
Science?
All parts of Data Science have
outputs
INFORMATION
PRESENTATION
ALGORITHM
DEVELOPMENT
Easier to communicate outputs
Easier to communicate progress
Harder to communicate outputs
Harder to communicate progress
Always
communicate what
the outputs will be
Understand
business problem
Map to
appropriate
abstraction
Mathematical
statement of
abstraction
Identify type of
mathematical
model required
Identify & explore
potential data
sources
Build, validate, &
test model,
e.g. CRISP-DM
Productionize
model
Deploy
production model
artefacts
Consume model
outputs
Monitor
production model
Re-build
production model Improve model
Data
Science
Data
Engineering
Data
Science
Data
Engineering
Data
Engineering
Data
Engineering
Data
Science
Data
Science
Understand &
conceptualize
the problem
Understand
resources
available & build
model
Incorporate
model into
business
process
Monitor &
improve
The Data Science innovation lifecycle is longer than you think
Data & Compute should
be close together
Operational
Operational
+
Data Warehouse
SQL, BI
Not all data is valuable
Either
Your data is valuable to
you – e.g. helps improve
business processes
Or
Your data is valuable to
someone else – e.g.
gives a market wide view
To make Data Science pay you need to
1. Work on the projects with direct P&L impact
2. …..by asking the right business questions up-front
3. …..using teams that have the technical right skills
4. …..and understand the business challenges
5. …..using Agile methodologies where appropriate
6. …..always communicating what you are doing and why
7. …..with the right tools and on the right data
Data and data scientists are not equal to money   david hoyle

More Related Content

PDF
Playing Nice in the Product Playground #StrataHadoop
Intuit Inc.
 
PPTX
Playing Nice in the Product Playground
Intuit Inc.
 
PDF
BA and Beyond 19 Andrej Guštin - Mirror mirror on the wall Who's the wisest o...
BA and Beyond
 
PDF
BA and Beyond 20 - Antonio Gonzalez Sanchis - Add some RICE to your organisation
BA and Beyond
 
PDF
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Venveo
 
PDF
Making Big Data Projects Successful - Data Science Pop-up Seattle
Domino Data Lab
 
PDF
Run the good race with Collaborative innovation
IBM (Middle East and Africa)
 
PDF
Sketching out a cognitive masterpiece
IBM (Middle East and Africa)
 
Playing Nice in the Product Playground #StrataHadoop
Intuit Inc.
 
Playing Nice in the Product Playground
Intuit Inc.
 
BA and Beyond 19 Andrej Guštin - Mirror mirror on the wall Who's the wisest o...
BA and Beyond
 
BA and Beyond 20 - Antonio Gonzalez Sanchis - Add some RICE to your organisation
BA and Beyond
 
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Venveo
 
Making Big Data Projects Successful - Data Science Pop-up Seattle
Domino Data Lab
 
Run the good race with Collaborative innovation
IBM (Middle East and Africa)
 
Sketching out a cognitive masterpiece
IBM (Middle East and Africa)
 

What's hot (19)

PPTX
Idiots guide to setting up a data science team
Ashish Bansal
 
PPTX
AI as a platform
Aarthi Srinivasan
 
PDF
BA and Beyond 19 - Adrian Reed - Don't bring me solutions Bring me problems
BA and Beyond
 
PPTX
Using Data To Tranform Your Business - Marketing Business
Marco Garcia
 
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
Dataiku
 
PDF
7 Dimensions of Agile Analytics by Ken Collier
Thoughtworks
 
PDF
1000 track 1 groves_using our laptop
Rising Media, Inc.
 
PPTX
CASE STUDY SEW WHAT? Inc
Feby Sandra
 
PDF
BA and Beyond 19 - Lynda Girvan - User story workshop
BA and Beyond
 
PDF
Data Strategy - Enabling the Data-Guided Enterprise
Thoughtworks
 
PDF
Dave Elliman - Applying Continuous Intelligence ThoughtWorks Live UK 2018
Thoughtworks
 
PDF
BA and Beyond 19 Sponsor spotlight - The Business Analysts - Why is agile mak...
BA and Beyond
 
PPTX
Agile Analytics
Atif Shaikh
 
PPT
Chetan Karkhanis Profile
chetan_karkhanis
 
PDF
Artificial Intelligence - 3 Weeks to Success
Andrew Painter
 
PPSX
Teknasoft IT Services & Consulting Presentation
Teknasoft IT Services & Consulting
 
PDF
Going Beyond 'What Success Looks Like' - Using Data to Achieve Successful Pro...
IIBA UK Chapter
 
PPTX
My mistakes as a ba pankaj kanchankar
BAConfPune
 
PPTX
My mistakes as a Business Analyst
Pankaj Kanchankar
 
Idiots guide to setting up a data science team
Ashish Bansal
 
AI as a platform
Aarthi Srinivasan
 
BA and Beyond 19 - Adrian Reed - Don't bring me solutions Bring me problems
BA and Beyond
 
Using Data To Tranform Your Business - Marketing Business
Marco Garcia
 
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
Dataiku
 
7 Dimensions of Agile Analytics by Ken Collier
Thoughtworks
 
1000 track 1 groves_using our laptop
Rising Media, Inc.
 
CASE STUDY SEW WHAT? Inc
Feby Sandra
 
BA and Beyond 19 - Lynda Girvan - User story workshop
BA and Beyond
 
Data Strategy - Enabling the Data-Guided Enterprise
Thoughtworks
 
Dave Elliman - Applying Continuous Intelligence ThoughtWorks Live UK 2018
Thoughtworks
 
BA and Beyond 19 Sponsor spotlight - The Business Analysts - Why is agile mak...
BA and Beyond
 
Agile Analytics
Atif Shaikh
 
Chetan Karkhanis Profile
chetan_karkhanis
 
Artificial Intelligence - 3 Weeks to Success
Andrew Painter
 
Teknasoft IT Services & Consulting Presentation
Teknasoft IT Services & Consulting
 
Going Beyond 'What Success Looks Like' - Using Data to Achieve Successful Pro...
IIBA UK Chapter
 
My mistakes as a ba pankaj kanchankar
BAConfPune
 
My mistakes as a Business Analyst
Pankaj Kanchankar
 
Ad

Similar to Data and data scientists are not equal to money david hoyle (20)

PPTX
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
PDF
Making an impact with data science
Jordan Engbers
 
PPTX
introduction to data science
bhavesh lande
 
PPTX
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Vivian S. Zhang
 
PDF
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
PDF
Why Data Science Is Important for the Future of Work | IABAC
vamshit5
 
PPTX
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
PDF
Introduction to Data Science.pdf
University of Sindh
 
PDF
Embracing data science
Vipul Kalamkar
 
PPTX
Data Science PPT _basics of data science.pptx
KuldeepSinghBrar3
 
DOCX
What is Data Science?
Ahmed Banafa
 
PPTX
Impact of Data Science
kumari36
 
PPTX
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
PDF
5_Data Analytics, Data Science and Machine Learning
AyushSrivastava673855
 
PDF
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain
 
PPTX
Data science in business Administration Nagarajan.pptx
NagarajanG35
 
PDF
Understanding Data Science: Concepts, Techniques, and Applications | IABAC
IABAC
 
PDF
How Data Science Can Transform Your Business. | IABAC
IABAC
 
PPTX
introduction TO DS 1.pptxvbvcbvcbvcbvcbvcb
zmulani8
 
PPTX
Data Science Training in Chandigarh h
asmeerana605
 
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
Making an impact with data science
Jordan Engbers
 
introduction to data science
bhavesh lande
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Vivian S. Zhang
 
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
Why Data Science Is Important for the Future of Work | IABAC
vamshit5
 
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
Introduction to Data Science.pdf
University of Sindh
 
Embracing data science
Vipul Kalamkar
 
Data Science PPT _basics of data science.pptx
KuldeepSinghBrar3
 
What is Data Science?
Ahmed Banafa
 
Impact of Data Science
kumari36
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
5_Data Analytics, Data Science and Machine Learning
AyushSrivastava673855
 
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain
 
Data science in business Administration Nagarajan.pptx
NagarajanG35
 
Understanding Data Science: Concepts, Techniques, and Applications | IABAC
IABAC
 
How Data Science Can Transform Your Business. | IABAC
IABAC
 
introduction TO DS 1.pptxvbvcbvcbvcbvcbvcb
zmulani8
 
Data Science Training in Chandigarh h
asmeerana605
 
Ad

More from Institute of Contemporary Sciences (20)

PDF
First 5 years of PSI:ML - Filip Panjevic
Institute of Contemporary Sciences
 
PPTX
Building valuable (online and offline) Data Science communities - Experience ...
Institute of Contemporary Sciences
 
PPT
Data Science Master 4.0 on Belgrade University - Drazen Draskovic
Institute of Contemporary Sciences
 
PPTX
Deep learning fast and slow, a responsible and explainable AI framework - Ahm...
Institute of Contemporary Sciences
 
PPTX
Solving churn challenge in Big Data environment - Jelena Pekez
Institute of Contemporary Sciences
 
PDF
Application of Business Intelligence in bank risk management - Dimitar Dilov
Institute of Contemporary Sciences
 
PPTX
Trends and practical applications of AI/ML in Fin Tech industry - Milos Kosan...
Institute of Contemporary Sciences
 
PPTX
Recommender systems for personalized financial advice from concept to product...
Institute of Contemporary Sciences
 
PDF
Advanced tools in real time analytics and AI in customer support - Milan Sima...
Institute of Contemporary Sciences
 
PPTX
Complex AI forecasting methods for investments portfolio optimization - Pawel...
Institute of Contemporary Sciences
 
PPTX
From Zero to ML Hero for Underdogs - Amir Tabakovic
Institute of Contemporary Sciences
 
PPSX
The price is right - Tomislav Krizan
Institute of Contemporary Sciences
 
PPTX
When it's raining gold, bring a bucket - Andjela Culibrk
Institute of Contemporary Sciences
 
PPTX
Reality and traps of real time data engineering - Milos Solujic
Institute of Contemporary Sciences
 
PPTX
Sensor networks for personalized health monitoring - Vladimir Brusic
Institute of Contemporary Sciences
 
PDF
Improving Data Quality with Product Similarity Search
Institute of Contemporary Sciences
 
PPTX
Prediction of good patterns for future sales using image recognition
Institute of Contemporary Sciences
 
PPTX
Using data to fight corruption: full budget transparency in local government
Institute of Contemporary Sciences
 
PPTX
Geospatial Analysis and Open Data - Forest and Climate
Institute of Contemporary Sciences
 
PPTX
Machine Learning-Driven Injury Prediction for a Professional Sports Team
Institute of Contemporary Sciences
 
First 5 years of PSI:ML - Filip Panjevic
Institute of Contemporary Sciences
 
Building valuable (online and offline) Data Science communities - Experience ...
Institute of Contemporary Sciences
 
Data Science Master 4.0 on Belgrade University - Drazen Draskovic
Institute of Contemporary Sciences
 
Deep learning fast and slow, a responsible and explainable AI framework - Ahm...
Institute of Contemporary Sciences
 
Solving churn challenge in Big Data environment - Jelena Pekez
Institute of Contemporary Sciences
 
Application of Business Intelligence in bank risk management - Dimitar Dilov
Institute of Contemporary Sciences
 
Trends and practical applications of AI/ML in Fin Tech industry - Milos Kosan...
Institute of Contemporary Sciences
 
Recommender systems for personalized financial advice from concept to product...
Institute of Contemporary Sciences
 
Advanced tools in real time analytics and AI in customer support - Milan Sima...
Institute of Contemporary Sciences
 
Complex AI forecasting methods for investments portfolio optimization - Pawel...
Institute of Contemporary Sciences
 
From Zero to ML Hero for Underdogs - Amir Tabakovic
Institute of Contemporary Sciences
 
The price is right - Tomislav Krizan
Institute of Contemporary Sciences
 
When it's raining gold, bring a bucket - Andjela Culibrk
Institute of Contemporary Sciences
 
Reality and traps of real time data engineering - Milos Solujic
Institute of Contemporary Sciences
 
Sensor networks for personalized health monitoring - Vladimir Brusic
Institute of Contemporary Sciences
 
Improving Data Quality with Product Similarity Search
Institute of Contemporary Sciences
 
Prediction of good patterns for future sales using image recognition
Institute of Contemporary Sciences
 
Using data to fight corruption: full budget transparency in local government
Institute of Contemporary Sciences
 
Geospatial Analysis and Open Data - Forest and Climate
Institute of Contemporary Sciences
 
Machine Learning-Driven Injury Prediction for a Professional Sports Team
Institute of Contemporary Sciences
 

Recently uploaded (20)

PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
INFO8116 -Big data architecture and analytics
guddipatel10
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
PPTX
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
short term internship project on Data visualization
JMJCollegeComputerde
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
INFO8116 -Big data architecture and analytics
guddipatel10
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 

Data and data scientists are not equal to money david hoyle

  • 1. Data + Data Scientists ≠ Money Dr. David Hoyle
  • 2. My background PRODUCING DATA SCIENCE • 20 yrs in academia • dunnhumby • dunnhumby APPLYING DATA SCIENCE • Lloyds Banking Group • AutoTrader UK • InfinityWorks The challenges in applying Data Science are very different
  • 4. How did we get here? Your Company Inc.
  • 5. Cultural and organizational challenges are always harder than technical challenges • Which parts of the business? • How should we organize? • How should we work? • How should we communicate? • What support do we need? • What data should we use? • SVM vs Logistic Regression?
  • 7. Data touchpoints Bad guys Account Manager Marketing Developers OEMs Private seller £, €, $ Finance Car Dealer Consumer External Data Internal Data £ Trade£ Retail
  • 8. ‘Why?’ is a powerful Data Science tool How will you consume the outputs? ‘We need a neural network’ ‘We want to predict if users will click this link’ ‘Not clicking indicates low user engagement’ Why? Why? ‘We can alter the content in session if engagement is low’ Can you respond to the neural network output fast enough? ‘Hmmm… No.’
  • 9. Build cross-functional teams Data Scientist ≠ Data Engineer Data Science Data Engineering Product
  • 10. 10 Get close to the business Analytics Team Product Area C Analyst Product Area A Analyst Product Area B Analyst Business Area 2 Analyst Business Area 1 Analyst Data Scientists + Data Analysts
  • 12. All parts of Data Science have outputs INFORMATION PRESENTATION ALGORITHM DEVELOPMENT Easier to communicate outputs Easier to communicate progress Harder to communicate outputs Harder to communicate progress Always communicate what the outputs will be
  • 13. Understand business problem Map to appropriate abstraction Mathematical statement of abstraction Identify type of mathematical model required Identify & explore potential data sources Build, validate, & test model, e.g. CRISP-DM Productionize model Deploy production model artefacts Consume model outputs Monitor production model Re-build production model Improve model Data Science Data Engineering Data Science Data Engineering Data Engineering Data Engineering Data Science Data Science Understand & conceptualize the problem Understand resources available & build model Incorporate model into business process Monitor & improve The Data Science innovation lifecycle is longer than you think
  • 14. Data & Compute should be close together Operational Operational + Data Warehouse SQL, BI
  • 15. Not all data is valuable Either Your data is valuable to you – e.g. helps improve business processes Or Your data is valuable to someone else – e.g. gives a market wide view
  • 16. To make Data Science pay you need to 1. Work on the projects with direct P&L impact 2. …..by asking the right business questions up-front 3. …..using teams that have the technical right skills 4. …..and understand the business challenges 5. …..using Agile methodologies where appropriate 6. …..always communicating what you are doing and why 7. …..with the right tools and on the right data