SlideShare a Scribd company logo
Why we fail at ml ai why we fail at ml_ai
Data Science on GCP
4th Meeting!
Meetings every month.
Website: https://www.gcpchi.com/
Open discussions, presentations and competitions around data science, web
analytics and everything GCP (Google Cloud Platform). Come learn, present,
eat, and drink
Why we fail at ML/AI?
How to innovate and maintain love & science.
Brian Ray, Managing Director of Data Science at Maven Wave Partners
CHECK out my podcast http://ai-podcast.com
CHECK out my blog https://medium.com/@brianray_7981/
Join the team: Recruiting@mavenwave.com
75%
Of all enterprises FAILED to have an AI/ML Strategy. And over half (60%) know
they are failing. Is it our fault?
IDC survey
Are data scientists bad at
innovation?
Failure to innovate on a
corporate level.
Netflix’s 10-Year 4,000% Rally
Underlines Shift to Streaming
Ryan Vlastelica, Bloomberg,
December 30, 2019, 7:00 AM CST
Typical Timeline
No Data, Data Security, Quality of Data
Technology isn’t available, too expensive, or accessible
Internal team too busy or not qualified
It’s just not a priority for now
No technology
Yay, we won a
project, let’s
get to work
No Data? Unclear on
results
No Production
No Updates
No Review
Not
reproducible
Caveman
goes
hunting
Attacked by
dinosaur
Drops Food
Goes back
Tells other
caveman
Invest crop
into
growing
Grows food
Tells others
Wife is mad
others
are
mad?
WHAT is the
SECRET
SAUCE?
Secret sauce in prevent failure
Factors Paleoanthropology Data Scientists Business
Trust Does the wife trust
results?
Is the process and
data supportive?
Does business
understand
prediction?
Census Does the tribe? Do data scientists
agree on results?
Will the business
agree to invest?
Process Reap what you sow Reproducible results
along the way
Can investment
produce reward?
Tools Stone tools Machine Learning Visualization Tools
Success Criteria Not Starve Not Starve Not Starve
Better Timeline
Yay, we won a
project, let’s
get to work
Relevant Data
Enough Data
Enriched Data
Explainable
Results
Path to
production
With human in
the loop
Better
Exploratory
Data Science
Sooner
Code reviewed
Reproducible
And
retrainable
Top 15 goals to help you not fail.
1. Stop using your laptop
2. Start showing data sooner
3. Show data better
4. Fear not to change directions
5. Show other data
6. Learn to try more, faster
7. Keep track of what you tried
8. Focus on explainability
9. Teach the art of measuring prediction
10. Learn to get along with other Data Scientists
11. Take the time to show and tell
12. Forget not what was done and were
13. Build production pipelines
14. Treat each prediction as a learning lesson
15. Repeat
With Examples from:
1. Stop using your laptop
[2]
SECRET SAUCE: Cloud computing
2. Show data sooner
Depending on the data consider:
● Looker
● GCP DataLab
● TensorFlow Board / Projector
● Consider notebook friendly
visualization like:
○ Bokeh
○ Plotly
○ ….
● Google Sheet: Data Connectors!
SECRET SAUCE: Rapid Dashboarding
SECRET SAUCE: Data Visualization
TensorFlow
Ai Platform
Notebooks
Looker
3. Show data better
[3]
SECRET SAUCE: Exploratory Data Analysis
SECRET SAUCE: Unsupervised Machine Learning
4. Fear not change directions
A Model to classify 100’s of beers?
VS
100 of models to classify 100’s of beers?
SECRET SAUCE: Goal based process
SECRET SAUCE: discovery head-room
5. Show Other Data
SECRET SAUCE: Data Access (datalake, swamp, ..)
SECRET SAUCE: Cloud computing
Reference table
BigQuery
Unstructured
Cloud Storage
Data matching
Cloud Dataflow
6. Learn to try more, faster
SECRET SAUCE: Data Science Pipelines
SECRET SAUCE: Automation
Kubeflow
GKE/Kubernetes
ETL
Dataflow
7. Keep track of what you tried
SECRET SAUCE: Logging and reporting
SECRET SAUCE: Document Collaboration
8. Focus on explainability
SECRET SAUCE: Game Theory (Shapley)
9. Teach the art of measuring prediction
[5]
SECRET SAUCE: show and tell, storytelling
SECRET SAUCE: regular checkpoints
10. Learn to get along with other Data Scientists
SECRET SAUCE: collaborate
SECRET SAUCE: else compromise
[5]
11. Take the time to show and tell
SECRET SAUCE: story telling
SECRET SAUCE: checkpoints
[5]
12. Forget not what was done and were
https://www.youtube.com/watch?v=xU_xdog
XFeE&feature=youtu.be
SECRET SAUCE: use cloud platform tools
SECRET SAUCE: use version controls
Git Repos
Compute Engine
Ai Platform
Notebooks
13. Build production pipelines
25
Train / Retrain Model
Review AUC ROC
accuracies and signoff
acquired from business
Deploy Production Model
Live Prediction
Model Life Cycle
Model Repository
Version control of model
(github, bitbucket …)
SECRET SAUCE: CI/CD
SECRET SAUCE: O&M
[6]
14. Treat each prediction as a learning lesson
26
Train / Retrain Model
Review AUC ROC
accuracies and signoff
acquired from business
Deploy Production Model
Live Prediction
Reviewed by a SME
(Subject Matter Expert)
Model Life Cycle
Model Repository
Version control of model
(github, bitbucket …)
Ground Truth Corrections
New Feature Types
Corrected Records
New Records
SECRET SAUCE: Human in the loop
[6]
15. Repeat
Brian Ray, Managing Director of Data
Science at Maven Wave Partners
@brianray
CHECK out my podcast http://ai-
podcast.com
CHECK out my blog
https://medium.com/@brianray_7981/
Join the team:
Recruiting@mavenwave.com
SECRET SAUCE: Success!
Credits
[1] 1. Paleoanthropology – The History of Our Tribe: Hominini icon
[2] https://blog.dominodatalab.com/cost-data-science-laptops/
[3] https://blog.magrathealabs.com/choosing-one-of-many-python-visualization-tools-7eb36fa5855f
[4] https://www.businessinsider.com/everything-you-need-to-know-about-beer-in-one-chart-2015-8
[5] https://xkcd.com/605/
[6] Copyright Maven Wave Partners
Giffy: “Machine Times GIF - Find & Share on GIPHY_files”, “Confused Curb Your Enthusiasm GIF - Find & Share on GIPHY_files”, “Stewie Griffin Laughing GIF - Find & Share
on GIPHY_files”, “Laptop Goodbye GIF - Find & Share on GIPHY_files”, “Domestic Violence Caveman GIF - Find & Share on GIPHY_files”, “Fail Star Wars GIF - Find & Share on
GIPHY_files”

More Related Content

Similar to Why we fail at ml ai why we fail at ml_ai (20)

PPTX
best data science course institutes in Hyderabad
rajasrichalamala3zen
 
PPTX
Data Science course in Hyderabad .
rajasrichalamala3zen
 
PPTX
Data Science course in Hyderabad .
rajasrichalamala3zen
 
PPTX
data science course training in Hyderabad
madhupriya3zen
 
PDF
Data science course in ameerpet Hyderabad
ShivaKanukuntla33
 
PPTX
data science course in Hyderabad data science course in Hyderabad
akhilamadupativibhin
 
PPTX
data science.pptx
shaikruhiarsha3zenco
 
PPTX
This is Data Types of Python Programming Language
uf980966
 
PPTX
Data Science is a growing field, you refer it
uf980966
 
PDF
"What we learned from 5 years of building a data science software that actual...
Dataconomy Media
 
PPTX
Managing Data Science Projects
Danielle Dean
 
PDF
The Research Blueprint: Excelling in Data science, Data Analysis and AI
Loreta Jugu
 
PPTX
33A1660F-datascience.pptx Data analyst at the end
rahulmg141891
 
PDF
Data science course in madhapur,Hyderabad
neeraja0480
 
PDF
Building a Successful Career in Data Science_ A Comprehensive Guide - Uncodem...
Ahana Sharma
 
PPTX
Dot Net Full Stack course in madhapur,Hyderabad
neeraja0480
 
PPTX
Managing Data Science | Lessons from the Field
Domino Data Lab
 
PPTX
Getting past the hype .pptx
SUNEEL37
 
PDF
10 Tips From A Young Data Scientist
Nuno Carneiro
 
PDF
DS Life Cycle
Knoldus Inc.
 
best data science course institutes in Hyderabad
rajasrichalamala3zen
 
Data Science course in Hyderabad .
rajasrichalamala3zen
 
Data Science course in Hyderabad .
rajasrichalamala3zen
 
data science course training in Hyderabad
madhupriya3zen
 
Data science course in ameerpet Hyderabad
ShivaKanukuntla33
 
data science course in Hyderabad data science course in Hyderabad
akhilamadupativibhin
 
data science.pptx
shaikruhiarsha3zenco
 
This is Data Types of Python Programming Language
uf980966
 
Data Science is a growing field, you refer it
uf980966
 
"What we learned from 5 years of building a data science software that actual...
Dataconomy Media
 
Managing Data Science Projects
Danielle Dean
 
The Research Blueprint: Excelling in Data science, Data Analysis and AI
Loreta Jugu
 
33A1660F-datascience.pptx Data analyst at the end
rahulmg141891
 
Data science course in madhapur,Hyderabad
neeraja0480
 
Building a Successful Career in Data Science_ A Comprehensive Guide - Uncodem...
Ahana Sharma
 
Dot Net Full Stack course in madhapur,Hyderabad
neeraja0480
 
Managing Data Science | Lessons from the Field
Domino Data Lab
 
Getting past the hype .pptx
SUNEEL37
 
10 Tips From A Young Data Scientist
Nuno Carneiro
 
DS Life Cycle
Knoldus Inc.
 

Recently uploaded (20)

PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPT
deep dive data management sharepoint apps.ppt
novaprofk
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PDF
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
PPTX
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PDF
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PDF
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
deep dive data management sharepoint apps.ppt
novaprofk
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
Ad

Why we fail at ml ai why we fail at ml_ai

  • 2. Data Science on GCP 4th Meeting! Meetings every month. Website: https://www.gcpchi.com/ Open discussions, presentations and competitions around data science, web analytics and everything GCP (Google Cloud Platform). Come learn, present, eat, and drink
  • 3. Why we fail at ML/AI? How to innovate and maintain love & science. Brian Ray, Managing Director of Data Science at Maven Wave Partners CHECK out my podcast http://ai-podcast.com CHECK out my blog https://medium.com/@brianray_7981/ Join the team: [email protected]
  • 4. 75% Of all enterprises FAILED to have an AI/ML Strategy. And over half (60%) know they are failing. Is it our fault? IDC survey
  • 5. Are data scientists bad at innovation?
  • 6. Failure to innovate on a corporate level. Netflix’s 10-Year 4,000% Rally Underlines Shift to Streaming Ryan Vlastelica, Bloomberg, December 30, 2019, 7:00 AM CST
  • 7. Typical Timeline No Data, Data Security, Quality of Data Technology isn’t available, too expensive, or accessible Internal team too busy or not qualified It’s just not a priority for now No technology Yay, we won a project, let’s get to work No Data? Unclear on results No Production No Updates No Review Not reproducible
  • 8. Caveman goes hunting Attacked by dinosaur Drops Food Goes back Tells other caveman Invest crop into growing Grows food Tells others Wife is mad others are mad?
  • 10. Secret sauce in prevent failure Factors Paleoanthropology Data Scientists Business Trust Does the wife trust results? Is the process and data supportive? Does business understand prediction? Census Does the tribe? Do data scientists agree on results? Will the business agree to invest? Process Reap what you sow Reproducible results along the way Can investment produce reward? Tools Stone tools Machine Learning Visualization Tools Success Criteria Not Starve Not Starve Not Starve
  • 11. Better Timeline Yay, we won a project, let’s get to work Relevant Data Enough Data Enriched Data Explainable Results Path to production With human in the loop Better Exploratory Data Science Sooner Code reviewed Reproducible And retrainable
  • 12. Top 15 goals to help you not fail. 1. Stop using your laptop 2. Start showing data sooner 3. Show data better 4. Fear not to change directions 5. Show other data 6. Learn to try more, faster 7. Keep track of what you tried 8. Focus on explainability 9. Teach the art of measuring prediction 10. Learn to get along with other Data Scientists 11. Take the time to show and tell 12. Forget not what was done and were 13. Build production pipelines 14. Treat each prediction as a learning lesson 15. Repeat With Examples from:
  • 13. 1. Stop using your laptop [2] SECRET SAUCE: Cloud computing
  • 14. 2. Show data sooner Depending on the data consider: ● Looker ● GCP DataLab ● TensorFlow Board / Projector ● Consider notebook friendly visualization like: ○ Bokeh ○ Plotly ○ …. ● Google Sheet: Data Connectors! SECRET SAUCE: Rapid Dashboarding SECRET SAUCE: Data Visualization TensorFlow Ai Platform Notebooks Looker
  • 15. 3. Show data better [3] SECRET SAUCE: Exploratory Data Analysis SECRET SAUCE: Unsupervised Machine Learning
  • 16. 4. Fear not change directions A Model to classify 100’s of beers? VS 100 of models to classify 100’s of beers? SECRET SAUCE: Goal based process SECRET SAUCE: discovery head-room
  • 17. 5. Show Other Data SECRET SAUCE: Data Access (datalake, swamp, ..) SECRET SAUCE: Cloud computing Reference table BigQuery Unstructured Cloud Storage Data matching Cloud Dataflow
  • 18. 6. Learn to try more, faster SECRET SAUCE: Data Science Pipelines SECRET SAUCE: Automation Kubeflow GKE/Kubernetes ETL Dataflow
  • 19. 7. Keep track of what you tried SECRET SAUCE: Logging and reporting SECRET SAUCE: Document Collaboration
  • 20. 8. Focus on explainability SECRET SAUCE: Game Theory (Shapley)
  • 21. 9. Teach the art of measuring prediction [5] SECRET SAUCE: show and tell, storytelling SECRET SAUCE: regular checkpoints
  • 22. 10. Learn to get along with other Data Scientists SECRET SAUCE: collaborate SECRET SAUCE: else compromise [5]
  • 23. 11. Take the time to show and tell SECRET SAUCE: story telling SECRET SAUCE: checkpoints [5]
  • 24. 12. Forget not what was done and were https://www.youtube.com/watch?v=xU_xdog XFeE&feature=youtu.be SECRET SAUCE: use cloud platform tools SECRET SAUCE: use version controls Git Repos Compute Engine Ai Platform Notebooks
  • 25. 13. Build production pipelines 25 Train / Retrain Model Review AUC ROC accuracies and signoff acquired from business Deploy Production Model Live Prediction Model Life Cycle Model Repository Version control of model (github, bitbucket …) SECRET SAUCE: CI/CD SECRET SAUCE: O&M [6]
  • 26. 14. Treat each prediction as a learning lesson 26 Train / Retrain Model Review AUC ROC accuracies and signoff acquired from business Deploy Production Model Live Prediction Reviewed by a SME (Subject Matter Expert) Model Life Cycle Model Repository Version control of model (github, bitbucket …) Ground Truth Corrections New Feature Types Corrected Records New Records SECRET SAUCE: Human in the loop [6]
  • 27. 15. Repeat Brian Ray, Managing Director of Data Science at Maven Wave Partners @brianray CHECK out my podcast http://ai- podcast.com CHECK out my blog https://medium.com/@brianray_7981/ Join the team: [email protected] SECRET SAUCE: Success!
  • 28. Credits [1] 1. Paleoanthropology – The History of Our Tribe: Hominini icon [2] https://blog.dominodatalab.com/cost-data-science-laptops/ [3] https://blog.magrathealabs.com/choosing-one-of-many-python-visualization-tools-7eb36fa5855f [4] https://www.businessinsider.com/everything-you-need-to-know-about-beer-in-one-chart-2015-8 [5] https://xkcd.com/605/ [6] Copyright Maven Wave Partners Giffy: “Machine Times GIF - Find & Share on GIPHY_files”, “Confused Curb Your Enthusiasm GIF - Find & Share on GIPHY_files”, “Stewie Griffin Laughing GIF - Find & Share on GIPHY_files”, “Laptop Goodbye GIF - Find & Share on GIPHY_files”, “Domestic Violence Caveman GIF - Find & Share on GIPHY_files”, “Fail Star Wars GIF - Find & Share on GIPHY_files”

Editor's Notes

  • #2: Daniel’s slide