SlideShare a Scribd company logo
ROADMAP:SIX MONTHS TO MACHINE LEARNING
ZACHARIAH MILLER - 5/20/17
WHO AM I?AND WHY SHOULD
YOU LISTEN TO ME?
This is the STAR Detector at 

Brookhaven National Laboratory.
IF YOU ONLY REMEMBER
ONE THING FROM THIS TALK
JUST BUILD
SOMETHING WITH DATA
AND EXPECT TO SUCK AT IT, FOR A WHILE
My First
Data Project
Unedited.
My First
Data Project
Edited.
But the first one made the
plots in my thesis.
THE
(SOMEWHAT)

UNFORTUNATE 

TRUTH
MATH
MATH & MACHINE LEARNING
▸ Linear Algebra
▸ Calculus
▸ Statistics
▸ Probability
MATH & MACHINE LEARNING
▸ MIT Linear Algebra Open Course
▸ MIT Calculus Open Course
▸ MIT Stats and Probability Course
VS
Python R
VS
BOTH ARE GOOD
PICK ONE AND LEARN
Roadmap: How to Learn Machine Learning in 6 Months
THE MACHINE LEARNING PIPELINE
▸ Find Data (Webscrape, APIs, CSVs)
▸ Clean the data (remove NaNs and Infinities, should that be
a string? Probably not, maybe I can categorize it…)
YOU’LL SPEND A LOT OF TIME IN
THESE SECTIONS. THAT’S
NORMAL.
THE MACHINE LEARNING PIPELINE
▸ Find Data (Webscrape, APIs, CSVs)
▸ Clean the data (remove NaNs and Infinities, should that be
a string? Probably not, maybe I can categorize it…)
▸ Choose and tune your algorithm
▸ Visualize results
GOOGLE AND STACK
OVERFLOW ARE YOUR FRIENDS
GOOGLE AND STACK
OVERFLOW ARE YOUR FRIENDS
THE MACHINE LEARNING PIPELINE
▸ Find Data (Webscrape, APIs, CSVs)
▸ Clean the data (remove NaNs and Infinities, should that be
a string? Probably not, maybe I can categorize it…)
▸ Choose and tune your algorithm
▸ Visualize results
Roadmap: How to Learn Machine Learning in 6 Months
SO… SIX MONTHS?
▸ Learn the math. (2 - 3 months)
▸ Learn the programming language (1 month)
▸ Machine learning tutorials and test projects
(1 - 2 months)
▸ Short term passion projects (1+ month)
DATA SCIENCE AND
MACHINE LEARNING
CLASSES/BOOTCAMPS
SOME LAST NOTES
▸ You’re going to fail. A lot. It’s software…
so who cares.
▸ Your models may not be predictive.
That’s a result. Null is just as good as non-
null if you did it right.
▸ Track your projects on GitHub and write
up your results.
THANKS!LET’S CHAT. I’D LOVE TO TALK ABOUT
PROJECTS YOU’RE CONSIDERING.
PHOTO CREDITS
▸ https://c1.staticflickr.com/4/3310/3313585315_4874a81f77_b.jpg
(STAR Detector)
▸ https://c1.staticflickr.com/6/5531/9638435181_7e3e44c2b8_b.jpg
(Highway)
▸ https://www.mathworks.com/matlabcentral/mlc-downloads/
downloads/submissions/35389/versions/1/screenshot.png (Gradient
Descent)
▸ http://blog.yhat.com/static/img/roc-auc.png (AUC)
▸ http://scikit-learn.org/stable/tutorial/machine_learning_map/ (SkLearn
Map)

More Related Content

What's hot (20)

PPTX
Introduction to machine learning
Ganesh Satpute
 
PPTX
INTRODUCTION TO MACHINE LEARNING.pptx
AbhigyanMishra17
 
PDF
Generative AI - The Future of Creation (Presentation by Aishwarya Ramesh)
Aishwarya Ramesh
 
PPTX
Deep Learning Explained
Melanie Swan
 
PPTX
Machine Learning
Kumar P
 
PPTX
Unsupervised learning clustering
Arshad Farhad
 
PPTX
Graph Neural Networks.pptx
Kumar Iyer
 
PPTX
Data Visualization & Data Storytelling
彭其捷 Jack
 
PPT
Machine Learning presentation.
butest
 
PPTX
Linear Regression and Logistic Regression in ML
Kumud Arora
 
PDF
Deep Learning
Shaikh Shahzad
 
PPTX
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
PPTX
Random forest algorithm
Rashid Ansari
 
PPTX
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
Simplilearn
 
PDF
Text classification presentation
Marijn van Zelst
 
PPT
Lect#1 (Artificial Intelligence )
Zeeshan_Jadoon
 
PPTX
A.i. ppt by suvinsh mishra
Suvinsh Mishra
 
PDF
Lecture 1: What is Machine Learning?
Marina Santini
 
PDF
Is Machine learning useful for Fraud Prevention?
Andrea Dal Pozzolo
 
PPTX
Introduction to Machine Learning
Sujith Jayaprakash
 
Introduction to machine learning
Ganesh Satpute
 
INTRODUCTION TO MACHINE LEARNING.pptx
AbhigyanMishra17
 
Generative AI - The Future of Creation (Presentation by Aishwarya Ramesh)
Aishwarya Ramesh
 
Deep Learning Explained
Melanie Swan
 
Machine Learning
Kumar P
 
Unsupervised learning clustering
Arshad Farhad
 
Graph Neural Networks.pptx
Kumar Iyer
 
Data Visualization & Data Storytelling
彭其捷 Jack
 
Machine Learning presentation.
butest
 
Linear Regression and Logistic Regression in ML
Kumud Arora
 
Deep Learning
Shaikh Shahzad
 
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Simplilearn
 
Random forest algorithm
Rashid Ansari
 
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
Simplilearn
 
Text classification presentation
Marijn van Zelst
 
Lect#1 (Artificial Intelligence )
Zeeshan_Jadoon
 
A.i. ppt by suvinsh mishra
Suvinsh Mishra
 
Lecture 1: What is Machine Learning?
Marina Santini
 
Is Machine learning useful for Fraud Prevention?
Andrea Dal Pozzolo
 
Introduction to Machine Learning
Sujith Jayaprakash
 

Similar to Roadmap: How to Learn Machine Learning in 6 Months (20)

PDF
How to start your journey as a data scientist
Parvaneh Shafiei
 
PPTX
Introduction to Machine Learning - An overview and first step for candidate d...
Lucas Jellema
 
PDF
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
PDF
Odsc machine-learning-guide-v1
Harsh Khatke
 
PPTX
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
Lucas Jellema
 
PPTX
Introduction overviewmachinelearning sig Door Lucas Jellema
Getting value from IoT, Integration and Data Analytics
 
PPTX
Data scientist roadmap
Sonu Kumar
 
PDF
Using machine learning to try and predict taxi availability by Narahari Allam...
PYCON MY PLT
 
PDF
Ml masterclass
Maxwell Rebo
 
PDF
Data Science Accelerator Program
GoDataDriven
 
PPT
Machine learning for complete beginners.ppt
hyliuqd
 
PDF
Data science presentation
MSDEVMTL
 
PDF
A step towards machine learning at accionlabs
Chetan Khatri
 
PPTX
Data Science Roadmap by Swapnil Microsoft
geekism12
 
PDF
100 days of machine learning
Harsha Nath Jha
 
PPTX
The Art of Intelligence – Introduction Machine Learning for Oracle profession...
Lucas Jellema
 
DOCX
Self Study Business Approach to DS_01022022.docx
Shanmugasundaram M
 
PDF
Practical machine learning - Part 1
Traian Rebedea
 
PDF
How to become a Data Scientist?
HackerEarth
 
PPTX
Roadmap of Data Science only for beginner
SplendiousAntonio
 
How to start your journey as a data scientist
Parvaneh Shafiei
 
Introduction to Machine Learning - An overview and first step for candidate d...
Lucas Jellema
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
Odsc machine-learning-guide-v1
Harsh Khatke
 
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
Lucas Jellema
 
Introduction overviewmachinelearning sig Door Lucas Jellema
Getting value from IoT, Integration and Data Analytics
 
Data scientist roadmap
Sonu Kumar
 
Using machine learning to try and predict taxi availability by Narahari Allam...
PYCON MY PLT
 
Ml masterclass
Maxwell Rebo
 
Data Science Accelerator Program
GoDataDriven
 
Machine learning for complete beginners.ppt
hyliuqd
 
Data science presentation
MSDEVMTL
 
A step towards machine learning at accionlabs
Chetan Khatri
 
Data Science Roadmap by Swapnil Microsoft
geekism12
 
100 days of machine learning
Harsha Nath Jha
 
The Art of Intelligence – Introduction Machine Learning for Oracle profession...
Lucas Jellema
 
Self Study Business Approach to DS_01022022.docx
Shanmugasundaram M
 
Practical machine learning - Part 1
Traian Rebedea
 
How to become a Data Scientist?
HackerEarth
 
Roadmap of Data Science only for beginner
SplendiousAntonio
 
Ad

More from IDEAS - Int'l Data Engineering and Science Association (20)

PPTX
How to deliver effective data science projects
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Digital cracks in banking--Sid Nandi
IDEAS - Int'l Data Engineering and Science Association
 
PDF
“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Battling Skynet: The Role of Humanity in Artificial Intelligence
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Implementing Artificial Intelligence with Big Data
IDEAS - Int'l Data Engineering and Science Association
 
PPSX
Data Architecture (i.e., normalization / relational algebra) and Database Sec...
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Blockchain Application in Real Estate Transactions
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Practical Machine Learning at Work
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Artificial Intelligence: Hype, Reality, Vision.
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Operationalizing your Data Lake: Get Ready for Advanced Analytics
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Introduction to Deep Reinforcement Learning
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Best Practices in Data Partnerships Between Mayor's Office and Academia
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Everything You Wish You Knew About Search
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Data-Driven AI for Entertainment and Healthcare
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Generating Creative Works with AI
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Using AI to Tackle the Future of Health Care Data
IDEAS - Int'l Data Engineering and Science Association
 
PDF
Hot Dog, Not Hot Dog! Generate new training data without taking more photos.
IDEAS - Int'l Data Engineering and Science Association
 
How to deliver effective data science projects
IDEAS - Int'l Data Engineering and Science Association
 
Digital cracks in banking--Sid Nandi
IDEAS - Int'l Data Engineering and Science Association
 
“Full Stack” Data Science with R for Startups: Production-ready with Open-Sou...
IDEAS - Int'l Data Engineering and Science Association
 
Battling Skynet: The Role of Humanity in Artificial Intelligence
IDEAS - Int'l Data Engineering and Science Association
 
Implementing Artificial Intelligence with Big Data
IDEAS - Int'l Data Engineering and Science Association
 
Data Architecture (i.e., normalization / relational algebra) and Database Sec...
IDEAS - Int'l Data Engineering and Science Association
 
Blockchain Application in Real Estate Transactions
IDEAS - Int'l Data Engineering and Science Association
 
Learning to learn Model Behavior: How to use "human-in-the-loop" to explain d...
IDEAS - Int'l Data Engineering and Science Association
 
Practical Machine Learning at Work
IDEAS - Int'l Data Engineering and Science Association
 
Artificial Intelligence: Hype, Reality, Vision.
IDEAS - Int'l Data Engineering and Science Association
 
Operationalizing your Data Lake: Get Ready for Advanced Analytics
IDEAS - Int'l Data Engineering and Science Association
 
Introduction to Deep Reinforcement Learning
IDEAS - Int'l Data Engineering and Science Association
 
Best Practices in Data Partnerships Between Mayor's Office and Academia
IDEAS - Int'l Data Engineering and Science Association
 
Everything You Wish You Knew About Search
IDEAS - Int'l Data Engineering and Science Association
 
AliMe Bot Platform Technical Practice - Alibaba`s Personal Intelligent Assist...
IDEAS - Int'l Data Engineering and Science Association
 
Data-Driven AI for Entertainment and Healthcare
IDEAS - Int'l Data Engineering and Science Association
 
Using AI to Tackle the Future of Health Care Data
IDEAS - Int'l Data Engineering and Science Association
 
Hot Dog, Not Hot Dog! Generate new training data without taking more photos.
IDEAS - Int'l Data Engineering and Science Association
 
Ad

Recently uploaded (20)

PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PDF
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PDF
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 

Roadmap: How to Learn Machine Learning in 6 Months

  • 1. ROADMAP:SIX MONTHS TO MACHINE LEARNING ZACHARIAH MILLER - 5/20/17
  • 2. WHO AM I?AND WHY SHOULD YOU LISTEN TO ME? This is the STAR Detector at 
 Brookhaven National Laboratory.
  • 3. IF YOU ONLY REMEMBER ONE THING FROM THIS TALK
  • 4. JUST BUILD SOMETHING WITH DATA AND EXPECT TO SUCK AT IT, FOR A WHILE
  • 6. My First Data Project Edited. But the first one made the plots in my thesis.
  • 9. MATH & MACHINE LEARNING ▸ Linear Algebra ▸ Calculus ▸ Statistics ▸ Probability
  • 10. MATH & MACHINE LEARNING ▸ MIT Linear Algebra Open Course ▸ MIT Calculus Open Course ▸ MIT Stats and Probability Course
  • 12. VS BOTH ARE GOOD PICK ONE AND LEARN
  • 14. THE MACHINE LEARNING PIPELINE ▸ Find Data (Webscrape, APIs, CSVs) ▸ Clean the data (remove NaNs and Infinities, should that be a string? Probably not, maybe I can categorize it…) YOU’LL SPEND A LOT OF TIME IN THESE SECTIONS. THAT’S NORMAL.
  • 15. THE MACHINE LEARNING PIPELINE ▸ Find Data (Webscrape, APIs, CSVs) ▸ Clean the data (remove NaNs and Infinities, should that be a string? Probably not, maybe I can categorize it…) ▸ Choose and tune your algorithm ▸ Visualize results GOOGLE AND STACK OVERFLOW ARE YOUR FRIENDS
  • 16. GOOGLE AND STACK OVERFLOW ARE YOUR FRIENDS THE MACHINE LEARNING PIPELINE ▸ Find Data (Webscrape, APIs, CSVs) ▸ Clean the data (remove NaNs and Infinities, should that be a string? Probably not, maybe I can categorize it…) ▸ Choose and tune your algorithm ▸ Visualize results
  • 18. SO… SIX MONTHS? ▸ Learn the math. (2 - 3 months) ▸ Learn the programming language (1 month) ▸ Machine learning tutorials and test projects (1 - 2 months) ▸ Short term passion projects (1+ month)
  • 19. DATA SCIENCE AND MACHINE LEARNING CLASSES/BOOTCAMPS
  • 20. SOME LAST NOTES ▸ You’re going to fail. A lot. It’s software… so who cares. ▸ Your models may not be predictive. That’s a result. Null is just as good as non- null if you did it right. ▸ Track your projects on GitHub and write up your results.
  • 21. THANKS!LET’S CHAT. I’D LOVE TO TALK ABOUT PROJECTS YOU’RE CONSIDERING.
  • 22. PHOTO CREDITS ▸ https://c1.staticflickr.com/4/3310/3313585315_4874a81f77_b.jpg (STAR Detector) ▸ https://c1.staticflickr.com/6/5531/9638435181_7e3e44c2b8_b.jpg (Highway) ▸ https://www.mathworks.com/matlabcentral/mlc-downloads/ downloads/submissions/35389/versions/1/screenshot.png (Gradient Descent) ▸ http://blog.yhat.com/static/img/roc-auc.png (AUC) ▸ http://scikit-learn.org/stable/tutorial/machine_learning_map/ (SkLearn Map)