SlideShare a Scribd company logo
classification edition
Machine Learning in 5 Minutes
Brian Lange
hi, i’m a
data scientist
classification
algorithms
popular examples
-spam filters
-the Sorting Hat
things to know
- you need data labeled with the correct answers to
“train” these algorithms before they work
- feature = dimension = attribute of the data
- class = category = Harry Potter house
linear discriminants
“draw a line through it”
linear discriminants
“draw a line through it”
linear discriminants
“draw a line through it”
linear discriminants
“draw a line through it”
🎉
define what “shitty” means
6 wrong
define what “shitty” means
4 wrong
a map of shittiness
to find the least shitty line
shittiness
slope
intercept
probably don’t use these
linear discriminants:
logistic regression
“divide it with a log function”
logistic regression
“divide it with a log function”
🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉
+ gives you probabilities
+ the model is a formula
+ can “threshold” to make model more or less
conservative
💩💩💩💩💩💩💩💩💩💩💩
- only works with linear decision boundaries
SVMs (support vector machines)
“*advanced* draw a line through it”
- better definition of “shitty”
- lines can turn into non-linear
shapes if you transform your
data
Machine Learning in 5 Minutes— Classification
Machine Learning in 5 Minutes— Classification
💩
Machine Learning in 5 Minutes— Classification
💩
“the kernel trick”
Machine Learning in 5 Minutes— Classification
🎉
woooooooooooo
🎉🎉
SVMs (support vector machines)
“*advanced* draw a line through it”
SVMs (support vector machines)
“*advanced* draw a line through it”
🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉
works well on a lot of different shapes of data
thanks to the kernel trick
💩💩💩💩💩💩💩💩💩💩💩
not super easy to explain to people
can only kinda do probabilities
KNN (k-nearest neighbors)
“what do similar cases look like?”
KNN (k-nearest neighbors)
“what do similar cases look like?”
k=1
KNN (k-nearest neighbors)
“what do similar cases look like?”
k=2
KNN (k-nearest neighbors)
“what do similar cases look like?”
k=1
KNN (k-nearest neighbors)
“what do similar cases look like?”
k=2
KNN (k-nearest neighbors)
“what do similar cases look like?”
k=3
KNN (k-nearest neighbors)
“what do similar cases look like?”
KNN (k-nearest neighbors)
“what do similar cases look like?”
🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉
+ no training, adding new data is easy
+ you get to define “distance” 

💩💩💩💩💩💩💩💩💩💩💩
- can be outlier-sensitive
- you have to define “distance”
decision tree learners
make a flow chart of it
decision tree learners
make a flow chart of it
x < 3?
yes no
3
decision tree learners
make a flow chart of it
x < 3?
yes no
y < 4?
yes no
3
4
decision tree learners
make a flow chart of it
x < 3?
yes no
y < 4?
yes no
x < 5?
yes no
3 5
4
decision tree learners
make a flow chart of it
🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉
+ fit all kinds of arbitrary shapes
+ output is a clear set of
conditionals

💩💩💩💩💩💩💩💩💩💩💩
- extremely prone to overfitting
- have to rebuild when you get new
data
- no probability estimates
ensemble models
make a bunch of models and combine them
ensemble models
make a bunch of models and combine them
🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉🎉
- don’t overfit as much as their component parts
- Generally don’t require much parameter tweaking
- If data doesn’t change very often, you can make
them semi-online by just adding new trees
- Can provide probabilities
💩💩💩💩💩💩💩💩💩💩💩
- Slower than their component parts (though if
those are fast, it doesn’t matter)

More Related Content

What's hot (20)

PPT
Object Oriented Analysis and Design
Haitham El-Ghareeb
 
PDF
Expert systems Artificial Intelligence
itti rehan
 
PDF
Naive Bayes
CloudxLab
 
PPTX
Data mining primitives
lavanya marichamy
 
PDF
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Edureka!
 
PDF
Vc dimension in Machine Learning
VARUN KUMAR
 
PPTX
Software Requirement Specification
Niraj Kumar
 
PPTX
Android Architecture.pptx
priya Nithya
 
PDF
Rough K Means - Numerical Example
Dr.E.N.Sathishkumar
 
PPTX
software project management Artifact set(spm)
REHMAT ULLAH
 
PPTX
Data Mining: Outlier analysis
DataminingTools Inc
 
PPTX
K-Nearest Neighbor(KNN)
Abdullah al Mamun
 
PPT
Machine Learning presentation.
butest
 
PPTX
1.1. the central concepts of automata theory
Sampath Kumar S
 
PDF
Android chapter02-setup2-emulator
guru472
 
PPTX
Ooad unit – 1 introduction
Babeetha Muruganantham
 
PPTX
Machine learning and types
Padma Metta
 
PPTX
Classification in data mining
Sulman Ahmed
 
Object Oriented Analysis and Design
Haitham El-Ghareeb
 
Expert systems Artificial Intelligence
itti rehan
 
Naive Bayes
CloudxLab
 
Data mining primitives
lavanya marichamy
 
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Edureka!
 
Vc dimension in Machine Learning
VARUN KUMAR
 
Software Requirement Specification
Niraj Kumar
 
Android Architecture.pptx
priya Nithya
 
Rough K Means - Numerical Example
Dr.E.N.Sathishkumar
 
software project management Artifact set(spm)
REHMAT ULLAH
 
Data Mining: Outlier analysis
DataminingTools Inc
 
K-Nearest Neighbor(KNN)
Abdullah al Mamun
 
Machine Learning presentation.
butest
 
1.1. the central concepts of automata theory
Sampath Kumar S
 
Android chapter02-setup2-emulator
guru472
 
Ooad unit – 1 introduction
Babeetha Muruganantham
 
Machine learning and types
Padma Metta
 
Classification in data mining
Sulman Ahmed
 

Similar to Machine Learning in 5 Minutes— Classification (17)

PDF
It's Not Magic - Explaining classification algorithms
Brian Lange
 
PPT
Mathematical reasoning
Safe Passages AmeriCorps
 
PDF
A Scala Corrections Library
Paul Phillips
 
PDF
Barga Data Science lecture 7
Roger Barga
 
PPTX
Machine Learning course Lecture number 1.pptx
hamedj21
 
PPTX
Understanding Basics of Machine Learning
Pranav Ainavolu
 
PDF
Efficient Simplification: The (im)possibilities
Neeldhara Misra
 
PDF
Geneticalgorithms 100403002207-phpapp02
Amna Saeed
 
KEY
Declarative Syntax Definition - Pretty Printing
Guido Wachsmuth
 
PDF
DFS-model Graph Modeling (CES 417) Lecture 6
DanialKhawaja4
 
PPTX
07-Classification.pptx
Shree Shree
 
PPTX
Lecture 9 -Clustering(ML algorithms: Clustering, KNN, DBScan).pptx
ShishirOyshi20143346
 
PPTX
An Introduction To Python - Variables, Math
Blue Elephant Consulting
 
PDF
DutchMLSchool. Logistic Regression, Deepnets, Time Series
BigML, Inc
 
PDF
Nearest Neighbor And Decision Tree - NN DT
julianaantunes58
 
PDF
What's "For Free" on Craigslist?
Josh Mayer
 
PDF
Multiplication -- More Than Repeated Addition and Times Tables
Chris Hunter
 
It's Not Magic - Explaining classification algorithms
Brian Lange
 
Mathematical reasoning
Safe Passages AmeriCorps
 
A Scala Corrections Library
Paul Phillips
 
Barga Data Science lecture 7
Roger Barga
 
Machine Learning course Lecture number 1.pptx
hamedj21
 
Understanding Basics of Machine Learning
Pranav Ainavolu
 
Efficient Simplification: The (im)possibilities
Neeldhara Misra
 
Geneticalgorithms 100403002207-phpapp02
Amna Saeed
 
Declarative Syntax Definition - Pretty Printing
Guido Wachsmuth
 
DFS-model Graph Modeling (CES 417) Lecture 6
DanialKhawaja4
 
07-Classification.pptx
Shree Shree
 
Lecture 9 -Clustering(ML algorithms: Clustering, KNN, DBScan).pptx
ShishirOyshi20143346
 
An Introduction To Python - Variables, Math
Blue Elephant Consulting
 
DutchMLSchool. Logistic Regression, Deepnets, Time Series
BigML, Inc
 
Nearest Neighbor And Decision Tree - NN DT
julianaantunes58
 
What's "For Free" on Craigslist?
Josh Mayer
 
Multiplication -- More Than Repeated Addition and Times Tables
Chris Hunter
 
Ad

Recently uploaded (20)

PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Python basic programing language for automation
DanialHabibi2
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
July Patch Tuesday
Ivanti
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Python basic programing language for automation
DanialHabibi2
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
July Patch Tuesday
Ivanti
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Ad

Machine Learning in 5 Minutes— Classification