SlideShare a Scribd company logo
Machine Learning
Nicolas P. Rougier
July 2, 2018
Inria – Institute of Neurodegenerative Diseases – Bordeaux
What to learn?
Considering an input x (that can be an image, a text, a vector, a
sequence, a set of features, a scalar, etc.)
Classification
To find the category Cx of some input x
Regression
To find the value f(x) of some input x
Clustering
To divide all inputs {x} into distinct groups
Optimization
To find a sequence {ai} such as to optimize
∑
f(ai)
1
Classification
We know that (x1, y1) is blue, (x2, y2) is blue, (x3, y3) is red., etc.
What is the color of (x, y)?
2
Regression
We know that f(x1) = y1, f(x2) = y2, f(x3) = y3, etc.
x
What is the value of f(x)?
3
Clustering
We have (x1, y1), (x2, y2), (x3, y3), etc.
What group (x, y) belongs to?
4
Optimization
You chose C and got no reward, you chose F and got no reward, you
chose N and got no reward, etc.
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
What do you choose next?
5
How to learn?
Supervised learning
To learn a function that maps an input to an output based on
example input-output pairs
Reinforcement learning
To find a policy that maps states to action such as to maximize
long-term reward
Unsupervised learning
To infer a function that describes the structure of some (unlabeled)
data
6
Machine learning ̸= Human learning
To learn anything, at any time, using any method and a few samples.
Explicit & implicit learning, lifelong learning, imitation learning,
one-shot learning, declarative and procedural memory, short and
long term memory, working memory, forgetting, motivation, emotion,
etc.
7
The machine-learning pipeline
Data (60% of your time)
Analysis, acquisition, cleaning,
preparation, augmentation, balance
Model (30% of your time)
Hyperparameters, pre-training, training,
tweaking, testing
Deployment (10% of your time)
Optimization, dockerization, evaluation
8
Common pitfalls
Data
Small, unbalanced, biased, leakage, variance, etc
Model
Initialization, local minima, curse of dimensionality, etc
Deployment
Scalability, fragility, ethics, etc
9
How big is “big”?
Deep convolutional network (Hinton, 2010)
• 5 convolutional layers
• 650,000 units
• 60,000,000 parameters
• 1,200,000 train images
• 100,000 test images
• 1,000 classes
Latest OpenAI DOTA player (June 2018): 180 years worth of games, 256
GPUs and 128,000 CPU cores.
10
Do I need deep learning?
If all you have is a hammer, everything looks like a nail.
11
Conclusion
Lot of challenges ahead
• Learn with a few samples
• Lifelong learning
• Transfer learning
• Unbiased learning
• Robust learning
• Unsupervised learning
• Reinforcement learning
Lot of opportunities as well...
12

More Related Content

PDF
LR1. Summary Day 1
Machine Learning Valencia
 
PPTX
Interpreting machine learning models
andosa
 
PDF
L3. Decision Trees
Machine Learning Valencia
 
PDF
A Random Forest Approach To Skin Detection With R
Auro Tripathy
 
PDF
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
Sebastian Raschka
 
PDF
L13. Cluster Analysis
Machine Learning Valencia
 
PPT
Active learning lecture
azuring
 
PDF
Building Random Forest at Scale
Sri Ambati
 
LR1. Summary Day 1
Machine Learning Valencia
 
Interpreting machine learning models
andosa
 
L3. Decision Trees
Machine Learning Valencia
 
A Random Forest Approach To Skin Detection With R
Auro Tripathy
 
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
Sebastian Raschka
 
L13. Cluster Analysis
Machine Learning Valencia
 
Active learning lecture
azuring
 
Building Random Forest at Scale
Sri Ambati
 

What's hot (20)

PDF
Jan vitek distributedrandomforest_5-2-2013
Sri Ambati
 
PDF
CudaTree (GTC 2014)
Alex Rubinsteyn
 
PPTX
Session 06 machine learning.pptx
bodaceacat
 
PDF
Random forest using apache mahout
Gaurav Kasliwal
 
PPTX
Machine Learning - Dummy Variable Conversion
Andrew Ferlitsch
 
PDF
Introduction to Data Analytics with R
Wei Zhong Toh
 
PDF
Gradient Boosted Regression Trees in scikit-learn
DataRobot
 
PPTX
Decision trees
Jagjit Wilku
 
PDF
Overview of tree algorithms from decision tree to xgboost
Takami Sato
 
PPTX
Improvement of id3 algorithm based on simplified information entropy and coor...
MdAhasanulAlam
 
ODP
Machine Learning with Decision trees
Knoldus Inc.
 
PDF
CVPR2015 reading "Global refinement of random forest"
Akisato Kimura
 
PDF
[系列活動] Machine Learning 機器學習課程
台灣資料科學年會
 
PPTX
Comparison Study of Decision Tree Ensembles for Regression
Seonho Park
 
PPT
Chapter II.6 (Book Part VI) Learning
butest
 
DOCX
Dystopian infogramchecklist
megbegnaud
 
PPTX
Machine Learning
Girish Khanzode
 
PDF
Introduction to Some Tree based Learning Method
Honglin Yu
 
PDF
Ml intro
Si Krishan
 
PPTX
Self taught clustering
SOYEON KIM
 
Jan vitek distributedrandomforest_5-2-2013
Sri Ambati
 
CudaTree (GTC 2014)
Alex Rubinsteyn
 
Session 06 machine learning.pptx
bodaceacat
 
Random forest using apache mahout
Gaurav Kasliwal
 
Machine Learning - Dummy Variable Conversion
Andrew Ferlitsch
 
Introduction to Data Analytics with R
Wei Zhong Toh
 
Gradient Boosted Regression Trees in scikit-learn
DataRobot
 
Decision trees
Jagjit Wilku
 
Overview of tree algorithms from decision tree to xgboost
Takami Sato
 
Improvement of id3 algorithm based on simplified information entropy and coor...
MdAhasanulAlam
 
Machine Learning with Decision trees
Knoldus Inc.
 
CVPR2015 reading "Global refinement of random forest"
Akisato Kimura
 
[系列活動] Machine Learning 機器學習課程
台灣資料科學年會
 
Comparison Study of Decision Tree Ensembles for Regression
Seonho Park
 
Chapter II.6 (Book Part VI) Learning
butest
 
Dystopian infogramchecklist
megbegnaud
 
Machine Learning
Girish Khanzode
 
Introduction to Some Tree based Learning Method
Honglin Yu
 
Ml intro
Si Krishan
 
Self taught clustering
SOYEON KIM
 
Ad

Similar to Machine learning (20)

PDF
know Machine Learning Basic Concepts.pdf
hemangppatel
 
PDF
Presentation-19.08.2024hvug7gugyvuvugugugugugug
amanna7980
 
PDF
Machine Learning ebook.pdf
HODIT12
 
PDF
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
MostafaHazemMostafaa
 
PPT
Useful Techniques in Artificial Intelligence
Ila Group
 
PDF
Machine Learning : why we should know and how it works
Kevin Lee
 
PDF
When Classifier Selection meets Information Theory: A Unifying View
Mohamed Farouk
 
PDF
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
PyData
 
PPTX
Introduction to Machine Learning
AI Summary
 
PDF
Machine learning for_finance
Stefan Duprey
 
PDF
IndabaX Ghana Poster.pdf
kwadwoAmedi
 
PDF
LR2. Summary Day 2
Machine Learning Valencia
 
PPTX
Machine learning ppt unit one syllabuspptx
VenkateswaraBabuRavi
 
PPT
Machine Learning: Decision Trees Chapter 18.1-18.3
butest
 
PPT
Machine Learning: Foundations Course Number 0368403401
butest
 
PPTX
Classification: MNIST, training a Binary classifier, performance measure, mul...
BMS Institute of Technology and Management
 
PPT
Introduction to Machine Learning.
butest
 
PDF
Deep Learning: concepts and use cases (October 2018)
Julien SIMON
 
PPT
Introduction
butest
 
PDF
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
Association for Computational Linguistics
 
know Machine Learning Basic Concepts.pdf
hemangppatel
 
Presentation-19.08.2024hvug7gugyvuvugugugugugug
amanna7980
 
Machine Learning ebook.pdf
HODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
MostafaHazemMostafaa
 
Useful Techniques in Artificial Intelligence
Ila Group
 
Machine Learning : why we should know and how it works
Kevin Lee
 
When Classifier Selection meets Information Theory: A Unifying View
Mohamed Farouk
 
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
PyData
 
Introduction to Machine Learning
AI Summary
 
Machine learning for_finance
Stefan Duprey
 
IndabaX Ghana Poster.pdf
kwadwoAmedi
 
LR2. Summary Day 2
Machine Learning Valencia
 
Machine learning ppt unit one syllabuspptx
VenkateswaraBabuRavi
 
Machine Learning: Decision Trees Chapter 18.1-18.3
butest
 
Machine Learning: Foundations Course Number 0368403401
butest
 
Classification: MNIST, training a Binary classifier, performance measure, mul...
BMS Institute of Technology and Management
 
Introduction to Machine Learning.
butest
 
Deep Learning: concepts and use cases (October 2018)
Julien SIMON
 
Introduction
butest
 
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
Association for Computational Linguistics
 
Ad

More from Nicolas Rougier (11)

PDF
The Art of Braincrafting
Nicolas Rougier
 
PDF
Neural fields, a cognitive approach
Nicolas Rougier
 
PDF
SIGGRAPH 2018 - Digital typography
Nicolas Rougier
 
PDF
Modern OpenGL scientific visualization
Nicolas Rougier
 
PDF
Open science
Nicolas Rougier
 
PDF
On ne voit que ce que l’on regarde (French)
Nicolas Rougier
 
PDF
Cybercriminalité (French)
Nicolas Rougier
 
PDF
Where is my mind?
Nicolas Rougier
 
PDF
Scientific visualization
Nicolas Rougier
 
PDF
ReScience
Nicolas Rougier
 
PDF
One critic, two actors, one decision
Nicolas Rougier
 
The Art of Braincrafting
Nicolas Rougier
 
Neural fields, a cognitive approach
Nicolas Rougier
 
SIGGRAPH 2018 - Digital typography
Nicolas Rougier
 
Modern OpenGL scientific visualization
Nicolas Rougier
 
Open science
Nicolas Rougier
 
On ne voit que ce que l’on regarde (French)
Nicolas Rougier
 
Cybercriminalité (French)
Nicolas Rougier
 
Where is my mind?
Nicolas Rougier
 
Scientific visualization
Nicolas Rougier
 
ReScience
Nicolas Rougier
 
One critic, two actors, one decision
Nicolas Rougier
 

Recently uploaded (20)

PDF
A Complete Guide to Data Migration Services for Modern Businesses
Aurnex
 
PPTX
Pakistan’s Leading Manpower Export Agencies for Qatar
Glassrooms Dubai
 
PDF
Alan Stalcup - Principal Of GVA Real Estate Investments
Alan Stalcup
 
PDF
Using Innovative Solar Manufacturing to Drive India's Renewable Energy Revolu...
Insolation Energy
 
PPTX
Brain Based Enterprises - Harmonising Man, Woman and Machine
Peter Cook
 
PDF
Retinal Disorder Treatment Market 2030: The Impact of Advanced Diagnostics an...
Kumar Satyam
 
PDF
Infrastructure and geopolitics.AM.ENG.docx.pdf
Andrea Mennillo
 
PDF
GenAI for Risk Management: Refresher for the Boards and Executives
Alexei Sidorenko, CRMP
 
PPTX
Memorandum and articles of association explained.pptx
Keerthana Chinnathambi
 
PPTX
E-Way Bill under GST – Transport & Logistics.pptx
Keerthana Chinnathambi
 
PDF
High Capacity Core IC Pneumatic Spec-Sheet
Forklift Trucks in Minnesota
 
PDF
India Cold Chain Storage And Logistics Market: From Farm Gate to Consumer – T...
Kumar Satyam
 
PDF
NewBase 26 July 2025 Energy News issue - 1806 by Khaled Al Awadi_compressed.pdf
Khaled Al Awadi
 
PDF
Top 10 Corporates in India Investing in Sustainable Energy.pdf
Essar Group
 
PPTX
Final PPT on DAJGUA, EV Charging, Meter Devoloution, CGRF, Annual Accounts & ...
directord
 
PDF
Gregory Felber - An Accomplished Underwater Marine Biologist
Gregory Felber
 
PPTX
E-commerce and its impact on business.
pandeyranjan5483
 
PPTX
Appreciations - July 25.pptxffsdjjjjjjjjjjjj
anushavnayak
 
PPTX
Virbyze_Our company profile_Preview.pptx
myckwabs
 
PPTX
Chapter 3 Distributive Negotiation: Claiming Value
badranomar1990
 
A Complete Guide to Data Migration Services for Modern Businesses
Aurnex
 
Pakistan’s Leading Manpower Export Agencies for Qatar
Glassrooms Dubai
 
Alan Stalcup - Principal Of GVA Real Estate Investments
Alan Stalcup
 
Using Innovative Solar Manufacturing to Drive India's Renewable Energy Revolu...
Insolation Energy
 
Brain Based Enterprises - Harmonising Man, Woman and Machine
Peter Cook
 
Retinal Disorder Treatment Market 2030: The Impact of Advanced Diagnostics an...
Kumar Satyam
 
Infrastructure and geopolitics.AM.ENG.docx.pdf
Andrea Mennillo
 
GenAI for Risk Management: Refresher for the Boards and Executives
Alexei Sidorenko, CRMP
 
Memorandum and articles of association explained.pptx
Keerthana Chinnathambi
 
E-Way Bill under GST – Transport & Logistics.pptx
Keerthana Chinnathambi
 
High Capacity Core IC Pneumatic Spec-Sheet
Forklift Trucks in Minnesota
 
India Cold Chain Storage And Logistics Market: From Farm Gate to Consumer – T...
Kumar Satyam
 
NewBase 26 July 2025 Energy News issue - 1806 by Khaled Al Awadi_compressed.pdf
Khaled Al Awadi
 
Top 10 Corporates in India Investing in Sustainable Energy.pdf
Essar Group
 
Final PPT on DAJGUA, EV Charging, Meter Devoloution, CGRF, Annual Accounts & ...
directord
 
Gregory Felber - An Accomplished Underwater Marine Biologist
Gregory Felber
 
E-commerce and its impact on business.
pandeyranjan5483
 
Appreciations - July 25.pptxffsdjjjjjjjjjjjj
anushavnayak
 
Virbyze_Our company profile_Preview.pptx
myckwabs
 
Chapter 3 Distributive Negotiation: Claiming Value
badranomar1990
 

Machine learning

  • 1. Machine Learning Nicolas P. Rougier July 2, 2018 Inria – Institute of Neurodegenerative Diseases – Bordeaux
  • 2. What to learn? Considering an input x (that can be an image, a text, a vector, a sequence, a set of features, a scalar, etc.) Classification To find the category Cx of some input x Regression To find the value f(x) of some input x Clustering To divide all inputs {x} into distinct groups Optimization To find a sequence {ai} such as to optimize ∑ f(ai) 1
  • 3. Classification We know that (x1, y1) is blue, (x2, y2) is blue, (x3, y3) is red., etc. What is the color of (x, y)? 2
  • 4. Regression We know that f(x1) = y1, f(x2) = y2, f(x3) = y3, etc. x What is the value of f(x)? 3
  • 5. Clustering We have (x1, y1), (x2, y2), (x3, y3), etc. What group (x, y) belongs to? 4
  • 6. Optimization You chose C and got no reward, you chose F and got no reward, you chose N and got no reward, etc. A B C D E F G H I J K L M N O P What do you choose next? 5
  • 7. How to learn? Supervised learning To learn a function that maps an input to an output based on example input-output pairs Reinforcement learning To find a policy that maps states to action such as to maximize long-term reward Unsupervised learning To infer a function that describes the structure of some (unlabeled) data 6
  • 8. Machine learning ̸= Human learning To learn anything, at any time, using any method and a few samples. Explicit & implicit learning, lifelong learning, imitation learning, one-shot learning, declarative and procedural memory, short and long term memory, working memory, forgetting, motivation, emotion, etc. 7
  • 9. The machine-learning pipeline Data (60% of your time) Analysis, acquisition, cleaning, preparation, augmentation, balance Model (30% of your time) Hyperparameters, pre-training, training, tweaking, testing Deployment (10% of your time) Optimization, dockerization, evaluation 8
  • 10. Common pitfalls Data Small, unbalanced, biased, leakage, variance, etc Model Initialization, local minima, curse of dimensionality, etc Deployment Scalability, fragility, ethics, etc 9
  • 11. How big is “big”? Deep convolutional network (Hinton, 2010) • 5 convolutional layers • 650,000 units • 60,000,000 parameters • 1,200,000 train images • 100,000 test images • 1,000 classes Latest OpenAI DOTA player (June 2018): 180 years worth of games, 256 GPUs and 128,000 CPU cores. 10
  • 12. Do I need deep learning? If all you have is a hammer, everything looks like a nail. 11
  • 13. Conclusion Lot of challenges ahead • Learn with a few samples • Lifelong learning • Transfer learning • Unbiased learning • Robust learning • Unsupervised learning • Reinforcement learning Lot of opportunities as well... 12