SlideShare a Scribd company logo
Train, explain, acclaim
Build a good model in three steps
AI & NLP workshop day
Przemysław Biecek
Samsung SRPOL
Warsaw University of Technology
Train
with AutoML
Train, explain, acclaim.  Build a good model in three steps
Doing your n+1 dataset …, should you take into account what
you have learnt on previous datasets?
Train, explain, acclaim.  Build a good model in three steps
Defaults (package
defaults (Def.P) and
optimal defaults
(Def.O)),
tunability of the
hyperparameters
with the package
defaults (Tun.P) and
our optimal defaults
(Tun.O) as reference
and tuning space
quantiles (q0.05 and
q0.95) for different
parameters of the
algorithms
https://autodl.chalearn.org/
Sequential Model-based
Algorithm Configuration
The open-source solution of AAD
Freiburg uses a heterogeneous
ensemble of learning machines
(AutoSklearn (Feurer et al.,
2015a,c)) combining the machine
learning library scikit-learn
(Pedregosa et al., 2011) with the
state-of-the-art SMBO method
SMAC to find suitable machine
learning pipelines for a data set at
hand. This is essentially a
reimplementation of Auto-WEKA.
To speed up the optimization
process they employed a
metalearning technique (Feurer et
al., 2015b) which starts SMAC
from promising configurations of
scikit-learn. Furthermore, they
used the outputs of all models and
combined these into an ensemble
using ensemble selection.
This work has been supported in part by the Defense Advanced Research Projects Agency
(DARPA) Data-Driven Discovery of Models (D3M) Program.
https://github.com/automl/HpBandSter
http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf
https://towardsdatascience.com/shallow-understanding-on-bayesian-optimization-324b6c1f7083
Train, explain, acclaim.  Build a good model in three steps
This work has been supported in part by the Defense Advanced Research Projects Agency
(DARPA) Data-Driven Discovery of Models (D3M) Program.
https://github.com/hibayesian/awesome-automl-papers
https://www.ml4aad.org/automl/literature-on-neural-architecture-search/
Auto-Keras

auto-sklearn

automl-gs

Auto-Weka

FeatureTools

h2o automl

Ludwig

mljar-supervised

Neural Network
Intelligence (NNI)

tpot

TransmografAI

Auto_ml
https://www.linkedin.com/in/igor2k/
Explain
with Interpretable Machine Learning
Train, explain, acclaim.  Build a good model in three steps
https://www.wired.com/2015/10/can-learn-epic-failure-google-flu-trends/
• “You don’t see a lot of skepticism,” she says. “The algorithms are like shiny new
toys that we can’t resist using. We trust them so much that we project meaning on to
them.”
• Ultimately algorithms, according to O’Neil, reinforce discrimination and widen
inequality, “using people’s fear and trust of mathematics to prevent them from
asking questions”.
https://www.theguardian.com/books/2016/oct/27/cathy-oneil-weapons-of-math-
destruction-algorithms-big-data !22
Cathy O'Neil:
The era of blind faith
in big data must end
black boxes
Why do we need explanations for complex models?
Right to explanation
!23
Why do we need explanations for complex models?
https://panoptykon.org/wiadomosc/prawo-do-wyjasnienia-decyzji-kredytowej-dla-
kazdego-sukces-panoptykonu
Train, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim.  Build a good model in three steps
!29
Local Model approximations
"Why Should I Trust You?" Explaining the Predictions of Any Classifier.
Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin (2016). https://arxiv.org/pdf/1602.04938.pdf
Port to R: Thomas Lin Pedersen (2017) https://github.com/thomasp85/lime
Other implementations: live (Staniak, Biecek 2018) and iml (Molnar 2018)
A different approach to model explanation is to locally approximate
the complex black-box model with an easier to interpret white-box
model constructed on interpretable features.
!30
Local Model approximations
1.Generate a fake dataset around x.
2.Use black-box estimator to get target values y.
3.Train a new white-box estimator for (y,x).
4.Check prediction quality of a white-box classifier.
5.Use white-box estimator as an explanation of black-box model.
Properties:
model-agnostic
interpretable representation
local fidelity
Train, explain, acclaim.  Build a good model in three steps
LIME / live
vs

Break Down
Model debugging
Biecek P (2018). “DALEX: Explainers for Complex Predictive Models in R.”
Journal of Machine Learning Research, 19(84), 1-5. URL:http://jmlr.org/papers/v19/18-416.html>
What would you explain?
https://kmichael08.github.io
What would you explain?
https://kmichael08.github.io
What If?
Why?
Train, explain, acclaim.  Build a good model in three steps
https://www.encyclopedia-titanica.org/
What are the odds of surviving?
What If?
Ceteris Paribus
Individual Conditional Expectations
Ceteris Paribus
Individual Conditional Expectations
Champion - Challenger
Champion - Challenger
Train, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim.  Build a good model in three steps
Why?
iBreakDown: Uncertainty of Model Explanations for Non-additive Predictive Models
Alicja Gosiewska, Przemyslaw Biecek (2019) https://arxiv.org/abs/1903.11420v1
Train, explain, acclaim.  Build a good model in three steps
SHAP (SHapley Additive exPlanations) Lundberg (2017)
IME complexity is O(2
p
). Shapley values are known for some
time and we have methods to approximate them efficiently.
Order does matter
https://github.com/MI2DataLab/modelDown
https://chudekm.shinyapps.io/model_explorer_example/
https://breakdeeper.netlify.com/
What If? Interactive with D3
What If? Comparison of models between languages
Acclaim
With Human Centered AI
https://www.massdevice.com/report-ibm-watson-delivered-unsafe-and-inaccurate-
cancer-recommendations/
Train, explain, acclaim.  Build a good model in three steps
https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm
Train, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim.  Build a good model in three steps
On Explainable Machine Learning Misconceptions and A More Human-Centered
Machine Learning; Patrick Hall
Predictive Models: Visual Exploration, Explanation and Debugging
Production
Development
Concept ValidateForge
Model debugging
Model development is an
iterative process. Each
iteration brings new insights.
Early phases:
Crisp modeling, general
understanding of the problem.
Medium phases:
Selective modeling, here we
select the best type of model.
Late phases:
Fine tuning of model
parameters or variable
engineering .
In each iteration model development starts with some concepts, ideas, then the model is trained and finally
model needs to be validated.
Predictions need to be
explained.
Here the instance level
explanation helps.
With time the model
performance may deteriorate,
thus it requires constant
monitoring, e.g. with the drifter
package.
Drop-out loss
Variable importance
GBM
baseline
ditrict
surface
floor
construction.year
no. rooms
full model
250 500 750 1000 1250
3000
3200
3400
surface
prediction
3600
3800
Surface
6019 100 148
Random Forest
feature influence
Variables attributions
GBM
intercept
district: Srodmiescie
surface: 22
no.rooms: 2
construction.year: 2005
floor: 1
prediction
Random Forest
intercept
district: Srodmiescie
surface: 22
no.rooms: 2
construction.year: 2005
floor: 1
prediction
LM
intercept
district: Srodmiescie
surface: 22
no.rooms: 2
construction.year: 2005
floor: 1
prediction
2000 2100 2200 2300 2400 2500 2600 2700 2800 2900
2046
2614.9
+358
+160
+78
-39.5
+12.4
2800
2425
-338
-112
+74
-53
+26
2378
2324.9
-239
+160
+68
+39.5
-12.4
feature influence
Variables attributions
GBM
intercept
district: Srodmiescie
surface: 22
no.rooms: 2
construction.year: 2005
floor: 1
prediction
2000 2100 2200 2300 2400 2500 2600 2700
2046
2614.9
+358
+160
+78
-39.5
+12.4
3000
3200
3400
surface
prediction
3600
3800
Surface
6019 100 148
Random Forest
3000
3200
3400
2 4 6
prediction
3600
3800
3000
3200
3400
3600
3800
Surface
1920 1940 1960 1980 2010
no.roomsfloor
surfaceconstruction.year
4020 80 120 144
2.50.8 1.15.0 7.5 11.2
Variable selection
Feature engineering
Random Forest
Factor Merger
Srodmiescie
Ochota
Mokotow
Zoliborz
Ursus
Bielany
Bemowo
Wola
Ursynow
Praga
0 2000 4000
group frequencyname price mean
5109.19
3954.83
3946.96
3918.55
3058.52
3045.79
3028.58
3011.69
3009.72
2991.48
nr
1
2
2
3
4
4
5
6
6
7
Prediction explanations What-If analysis Concept drift detection
explain
Train, explain, acclaim.  Build a good model in three steps
Like it? Let us know! Find a bug? Fire an issue!

More Related Content

What's hot (20)

PDF
Interpretable machine learning : Methods for understanding complex models
Manojit Nandi
 
PPTX
Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai
Sri Ambati
 
DOCX
dod abstract
Barbara Jean Neal
 
PDF
Interpretable Machine Learning
inovex GmbH
 
PDF
Keepler | Understanding your own predictive models
Keepler Data Tech
 
PDF
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech
 
PDF
Philip Rathle- Graph Boosted Artificial Intelligence
Neo4j
 
PDF
The Incredible Disappearing Data Scientist
Rebecca Bilbro
 
PPTX
Calin Constantinov - Neo4j - Bucharest Big Data Week Meetup - Bucharest 2018
Calin Constantinov
 
PDF
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
Codiax
 
PPTX
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Sri Ambati
 
PDF
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
Sanjana Chowdhury
 
PPTX
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Sri Ambati
 
PDF
Workshop on Machine Learning
Harshad Saykhedkar
 
PPTX
Models vs Reality: Quest for the Roots of Complexity
Julian Warszawski
 
PDF
AI Basics for Professionals to Help Begin Their AI Journey
Deepak Sharma
 
PDF
Machine learning tutorial
AshokKumarC18
 
DOCX
Heet detroja.resume
HeetDetroja
 
PDF
BSSML16 L1. Introduction, Models, and Evaluations
BigML, Inc
 
Interpretable machine learning : Methods for understanding complex models
Manojit Nandi
 
Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai
Sri Ambati
 
dod abstract
Barbara Jean Neal
 
Interpretable Machine Learning
inovex GmbH
 
Keepler | Understanding your own predictive models
Keepler Data Tech
 
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech
 
Philip Rathle- Graph Boosted Artificial Intelligence
Neo4j
 
The Incredible Disappearing Data Scientist
Rebecca Bilbro
 
Calin Constantinov - Neo4j - Bucharest Big Data Week Meetup - Bucharest 2018
Calin Constantinov
 
Jakub Langr (University of Oxford) - Overview of Generative Adversarial Netwo...
Codiax
 
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Sri Ambati
 
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
Sanjana Chowdhury
 
Explaining Black-Box Machine Learning Predictions - Sameer Singh, Assistant P...
Sri Ambati
 
Workshop on Machine Learning
Harshad Saykhedkar
 
Models vs Reality: Quest for the Roots of Complexity
Julian Warszawski
 
AI Basics for Professionals to Help Begin Their AI Journey
Deepak Sharma
 
Machine learning tutorial
AshokKumarC18
 
Heet detroja.resume
HeetDetroja
 
BSSML16 L1. Introduction, Models, and Evaluations
BigML, Inc
 

Similar to Train, explain, acclaim. Build a good model in three steps (20)

PPTX
Synergy of Human and Artificial Intelligence in Software Engineering
Tao Xie
 
PDF
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
Matthew Sinclair
 
PDF
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
Marco Brambilla
 
PPTX
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Sri Ambati
 
PPTX
ODSC APAC 2022 - Explainable AI
Aditya Bhattacharya
 
PDF
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
PPTX
Schrefl computer abd teaching impacts.pptx
YonghoLouis1
 
PPTX
Best practices in building machine learning models in Azure ML
Zeydy Ortiz, Ph. D.
 
PDF
PoolParty Semantic Classifier
Semantic Web Company
 
PPTX
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 
PDF
Machine Learning on Streaming Data using Kafka, Beam, and TensorFlow (Mikhail...
confluent
 
PDF
Technovision
SayantanGhosh58
 
PDF
Human in the loop: a design pattern for managing teams working with ML
Paco Nathan
 
PDF
[第45回 Machine Learning 15minutes! Broadcast] Azure AI - Build 2020 Updates
Naoki (Neo) SATO
 
PDF
ODSC West 2022 – Kitbashing in ML
Bryan Bischof
 
PPTX
C0-01 OEAD0002.pptx ,msbxkasbdkbakwdbkawdka
261203dj
 
PDF
SBQS 2013 Keynote: Cooperative Testing and Analysis
Tao Xie
 
PDF
Dl applicationlandscape-mar2018-180405144127
Aravindharamanan S
 
PDF
Deep Learning: Application Landscape - March 2018
Grigory Sapunov
 
PDF
Hybrid use of machine learning and ontology
Anthony (Tony) Sarris
 
Synergy of Human and Artificial Intelligence in Software Engineering
Tao Xie
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
Matthew Sinclair
 
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
Marco Brambilla
 
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Sri Ambati
 
ODSC APAC 2022 - Explainable AI
Aditya Bhattacharya
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
Schrefl computer abd teaching impacts.pptx
YonghoLouis1
 
Best practices in building machine learning models in Azure ML
Zeydy Ortiz, Ph. D.
 
PoolParty Semantic Classifier
Semantic Web Company
 
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 
Machine Learning on Streaming Data using Kafka, Beam, and TensorFlow (Mikhail...
confluent
 
Technovision
SayantanGhosh58
 
Human in the loop: a design pattern for managing teams working with ML
Paco Nathan
 
[第45回 Machine Learning 15minutes! Broadcast] Azure AI - Build 2020 Updates
Naoki (Neo) SATO
 
ODSC West 2022 – Kitbashing in ML
Bryan Bischof
 
C0-01 OEAD0002.pptx ,msbxkasbdkbakwdbkawdka
261203dj
 
SBQS 2013 Keynote: Cooperative Testing and Analysis
Tao Xie
 
Dl applicationlandscape-mar2018-180405144127
Aravindharamanan S
 
Deep Learning: Application Landscape - March 2018
Grigory Sapunov
 
Hybrid use of machine learning and ontology
Anthony (Tony) Sarris
 
Ad

Recently uploaded (20)

PDF
Knee Extensor Mechanism Injuries - Orthopedic Radiologic Imaging
Sean M. Fox
 
PPTX
How to Handle Salesperson Commision in Odoo 18 Sales
Celine George
 
PDF
Horarios de distribución de agua en julio
pegazohn1978
 
PDF
Reconstruct, Restore, Reimagine: New Perspectives on Stoke Newington’s Histor...
History of Stoke Newington
 
PPTX
GRADE-3-PPT-EVE-2025-ENG-Q1-LESSON-1.pptx
EveOdrapngimapNarido
 
PDF
Geographical Diversity of India 100 Mcq.pdf/ 7th class new ncert /Social/Samy...
Sandeep Swamy
 
PDF
Aprendendo Arquitetura Framework Salesforce - Dia 03
Mauricio Alexandre Silva
 
PDF
ARAL-Orientation_Morning-Session_Day-11.pdf
JoelVilloso1
 
PPTX
PATIENT ASSIGNMENTS AND NURSING CARE RESPONSIBILITIES.pptx
PRADEEP ABOTHU
 
PPTX
How to Create a PDF Report in Odoo 18 - Odoo Slides
Celine George
 
PPTX
How to Manage Large Scrollbar in Odoo 18 POS
Celine George
 
PPTX
Universal immunization Programme (UIP).pptx
Vishal Chanalia
 
PPTX
How to Create Odoo JS Dialog_Popup in Odoo 18
Celine George
 
PPTX
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
PPTX
grade 5 lesson matatag ENGLISH 5_Q1_PPT_WEEK4.pptx
SireQuinn
 
PDF
CONCURSO DE POESIA “POETUFAS – PASSOS SUAVES PELO VERSO.pdf
Colégio Santa Teresinha
 
PPTX
Identifying elements in the story. Arrange the events in the story
geraldineamahido2
 
PPTX
I AM MALALA The Girl Who Stood Up for Education and was Shot by the Taliban...
Beena E S
 
PDF
The Constitution Review Committee (CRC) has released an updated schedule for ...
nservice241
 
PDF
Biological Bilingual Glossary Hindi and English Medium
World of Wisdom
 
Knee Extensor Mechanism Injuries - Orthopedic Radiologic Imaging
Sean M. Fox
 
How to Handle Salesperson Commision in Odoo 18 Sales
Celine George
 
Horarios de distribución de agua en julio
pegazohn1978
 
Reconstruct, Restore, Reimagine: New Perspectives on Stoke Newington’s Histor...
History of Stoke Newington
 
GRADE-3-PPT-EVE-2025-ENG-Q1-LESSON-1.pptx
EveOdrapngimapNarido
 
Geographical Diversity of India 100 Mcq.pdf/ 7th class new ncert /Social/Samy...
Sandeep Swamy
 
Aprendendo Arquitetura Framework Salesforce - Dia 03
Mauricio Alexandre Silva
 
ARAL-Orientation_Morning-Session_Day-11.pdf
JoelVilloso1
 
PATIENT ASSIGNMENTS AND NURSING CARE RESPONSIBILITIES.pptx
PRADEEP ABOTHU
 
How to Create a PDF Report in Odoo 18 - Odoo Slides
Celine George
 
How to Manage Large Scrollbar in Odoo 18 POS
Celine George
 
Universal immunization Programme (UIP).pptx
Vishal Chanalia
 
How to Create Odoo JS Dialog_Popup in Odoo 18
Celine George
 
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
grade 5 lesson matatag ENGLISH 5_Q1_PPT_WEEK4.pptx
SireQuinn
 
CONCURSO DE POESIA “POETUFAS – PASSOS SUAVES PELO VERSO.pdf
Colégio Santa Teresinha
 
Identifying elements in the story. Arrange the events in the story
geraldineamahido2
 
I AM MALALA The Girl Who Stood Up for Education and was Shot by the Taliban...
Beena E S
 
The Constitution Review Committee (CRC) has released an updated schedule for ...
nservice241
 
Biological Bilingual Glossary Hindi and English Medium
World of Wisdom
 
Ad

Train, explain, acclaim. Build a good model in three steps