SlideShare a Scribd company logo
Explore, Explain,
and Debug
aka Interpretable Machine Learning
Przemysław Biecek
Why should we care?
Explore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine Learning
https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm
https://www.massdevice.com/report-ibm-watson-delivered-unsafe-and-inaccurate-
cancer-recommendations/
https://www.wired.com/2015/10/can-learn-epic-failure-google-flu-trends/
• “You don’t see a lot of skepticism,” she says. “The algorithms are like shiny new
toys that we can’t resist using. We trust them so much that we project meaning on to
them.”
• Ultimately algorithms, according to O’Neil, reinforce discrimination and widen
inequality, “using people’s fear and trust of mathematics to prevent them from
asking questions”.
https://www.theguardian.com/books/2016/oct/27/cathy-oneil-weapons-of-math-
destruction-algorithms-big-data !8
Cathy O'Neil:
The era of blind faith
in big data must end
black boxes
Why do we need explanations for complex models?
Right to explanation
!9
Why do we need explanations for complex models?
Explore, Explain, and Debug aka Interpretable Machine Learning
DARPA cares
Defense Advanced Research
Projects Agency
Explore, Explain, and Debug aka Interpretable Machine Learning
Explore, Explain, and Debug aka Interpretable Machine Learning
We should too
Domain
understanding
Predictive
modeling
Validation and
Justification
EDA, transformations Linear models MSE, p-values
Shift in our focus: Statistics
Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
EDA, transformations Linear models MSE, p-values
Simple EDA Lots of models + optimisation
Test/train
Cross Validation
Shift in our focus: Machine Learning
Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
EDA, transformations Linear models MSE, p-values
Simple EDA
Test/train
Cross Validation
Simple EDA AutoML XAI, Fairness, Ethics
Validation and
Justification
Shift in our focus: Human Oriented ML?
Lots of models + optimisation
What would you ask for?
Explore, Explain, and Debug aka Interpretable Machine Learning
https://kmichael08.github.io
CHATBOT:
What would you ask for?
https://kmichael08.github.io
What If?
Why?
What happened
to similar cases?
CHATBOT:
What would you ask for?
Why?
https://www.encyclopedia-titanica.org/
What are his odds of surviving?
Random Forest prediction: 0.422
Input:
4 years old passenger from 1st class. Paid 72 for the ticket
What is the contribution of each variable to the final odds?
(model: Random Forest)
iBreakDown: Uncertainty of Model Explanations for Non-additive Predictive Models
Alicja Gosiewska, Przemyslaw Biecek (2019) https://arxiv.org/abs/1903.11420v1
Random Forest prediction: 0.422
Additive attribution of model prediction via sequence of conditionings
Added values of variable l in the sequence
Final attributions
Conditional distributions, read from top to the bottom
iBreakDown: Uncertainty of Model Explanations for Non-additive Predictive Models
Alicja Gosiewska, Przemyslaw Biecek (2019) https://arxiv.org/abs/1903.11420v1
Conditional distributions, read from top to the bottom
Break Down plots
model agnostic, additive attributions
Order does matter!
IME/EXPLAIN (Robnik 2008/2010), SHAP (Lundberg 2017), Break Down (our solution 2018)
Order does matter: SHAP is an average Break Down
SHAP (SHapley Additive exPlanations) Lundberg (2017)
Order does matter: SHAP is an average Break Down
SHAP (SHapley Additive exPlanations) Lundberg (2017)
Order does matter: use it to find interactions
What If?
What If?
Input:
42 years old passenger from 1st class. Paid 72 for the ticket
Logistic regression model predicts 0.32 probability of survival
What would happen if….
What If?
We cannot see all dimensions
Ceteris Paribus Profiles
Individual Conditional Expectations
Champion - Challenger analysis
Interactive explanations for
a better model - human interface
Explanations for
Meta Model
Meta models
Defaults (package
defaults (Def.P) and
optimal defaults
(Def.O)),
tunability of the
hyperparameters with
the package defaults
(Tun.P) and our
optimal defaults
(Tun.O) as reference
and tuning space
quantiles (q0.05 and
q0.95) for different
parameters of the
algorithms
Let’s focus on a single dataset: 334
For a selected class of models (here Random Forest) we can learn
how the model performance depends on hyper-parameters
Let’s focus on a single dataset: 334
For a selected class of models (here Random Forest) we can learn
how the model performance depends on hyper-parameters
FICO example
for credit scoring
https://buecker.netlify.com/slides.html#34
From: https://buecker.netlify.com/slides.html
From: https://buecker.netlify.com/slides.html
Performance for selected modeling methods
red ones are the most interesting
Partial Dependency Plot for the most
important feature
Partial Dependency Plot for the most
important feature
Break Down for a single decision
made by GBM model with 10 000 trees
SAFE:
Surrogate assisted feature
extractions for ML models
Use a good black box model (i.e. trained with AutoML) and extract an
interpretable model from it.
AutoIML
AutoIML
Use a good black box model (i.e. trained with AutoML) and extract an
interpretable model from it.
Preliminary results for the FICO data,
xgboost is used as a surrogate to construct a logistic regression
model.
Techniques for explanation and exploration
will change the way how we do predictive models
MDP : : Model Development Process
Data validation
Feature selection
Parameters tuning
Problem formulation Crisp modelling Fine tuning Maintaining
Data acquisition
Model deployment
Data cleaning
Data exploration
Sample selection
Feature engineering
Model selection
Model validation
Documentation
Communication
Data preparation
Data understanding
Model delivery
Model assembly
Model audit
Model benchmarking
Iterations P1 C1 C2 F1 F2 M1 M2 M3
time
Techniques for explanation and exploration
will change the way how we do predictive models
IML in R: DALEX, iml, mlr3vis(?), …
IML in python: ELI5, skater, xai, SHAP, lime, …
Other tools: H2O, …
An Introduction to Machine Learning Interpretability
Navdeep Gill, Patrick Hall

https://www.h2o.ai/oreilly-mli-booklet-2019/

Interpretable Machine Learning
Christoph Molnar

https://christophm.github.io/interpretable-ml-book/

Predictive Models: Explore, Explain, and Debug
Przemyslaw Biecek and Tomasz Burzykowski

https://pbiecek.github.io/PM_VEE/
Questions?

More Related Content

PDF
XAI or DIE at Data Science Summit 2019
Przemek Biecek
 
PDF
Responsible Machine Learning at Data Science Summit 2020
Przemek Biecek
 
PDF
Train, explain, acclaim. Build a good model in three steps
Przemek Biecek
 
PDF
Explain! Or I will sue you!
Przemek Biecek
 
PDF
Machine Learning Interpretability
inovex GmbH
 
PDF
DC02. Interpretation of predictions
Anton Kulesh
 
PDF
Interpretable machine learning : Methods for understanding complex models
Manojit Nandi
 
PDF
Ideas on Machine Learning Interpretability
Sri Ambati
 
XAI or DIE at Data Science Summit 2019
Przemek Biecek
 
Responsible Machine Learning at Data Science Summit 2020
Przemek Biecek
 
Train, explain, acclaim. Build a good model in three steps
Przemek Biecek
 
Explain! Or I will sue you!
Przemek Biecek
 
Machine Learning Interpretability
inovex GmbH
 
DC02. Interpretation of predictions
Anton Kulesh
 
Interpretable machine learning : Methods for understanding complex models
Manojit Nandi
 
Ideas on Machine Learning Interpretability
Sri Ambati
 

What's hot (6)

PPTX
Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai
Sri Ambati
 
PDF
The Incredible Disappearing Data Scientist
Rebecca Bilbro
 
PDF
Interpretable Machine Learning
inovex GmbH
 
PDF
Machine learning tutorial
AshokKumarC18
 
PDF
Building Data Apps with Python
Benjamin Bengfort
 
PDF
Testing the Intelligence of your AI
Iosif Itkin
 
Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai
Sri Ambati
 
The Incredible Disappearing Data Scientist
Rebecca Bilbro
 
Interpretable Machine Learning
inovex GmbH
 
Machine learning tutorial
AshokKumarC18
 
Building Data Apps with Python
Benjamin Bengfort
 
Testing the Intelligence of your AI
Iosif Itkin
 
Ad

Similar to Explore, Explain, and Debug aka Interpretable Machine Learning (20)

PDF
Human-Centered Interpretable Machine Learning
Przemek Biecek
 
PDF
Intepretable Machine Learning
Ankit Tewari
 
PDF
Keepler | Understanding your own predictive models
Keepler Data Tech
 
PPTX
Interpretable ML
Mayur Sand
 
PDF
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech
 
PPTX
Responsible AI in Industry: Practical Challenges and Lessons Learned
Krishnaram Kenthapadi
 
PDF
Can Machine Learning Models be Trusted? Explaining Decisions of ML Models
Darek Smyk
 
PDF
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
PPTX
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Sri Ambati
 
PDF
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Raheel Ahmad
 
PPTX
Explainable Machine Learning (Explainable ML)
Hayim Makabee
 
PDF
Tf itpbapm
Shannon Gallagher
 
PPTX
Hima_Lakkaraju_XAI_ShortCourse.pptx
PhanThDuy
 
PDF
Explainability for Learning to Rank
Sease
 
PDF
C3 w5
Ajay Taneja
 
PPTX
algorithmic-decisions, fairness, machine learning, provenance, transparency
Paolo Missier
 
PDF
Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo...
Sri Ambati
 
PPTX
Interpretable Machine Learning
Sri Ambati
 
PPTX
Scott Lundberg, Microsoft Research - Explainable Machine Learning with Shaple...
Sri Ambati
 
PDF
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
Human-Centered Interpretable Machine Learning
Przemek Biecek
 
Intepretable Machine Learning
Ankit Tewari
 
Keepler | Understanding your own predictive models
Keepler Data Tech
 
Interpretable ML
Mayur Sand
 
Keepler Data Tech | Entendiendo tus propios modelos predictivos
Keepler Data Tech
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Krishnaram Kenthapadi
 
Can Machine Learning Models be Trusted? Explaining Decisions of ML Models
Darek Smyk
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Sri Ambati
 
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Raheel Ahmad
 
Explainable Machine Learning (Explainable ML)
Hayim Makabee
 
Tf itpbapm
Shannon Gallagher
 
Hima_Lakkaraju_XAI_ShortCourse.pptx
PhanThDuy
 
Explainability for Learning to Rank
Sease
 
algorithmic-decisions, fairness, machine learning, provenance, transparency
Paolo Missier
 
Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo...
Sri Ambati
 
Interpretable Machine Learning
Sri Ambati
 
Scott Lundberg, Microsoft Research - Explainable Machine Learning with Shaple...
Sri Ambati
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
Ad

Recently uploaded (20)

PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 

Explore, Explain, and Debug aka Interpretable Machine Learning