Explore, Explain, and Debug aka Interpretable Machine Learning

Explore, Explain,
and Debug
aka Interpretable Machine Learning
Przemysław Biecek

https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm

https://www.massdevice.com/report-ibm-watson-delivered-unsafe-and-inaccurate-
cancer-recommendations/

https://www.wired.com/2015/10/can-learn-epic-failure-google-ﬂu-trends/

• “You don’t see a lot of skepticism,” she says. “The algorithms are like shiny new
toys that we can’t resist using. We trust them so much that we project meaning on to
them.”
• Ultimately algorithms, according to O’Neil, reinforce discrimination and widen
inequality, “using people’s fear and trust of mathematics to prevent them from
asking questions”.
https://www.theguardian.com/books/2016/oct/27/cathy-oneil-weapons-of-math-
destruction-algorithms-big-data !8
Cathy O'Neil:
The era of blind faith
in big data must end
black boxes
Why do we need explanations for complex models?

Right to explanation
!9
Why do we need explanations for complex models?

DARPA cares
Defense Advanced Research
Projects Agency

Domain
understanding
Predictive
modeling
Validation and
Justification
EDA, transformations Linear models MSE, p-values
Shift in our focus: Statistics

Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
Simple EDA Lots of models + optimisation
Test/train
Cross Validation
Shift in our focus: Machine Learning

Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
Domain
understanding
Predictive
modeling
Validation and
Justification
Simple EDA
Test/train
Cross Validation
Simple EDA AutoML XAI, Fairness, Ethics
Validation and
Justification
Shift in our focus: Human Oriented ML?
Lots of models + optimisation

https://kmichael08.github.io
CHATBOT:
What would you ask for?

https://kmichael08.github.io
What If?
Why?
What happened
to similar cases?
CHATBOT:
What would you ask for?

https://www.encyclopedia-titanica.org/
What are his odds of surviving?
Random Forest prediction: 0.422

Input:
4 years old passenger from 1st class. Paid 72 for the ticket
What is the contribution of each variable to the final odds?
(model: Random Forest)
iBreakDown: Uncertainty of Model Explanations for Non-additive Predictive Models
Alicja Gosiewska, Przemyslaw Biecek (2019) https://arxiv.org/abs/1903.11420v1
Random Forest prediction: 0.422

Additive attribution of model prediction via sequence of conditionings
Added values of variable l in the sequence
Final attributions

Conditional distributions, read from top to the bottom
iBreakDown: Uncertainty of Model Explanations for Non-additive Predictive Models
Alicja Gosiewska, Przemyslaw Biecek (2019) https://arxiv.org/abs/1903.11420v1

Conditional distributions, read from top to the bottom

Break Down plots
model agnostic, additive attributions

Order does matter!
IME/EXPLAIN (Robnik 2008/2010), SHAP (Lundberg 2017), Break Down (our solution 2018)

Order does matter: SHAP is an average Break Down
SHAP (SHapley Additive exPlanations) Lundberg (2017)

Order does matter: use it to find interactions

What If?
Input:
42 years old passenger from 1st class. Paid 72 for the ticket
Logistic regression model predicts 0.32 probability of survival
What would happen if….

What If?
We cannot see all dimensions

Ceteris Paribus Profiles
Individual Conditional Expectations

Champion - Challenger analysis

Interactive explanations for
a better model - human interface

Defaults (package
defaults (Def.P) and
optimal defaults
(Def.O)),
tunability of the
hyperparameters with
the package defaults
(Tun.P) and our
optimal defaults
(Tun.O) as reference
and tuning space
quantiles (q0.05 and
q0.95) for different
parameters of the
algorithms

Let’s focus on a single dataset: 334
For a selected class of models (here Random Forest) we can learn
how the model performance depends on hyper-parameters

FICO example
for credit scoring

https://buecker.netlify.com/slides.html#34
From: https://buecker.netlify.com/slides.html

From: https://buecker.netlify.com/slides.html

Performance for selected modeling methods
red ones are the most interesting

Partial Dependency Plot for the most
important feature

Partial Dependency Plot for the most
important feature
Break Down for a single decision
made by GBM model with 10 000 trees

SAFE:
Surrogate assisted feature
extractions for ML models

Use a good black box model (i.e. trained with AutoML) and extract an
interpretable model from it.
AutoIML

AutoIML
Use a good black box model (i.e. trained with AutoML) and extract an
interpretable model from it.
Preliminary results for the FICO data,
xgboost is used as a surrogate to construct a logistic regression
model.

Techniques for explanation and exploration
will change the way how we do predictive models

MDP : : Model Development Process
Data validation
Feature selection
Parameters tuning
Problem formulation Crisp modelling Fine tuning Maintaining
Data acquisition
Model deployment
Data cleaning
Data exploration
Sample selection
Feature engineering
Model selection
Model validation
Documentation
Communication
Data preparation
Data understanding
Model delivery
Model assembly
Model audit
Model benchmarking
Iterations P1 C1 C2 F1 F2 M1 M2 M3
time
Techniques for explanation and exploration
will change the way how we do predictive models

IML in R: DALEX, iml, mlr3vis(?), …
IML in python: ELI5, skater, xai, SHAP, lime, …
Other tools: H2O, …

An Introduction to Machine Learning Interpretability
Navdeep Gill, Patrick Hall

https://www.h2o.ai/oreilly-mli-booklet-2019/

Interpretable Machine Learning
Christoph Molnar

https://christophm.github.io/interpretable-ml-book/

Predictive Models: Explore, Explain, and Debug
Przemyslaw Biecek and Tomasz Burzykowski

https://pbiecek.github.io/PM_VEE/

Explore, Explain, and Debug aka Interpretable Machine Learning

More Related Content

What's hot (6)

Similar to Explore, Explain, and Debug aka Interpretable Machine Learning (20)

Recently uploaded (20)

Explore, Explain, and Debug aka Interpretable Machine Learning