Two questions, always: Is the model effective? And is its reasoning acceptable?
| Part | Question we'll answer | Data | Main tool |
|---|---|---|---|
| Part 1 | How far can classic interpretability take us? | Census (tabular) | Tree, coefficients, PCA, SHAP |
| Part 2 | Does the same logic work on text and multiclass? | Emotions (text) | Multiclass SHAP |
| Part 3 | Can you run the workflow alone? | Your dataset | Independent SHAP analysis |
Features are human-readable. When the model highlights marital-status or hours-per-week, we can immediately judge whether the pattern is reasonable — or worth auditing.
              [ROOT NODE]
            feature ≤ value
             /          \
          True          False
   [left child]      [right child]
         |                 |
      [LEAF]            [LEAF]
    class: A           class: B
DecisionTreeClassifier(max_depth=3, random_state=42) — depth 3 to stay readable.
The feature at the root is the strongest first separator in the data.
Readable only because it's shallow. Grow the tree, lose the interpretability.
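A minimal sketch of this setup, using synthetic data as a stand-in for the census table (the feature names here are hypothetical):

```python
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier, export_text

# Synthetic stand-in for the census data (hypothetical features)
X, y = make_classification(n_samples=500, n_features=4, random_state=42)

tree_clf = DecisionTreeClassifier(max_depth=3, random_state=42)
tree_clf.fit(X, y)

# With max_depth=3, the printed rules ARE the model
print(export_text(tree_clf, feature_names=[f"feat_{i}" for i in range(4)]))
```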
Total impurity reduction from each feature's splits, weighted by the number of samples those splits affect, normalized to sum to 1.
Positive → class 1 · negative → class 0.
tree_clf.feature_importances_ · pipeline['logistic_regression'].coef_[0] (with StandardScaler).
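Both attributes can be read directly off fitted models. A sketch on the same kind of synthetic stand-in data (the step name `logistic_regression` mirrors the pipeline above):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=4, random_state=42)

# Impurity-based importances: non-negative, sum to 1
tree_clf = DecisionTreeClassifier(max_depth=3, random_state=42).fit(X, y)
print(tree_clf.feature_importances_)

# Scaling first makes coefficient magnitudes comparable across features
pipeline = Pipeline([
    ("scaler", StandardScaler()),
    ("logistic_regression", LogisticRegression()),
]).fit(X, y)
coefs = pipeline["logistic_regression"].coef_[0]
print(coefs)  # positive pushes toward class 1, negative toward class 0
```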
 PC2 ▲
     │  ●  ●       ○  ○  ○        ● = class 0 (≤50K)
     │ ●  ●      ○  ○  ○  ○       ○ = class 1 (>50K)
     │● ●●     ○   ○  ○           ↑ loading arrow for feature A
     │  ●     ○  ○  ○  ○          → loading arrow for feature B
     └─────────────────────► PC1
PCA is excellent for exploration. But projection loses information — never a final explanation.
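The information loss is quantifiable: `explained_variance_ratio_` says exactly how much variance the 2-D view keeps. A sketch, again on synthetic data:

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, n_features=10, random_state=42)

# Scale first: PCA directions are driven by variance
X_scaled = StandardScaler().fit_transform(X)
pca = PCA(n_components=2).fit(X_scaled)
X_2d = pca.transform(X_scaled)

# Fraction of variance each component keeps; the rest is lost in the plot
print(pca.explained_variance_ratio_)
```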
| Model | F1 score | Interpretability |
|---|---|---|
| Decision tree (depth 3) | ~0.75 | Readable rule set |
| Logistic regression | ~0.78 | Coefficients directly inspectable |
| XGBoost | ~0.81 | Not directly inspectable |
Better performance → less readable. No single tree to look at. No coefficient table. Reasoning is distributed across hundreds of trees.
xgb.XGBClassifier(eval_metric='logloss', random_state=42) — 100 trees by default, max_depth=6. This is the model we'll explain with SHAP.
For this prediction, how much did each feature contribute, relative to a baseline?
baseline prediction (expected value)
+ contribution from feature 1
+ contribution from feature 2
+ contribution from feature 3
+ ...
= model output for this person
Think of SHAP as a prediction-decomposition tool. It splits one prediction into additive pieces you can read.
TreeExplainer for XGBoost:

explainer = shap.TreeExplainer(xgb_model, data=X_train, model_output="probability")

data=X_train is the background set — the mean prediction over it becomes the expected value.

shap_values = explainer.shap_values(X_test) → a matrix (n_samples, n_features):

|  | feature 1 | feature 2 | feature 3 |
|---|---|---|---|
| person 1 | +0.12 | -0.03 | +0.01 |
| person 2 | -0.08 | +0.10 | 0.00 |
| person 3 | +0.02 | -0.01 | -0.05 |
Near zero → the feature barely influenced this particular prediction.
The expected value is simply the model's average prediction over the background data — with model_output="probability", E[f(X)] = model.predict_proba(X_background)[:, 1].mean().
model.predict_proba(X_train)[:,1].mean()
≈ 0.24 ← expected_value
(~24% earn >50K in X_train)
expected_value + Σ shap_values[person] = model.predict_proba(person)
High capital-gain → >50K. gender_Female slightly pushes toward ≤50K — worth an audit.
SHAP value  │                      •   •
for age     │         •  •  •  •        color = value of another feature
          0 ┼───────────────────────
            │    •  •  •
            │  •  •
            └───────────────────────
               low  ───────►  high
                  value of age
The force plot tells you why this specific person got this prediction.
| Caveat | What it means |
|---|---|
| Correlated features | Credit may split between twins in a messy way |
| Compute cost | TreeExplainer is fast — other explainers can be slow |
| Not causality | A strong SHAP value is association, not cause |
| Local instability | Similar people can get visibly different explanations |
| Explainer choice | Different explainers behave differently across models |
Use SHAP as a structured way to understand the model — not as absolute truth.
| Aspect | Part 1 (binary) | Part 2 (multiclass text) |
|---|---|---|
| Features | Tabular columns | Words / tokens |
| Task | Binary | 6-class (sadness · joy · fear · anger · surprise · disgust) |
| SHAP output space | Probabilities | Logits (raw class scores) |
| Explanation unit | One per prediction | One per class, per prediction |
Same toolkit — TreeExplainer, shap_values, force plots. We slow down on two new concepts: raw-score (logit) outputs, and one SHAP array per class.
xgb.XGBClassifier(objective='multi:softprob', num_class=6)
raw model scores (logits) ──softmax──► probabilities that sum to 1

| class | logit | probability after softmax |
|---|---|---|
| sadness | 3.1 | 0.75 |
| joy | 0.7 | 0.07 |
| fear | 1.0 | 0.09 |
| anger | 0.2 | 0.04 |
| surprise | -0.3 | 0.03 |
| disgust | -0.6 | 0.02 |
In multiclass, TreeExplainer explains the raw class scores — probabilities come after softmax.
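The softmax step is a one-liner in numpy, using the example logits above (probabilities rounded to two decimals):

```python
import numpy as np

# Raw class scores (logits) from the example
logits = np.array([3.1, 0.7, 1.0, 0.2, -0.3, -0.6])

# Softmax: exponentiate, then normalize so the six scores sum to 1
probs = np.exp(logits) / np.exp(logits).sum()
print(probs.round(2))  # sadness dominates
```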
shap_values
├── class 0: sadness  → array (n_samples, n_features)
├── class 1: joy      → array (n_samples, n_features)
├── class 2: fear     → array (n_samples, n_features)
├── class 3: anger    → array (n_samples, n_features)
├── class 4: surprise → array (n_samples, n_features)
└── class 5: disgust  → array (n_samples, n_features)
shap_values[3][5] — person 5, anger class (older shap: a list of per-class arrays)
shap_values[5, :, 3] — the same thing (newer shap: one array of shape (n_samples, n_features, n_classes))
Words like horrible, ungrateful push the sadness score up. Other words pull gently the other way.
A single strong term (profane) dominates. Calming words like feeling appear on the opposite side.
The point isn't just which words appear — it's which class they support in this explanation.
Fit an XGBClassifier, print a classification_report.
SHAP values are local and associative — one prediction, one person, one contribution.
shap — shap.readthedocs.io · lime · eli5 · captum (PyTorch)