Counterfactual Ensembles for Interpretable Churn Prediction: From Real-World to Privacy-Preserving Synthetic Data
Samuele Tonati;Marzio Di Vece;Fosca Giannotti;Roberto Pellungrini
2025
Abstract
Counterfactual explanations identify minimal input changes needed to alter a machine learning model’s prediction, offering actionable insights in tasks like churn analysis. However, existing methods often produce counterfactuals that vary in quality, coherence, and plausibility, limiting their practical value. We propose an ensemble evaluation framework that integrates multiple generation techniques and ranks their outputs using a tunable scoring function balancing multiple relevant metrics. Our approach addresses two key deployment scenarios: (i) in-house churn analysis, where decision-makers can interactively adjust scoring weights for tailored, user-driven explanations; and (ii) outsourced churn prediction, where counterfactuals must be generated on synthetic data to preserve privacy while remaining representative of real cases. Experiments on benchmark churn datasets demonstrate that our ensemble approach improves the consistency, interpretability, and utility of counterfactuals across both real and synthetic settings, supporting more reliable and privacy-aware decision-making.

| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| s10994-025-06880-4.pdf | Open access | Published version | Creative Commons | 3.43 MB | Adobe PDF |
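
The abstract describes ranking counterfactuals from several generators with a tunable scoring function. The sketch below illustrates that general idea only, as a normalized weighted sum; it is not the authors' implementation. The metric names (`proximity`, `sparsity`, `plausibility`), the generator labels, the assumption that every metric is scaled to [0, 1] with higher meaning better, and the helper names `score` and `rank` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A counterfactual produced by one generator (labels are hypothetical)."""
    method: str
    metrics: dict  # metric name -> value in [0, 1], higher is better (assumed)


def score(candidate, weights):
    """Weighted average of the candidate's metrics under user-chosen weights."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted = sum(w * candidate.metrics.get(name, 0.0)
                   for name, w in weights.items())
    return weighted / total


def rank(candidates, weights):
    """Return candidates sorted from best to worst score."""
    return sorted(candidates, key=lambda c: score(c, weights), reverse=True)


# Illustrative values only, not results from the paper.
candidates = [
    Candidate("generator_a", {"proximity": 0.9, "sparsity": 0.5, "plausibility": 0.4}),
    Candidate("generator_b", {"proximity": 0.6, "sparsity": 0.8, "plausibility": 0.9}),
]

equal_weights = {"proximity": 1.0, "sparsity": 1.0, "plausibility": 1.0}
proximity_heavy = {"proximity": 5.0, "sparsity": 1.0, "plausibility": 1.0}

print([c.method for c in rank(candidates, equal_weights)])
print([c.method for c in rank(candidates, proximity_heavy)])
```

With equal weights the second candidate ranks first; shifting weight toward proximity reverses the order. This is the kind of interactive adjustment the in-house scenario refers to.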



