Counterfactual explanations identify minimal input changes needed to alter a machine learning model’s prediction, offering actionable insights in tasks like churn analysis. However, existing methods often produce counterfactuals that vary in quality, coherence, and plausibility, limiting their practical value. We propose an ensemble evaluation framework that integrates multiple generation techniques and ranks their outputs using a tunable scoring function balancing multiple relevant metrics. Our approach addresses two key deployment scenarios: (i) in-house churn analysis, where decision-makers can interactively adjust scoring weights for tailored, user-driven explanations; and (ii) outsourced churn prediction, where counterfactuals must be generated on synthetic data to preserve privacy while remaining representative of real cases. Experiments on benchmark churn datasets demonstrate that our ensemble approach improves the consistency, interpretability, and utility of counterfactuals across both real and synthetic settings, supporting more reliable and privacy-aware decision-making.

Counterfactual Ensembles for Interpretable Churn Prediction: From Real-World to Privacy-Preserving Synthetic Data

Samuele Tonati;Marzio Di Vece;Fosca Giannotti;Roberto Pellungrini
2025

Abstract

Counterfactual explanations identify minimal input changes needed to alter a machine learning model’s prediction, offering actionable insights in tasks like churn analysis. However, existing methods often produce counterfactuals that vary in quality, coherence, and plausibility, limiting their practical value. We propose an ensemble evaluation framework that integrates multiple generation techniques and ranks their outputs using a tunable scoring function balancing multiple relevant metrics. Our approach addresses two key deployment scenarios: (i) in-house churn analysis, where decision-makers can interactively adjust scoring weights for tailored, user-driven explanations; and (ii) outsourced churn prediction, where counterfactuals must be generated on synthetic data to preserve privacy while remaining representative of real cases. Experiments on benchmark churn datasets demonstrate that our ensemble approach improves the consistency, interpretability, and utility of counterfactuals across both real and synthetic settings, supporting more reliable and privacy-aware decision-making.
2025
Settore INF/01 - Informatica
Settore INFO-01/A - Informatica
   PNRR Partenariati Estesi - FAIR - Future artificial intelligence research.
   Ministero della pubblica istruzione, dell'università e della ricerca

   PNRR Infrastrutture di Ricerca - SoBigData.it - Strengthening the Italian RI for Social Mining and Big Data Analytics.
   SoBigData.it
   Ministero della pubblica istruzione, dell'università e della ricerca
   IR0000013

   Science and technology for the explanation of AI decision making
   XAI
   European Commission
   H2020
   834756

   It takes two to tango: a synergistic approach to human-machine decision making
   TANGO
   European Commission
   Grant Agreement n. 101120763
File in questo prodotto:
File Dimensione Formato  
s10994-025-06880-4.pdf

accesso aperto

Tipologia: Published version
Licenza: Creative Commons
Dimensione 3.43 MB
Formato Adobe PDF
3.43 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/156684
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex 0
social impact