Interpreting Federated Learning by Aggregating SHAP Explanations

Bonsignori, Valerio; Corbucci, Luca; Naretto, Francesca; Guidotti, Riccardo; Monreale, Anna

doi:10.1109/ACCESS.2026.3696009

Federated Learning trains models without transferring data outside local clients, but usually relies on non-interpretable Neural Networks. While explanations are essential for model adoption and trustworthiness, conventional explainability techniques require centralized data access, which violates Federated Learning principles.We propose a framework for Interpreting Federated Learning by Aggregating SHAP explanations, iFLASH, which introduces novel aggregation methods that significantly improve explanation quality compared to naive averaging approaches, while preserving data privacy in federated settings. iFLASH enables local Shap explainers on individual clients without exposing raw data by aggregating feature importance values rather than models or gradients. Clients compute and evaluate explanations using performance-based metrics, then send results to the server. The server weighs each client's contribution based on model performance and explanation quality, which aims to produce faithful aggregate explanations. The framework supports various aggregation strategies, adapting to different levels of data imbalance and heterogeneity. Experiments across cross-silo (12-16 clients) and cross-device (50-150 clients) scenarios demonstrate that Faithfulness-based aggregation consistently outperforms uniform averaging in cross-silo settings, EQ1 achieves higher Faithfulness than naive averaging in all the cross-silo configurations across all datasets and distributions, while quality-aware methods perform comparably in cross-device environments where the high number of clients provides sufficient averaging effect. iFLASH explanations align closely with centralised explanations in feature importance ranking and directionality, sometimes achieving better fidelity. Results demonstrate that iFLASH enables accurate, privacy-preserving explanations for domains where data cannot be centralised. We highlight that our proposal has been extensively evaluated also in a cross-device federated setting, a scenario that is overlooked in the explainable AI literature.

Interpreting Federated Learning by Aggregating SHAP Explanations

Bonsignori, Valerio;Corbucci, Luca;Naretto, Francesca;Guidotti, Riccardo;Monreale, Anna

2026

Abstract

Federated Learning trains models without transferring data outside local clients, but usually relies on non-interpretable Neural Networks. While explanations are essential for model adoption and trustworthiness, conventional explainability techniques require centralized data access, which violates Federated Learning principles.We propose a framework for Interpreting Federated Learning by Aggregating SHAP explanations, iFLASH, which introduces novel aggregation methods that significantly improve explanation quality compared to naive averaging approaches, while preserving data privacy in federated settings. iFLASH enables local Shap explainers on individual clients without exposing raw data by aggregating feature importance values rather than models or gradients. Clients compute and evaluate explanations using performance-based metrics, then send results to the server. The server weighs each client's contribution based on model performance and explanation quality, which aims to produce faithful aggregate explanations. The framework supports various aggregation strategies, adapting to different levels of data imbalance and heterogeneity. Experiments across cross-silo (12-16 clients) and cross-device (50-150 clients) scenarios demonstrate that Faithfulness-based aggregation consistently outperforms uniform averaging in cross-silo settings, EQ1 achieves higher Faithfulness than naive averaging in all the cross-silo configurations across all datasets and distributions, while quality-aware methods perform comparably in cross-device environments where the high number of clients provides sufficient averaging effect. iFLASH explanations align closely with centralised explanations in feature importance ranking and directionality, sometimes achieving better fidelity. Results demonstrate that iFLASH enables accurate, privacy-preserving explanations for domains where data cannot be centralised. We highlight that our proposal has been extensively evaluated also in a cross-device federated setting, a scenario that is overlooked in the explainable AI literature.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2026
			
	Settore Scientifico Disciplinare (validi fino a 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Settore Scientifico Disciplinare (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Titolo Rivista
	
				IEEE ACCESS
			
	DOI
	
				https://dx.doi.org/10.1109/ACCESS.2026.3696009
			
	Parole chiave
	
				Cross-device; cross-silo; explainable artificial intelligence; faithfulness; federated learning; trustworthy AI
			
	Progetti che finanziano la ricerca
	
	Titolo Progetto
	
									It takes two to tango: a synergistic approach to human-machine decision making
								
	Acronimo
	
									TANGO
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon Europe Framework Programme - HORIZON  Research and Innovation Actions
								
	N. Contratto
	
									101120763
								
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
FINAL Article.pdf accesso aperto Descrizione: Main Paper Tipologia: Published version Licenza: Creative Commons Dimensione 2.6 MB Formato Adobe PDF	2.6 MB	Adobe PDF
Appendix (3).pdf accesso aperto Descrizione: Appendix Tipologia: Altro materiale allegato Licenza: Creative Commons Dimensione 2.45 MB Formato Adobe PDF	2.45 MB	Adobe PDF