Anomaly Detection Through Unsupervised Federated Learning

Nardi, Mirko; Valerio, Lorenzo; Passarella, Andrea

doi:10.1109/msn57253.2022.00085

Federated learning (FL) is proving to be one of the most promising paradigms for leveraging distributed resources, enabling a set of clients to collaboratively train a machine learning model while keeping the data decentralized. The explosive growth of interest in the topic has led to rapid advancements in several core aspects, such as communication efficiency, handling of non-IID data, privacy, and security. However, most FL work deals only with supervised tasks, assuming that clients' training sets are labeled. To leverage the enormous amount of unlabeled data on distributed edge devices, in this paper we aim to extend the FL paradigm to unsupervised tasks by addressing the problem of anomaly detection (AD) in decentralized settings. In particular, we propose a novel method in which, through a preprocessing phase, clients are grouped into communities, each having similar majority (i.e., inlier) patterns. Subsequently, each community of clients trains the same anomaly detection model (i.e., an autoencoder) in a federated fashion. The resulting model is then shared and used to detect anomalies within the clients of the same community that joined the corresponding federated process. Experiments show that our method is robust and that it can detect communities consistent with the ideal partitioning, in which the groups of clients having the same inlier patterns are known. Furthermore, its performance is significantly better than that of models trained by clients exclusively on local data, and comparable with that of federated models trained on the ideal community partition.
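The two phases described in the abstract (grouping clients into communities, then training one autoencoder per community in a federated fashion) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' procedure: the abstract does not specify the grouping criterion, so `group_clients` uses a hypothetical stand-in (distance between per-client feature medians); the autoencoder is a tiny linear model; and aggregation is a FedAvg-style weighted average. All function names, parameters, and the synthetic data are hypothetical.

```python
import numpy as np


def group_clients(client_data, threshold):
    """Greedy grouping by distance between per-client feature medians.

    Hypothetical stand-in for the preprocessing phase: a client joins the
    first community whose reference median is closer than `threshold`.
    """
    centers, communities = [], []
    for idx, X in enumerate(client_data):
        m = np.median(X, axis=0)
        for c, center in enumerate(centers):
            if np.linalg.norm(m - center) < threshold:
                communities[c].append(idx)
                break
        else:  # no existing community is close enough: start a new one
            centers.append(m)
            communities.append([idx])
    return communities


def init_params(d, k, rng):
    """Small random weights for a linear autoencoder with a k-dim bottleneck."""
    return {"W1": 0.1 * rng.standard_normal((d, k)),
            "W2": 0.1 * rng.standard_normal((k, d)),
            "b": np.zeros(d)}


def anomaly_score(params, X):
    """Per-sample squared reconstruction error (higher = more anomalous)."""
    X_hat = X @ params["W1"] @ params["W2"] + params["b"]
    return np.sum((X_hat - X) ** 2, axis=1)


def local_update(params, X, lr=0.05, steps=10):
    """A few gradient-descent steps on the mean reconstruction error."""
    p = {name: value.copy() for name, value in params.items()}
    n = len(X)
    for _ in range(steps):
        Z = X @ p["W1"]                      # encode
        R = Z @ p["W2"] + p["b"] - X         # reconstruction residual
        g_w2 = 2.0 / n * Z.T @ R
        g_w1 = 2.0 / n * X.T @ (R @ p["W2"].T)
        g_b = 2.0 * R.mean(axis=0)
        p["W1"] -= lr * g_w1
        p["W2"] -= lr * g_w2
        p["b"] -= lr * g_b
    return p


def fed_avg(param_list, sizes):
    """FedAvg-style aggregation: average parameters weighted by sample count."""
    w = np.asarray(sizes, dtype=float) / np.sum(sizes)
    return {name: sum(wi * p[name] for wi, p in zip(w, param_list))
            for name in param_list[0]}


def train_community(client_data, members, d, k, rounds=50, seed=0):
    """Federated training of one shared autoencoder for one community."""
    params = init_params(d, k, np.random.default_rng(seed))
    for _ in range(rounds):
        updates = [local_update(params, client_data[i]) for i in members]
        params = fed_avg(updates, [len(client_data[i]) for i in members])
    return params


# Toy demo on synthetic data: two clients centred at 0, two centred at 1.
rng = np.random.default_rng(42)
clients = [rng.normal(0.0, 0.1, (50, 4)), rng.normal(0.0, 0.1, (60, 4)),
           rng.normal(1.0, 0.1, (50, 4)), rng.normal(1.0, 0.1, (40, 4))]
communities = group_clients(clients, threshold=0.5)
models = [train_community(clients, members, d=4, k=2) for members in communities]
scores = anomaly_score(models[0], clients[communities[0][0]])
```

In this sketch each community's model only ever sees data from its own members, and a client would flag as anomalous the samples whose reconstruction error under its community's model is unusually high. How that threshold is chosen is left open here, since the abstract does not say.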

	Year of publication
	
				2023
			
	Scientific Disciplinary Sector (valid until 24/06/2024)
	
				Sector INF/01 - Informatica (Computer Science)
			
	Conference title
	
				MSN 2022: The 18th International Conference on Mobility, Sensing and Networking Guangzhou, China, December 14-16, 2022
			
	Conference location
	
				Guangzhou, China
			
	Conference dates
	
				14/12/2022 - 16/12/2022
			
	Volume title
	
				2022 18th International Conference on Mobility, Sensing and Networking (MSN) : MSN 2022 : 14-16 December 2022, Guangzhou, China : proceedings (Englisch)
			
	Publisher
	
				IEEE Computer Society
			
	ISBN
	
				978-1-6654-6457-4
				978-1-6654-6458-1
			
	DOI
	
				https://dx.doi.org/10.1109/msn57253.2022.00085
			
	Keywords
	
				Federated learning; anomaly detection
			
	Projects funding the research
	
	Project title
	
									SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics
								
	Acronym
	
									SoBigData-PlusPlus
								
	Funder name
	
										European Commission
									
	Funding programme
	
									Horizon 2020 Framework Programme
								
	Contract no.
	
									871042
								
	Project title
	
									Multimodal Extreme Scale Data Analytics for Smart Cities Environments
								
	Acronym
	
									MARVEL
								
	Funder name
	
										European Commission
									
	Funding programme
	
									Horizon 2020 Framework Programme
								
	Contract no.
	
									957337
								
	Project title
	
									HumanE AI Network
								
	Acronym
	
									HumanE-AI-Net
								
	Funder name
	
										European Commission
									
	Funding programme
	
									Horizon 2020 Framework Programme
								
	Contract no.
	
									952026
								
	Project title
	
									Social Explainable Artificial Intelligence
								
	Acronym
	
									SAI
								
	Funder name
	
										CHIST-ERA
									
	Contract no.
	
									CHIST-ERA-19-XAI-010
								
	Appears in the following types:
	
				4.1 Contribution in conference proceedings

Files in this product:

File	Size	Format
2209.04184v1.pdf (open access; type: Accepted version (post-print); license: read-only)	278.78 kB	Adobe PDF