Online Sensitivity Optimization in Differentially Private Learning

Galli, Filippo; Palamidessi, Catuscia; Cucinotta, Tommaso

doi:10.1609/aaai.v38i11.29099

Training differentially private machine learning models requires constraining an individual’s contribution to the optimization process. This is achieved by clipping the 2-norm of their gradient at a predetermined threshold prior to averaging and batch sanitization. This selection adversely influences optimization in two opposing ways: it either exacerbates the bias due to excessive clipping at lower values, or augments sanitization noise at higher values. The choice significantly hinges on factors such as the dataset, model architecture, and even varies within the same optimization, demanding meticulous tuning usually accomplished through a grid search. In order to circumvent the privacy expenses incurred in hyperparameter tuning, we present a novel approach to dynamically optimize the clipping threshold. We treat this threshold as an additional learnable parameter, establishing a clean relationship between the threshold and the cost function. This allows us to optimize the former with gradient descent, with minimal repercussions on the overall privacy analysis. Our method is thoroughly assessed against alternative fixed and adaptive strategies across diverse datasets, tasks, model dimensions, and privacy levels. Our results indicate that it performs comparably or better in the evaluated scenarios, given the same privacy requirements.

Online Sensitivity Optimization in Differentially Private Learning

Galli, Filippo;Palamidessi, Catuscia;Cucinotta, Tommaso

2024

Abstract

Training differentially private machine learning models requires constraining an individual’s contribution to the optimization process. This is achieved by clipping the 2-norm of their gradient at a predetermined threshold prior to averaging and batch sanitization. This selection adversely influences optimization in two opposing ways: it either exacerbates the bias due to excessive clipping at lower values, or augments sanitization noise at higher values. The choice significantly hinges on factors such as the dataset, model architecture, and even varies within the same optimization, demanding meticulous tuning usually accomplished through a grid search. In order to circumvent the privacy expenses incurred in hyperparameter tuning, we present a novel approach to dynamically optimize the clipping threshold. We treat this threshold as an additional learnable parameter, establishing a clean relationship between the threshold and the cost function. This allows us to optimize the former with gradient descent, with minimal repercussions on the overall privacy analysis. Our method is thoroughly assessed against alternative fixed and adaptive strategies across diverse datasets, tasks, model dimensions, and privacy levels. Our results indicate that it performs comparably or better in the evaluated scenarios, given the same privacy requirements.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Settore Scientifico Disciplinare (validi fino a 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Titolo del Convegno
	
				38th AAAI Conference on Artificial Intelligence
			
	Luogo del Convegno
	
				Vancouver, Canada
			
	Periodo del Convegno
	
				2024
			
	Titolo del Volume
	
				Proceedings of the AAAI Conference on Artificial Intelligence
			
	Editore
	
				Association for the Advancement of Artificial Intelligence
			
	ISBN
	
				978-1-57735-887-9
1-57735-887-2
			
	DOI
	
				https://dx.doi.org/10.1609/aaai.v38i11.29099
			
	Parole chiave
	
				Privacy; Optimization
			
	Progetti che finanziano la ricerca
	
	Titolo Progetto
	
									Privacy and Utility Allied
								
	Acronimo
	
									HYPATIA
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									835294
								
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno