T-norms driven loss functions for machine learning

Giannini, F.; Diligenti, M.; Maggini, M.; Gori, M.; Marra, G.

doi:10.1007/s10489-022-04383-6

Injecting prior knowledge into the learning process of a neural architecture is one of the main challenges currently faced by the artificial intelligence community, which also motivated the emergence of neural-symbolic models. One of the main advantages of these approaches is their capacity to learn competitive solutions with a significant reduction of the amount of supervised data. In this regard, a commonly adopted solution consists of representing the prior knowledge via first-order logic formulas, then relaxing the formulas into a set of differentiable constraints by using a t-norm fuzzy logic. This paper shows that this relaxation, together with the choice of the penalty terms enforcing the constraint satisfaction, can be unambiguously determined by the selection of a t-norm generator, providing numerical simplification properties and a tighter integration between the logic knowledge and the learning objective. When restricted to supervised learning, the presented theoretical framework provides a straight derivation of the popular cross-entropy loss, which has been shown to provide faster convergence and to reduce the vanishing gradient problem in very deep structures. However, the proposed learning formulation extends the advantages of the cross-entropy loss to the general knowledge that can be represented by neural-symbolic methods. In addition, the presented methodology allows the development of novel classes of loss functions, which are shown in the experimental results to lead to faster convergence rates than the approaches previously proposed in the literature.

T-norms driven loss functions for machine learning

Giannini F.;Diligenti M.;Maggini M.;Gori M.;Marra G.

2023

Abstract

Injecting prior knowledge into the learning process of a neural architecture is one of the main challenges currently faced by the artificial intelligence community, which also motivated the emergence of neural-symbolic models. One of the main advantages of these approaches is their capacity to learn competitive solutions with a significant reduction of the amount of supervised data. In this regard, a commonly adopted solution consists of representing the prior knowledge via first-order logic formulas, then relaxing the formulas into a set of differentiable constraints by using a t-norm fuzzy logic. This paper shows that this relaxation, together with the choice of the penalty terms enforcing the constraint satisfaction, can be unambiguously determined by the selection of a t-norm generator, providing numerical simplification properties and a tighter integration between the logic knowledge and the learning objective. When restricted to supervised learning, the presented theoretical framework provides a straight derivation of the popular cross-entropy loss, which has been shown to provide faster convergence and to reduce the vanishing gradient problem in very deep structures. However, the proposed learning formulation extends the advantages of the cross-entropy loss to the general knowledge that can be represented by neural-symbolic methods. In addition, the presented methodology allows the development of novel classes of loss functions, which are shown in the experimental results to lead to faster convergence rates than the approaches previously proposed in the literature.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2023
			
	Settore Scientifico Disciplinare (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Titolo Rivista
	
				APPLIED INTELLIGENCE
			
	DOI
	
				https://dx.doi.org/10.1007/s10489-022-04383-6
			
	Parole chiave
	
				Integration of logic and learning; Learning from constraints; Loss functions; Neural-symbolic integration; T-norm generators
			
	Progetti che finanziano la ricerca
	
	Titolo Progetto
	
									A European AI On Demand Platform and Ecosystem
								
	Acronimo
	
									AI4EU
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									825619
								
	Titolo Progetto
	
									Learning with Multiple Representations
								
	Acronimo
	
									LEMUR
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon Europe Framework Programme
								
	N. Contratto
	
									101073307
								
	Titolo Progetto
	
									Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization
								
	Acronimo
	
									TAILOR
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									952215
								
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
AI - T-norms driven loss functions for machine learning.pdf accesso aperto Tipologia: Published version Licenza: Creative Commons Dimensione 948.49 kB Formato Adobe PDF	948.49 kB	Adobe PDF