Understanding and Exploiting the Latent Space to improve Machine Learning models eXplainability / Bodria, Francesco; supervisor: GIANNOTTI, Fosca; external supervisor: Pedreschi, Dino; Scuola Normale Superiore, cycle 35, 07-Sep-2023.

Understanding and Exploiting the Latent Space to improve Machine Learning models eXplainability

BODRIA, Francesco
2023

Abstract

In recent years, Artificial Intelligence (AI) and Machine Learning (ML) systems have dramatically increased their capabilities, achieving human-like or even human-superior performance in specific tasks. This increased performance has gone hand in hand with an increase in the complexity of AI and ML models, compromising their transparency and trustworthiness and turning them into inscrutable black boxes for decision making. Explainable AI (XAI) is a field that seeks to make the decisions suggested by ML models more transparent to human users by providing different types of explanations. This thesis explores the possibility of using a reduced feature space called the "latent space", produced by a particular kind of ML model, as a medium for the explanation process. First, we study the possibility of navigating the latent space as a form of interactive explanation, to better understand the rationale behind the model's predictions. Second, we propose an interpretable-by-design approach that makes the explanation process completely transparent to the user. Third, we exploit two mathematical properties of the latent space of certain ML models, similarity and linearity, to produce explanations that are shown to be more plausible and accurate than those of existing state-of-the-art competitors. To validate our approach, we perform extensive benchmarking on different datasets, with respect to both existing metrics and new ones introduced in our work to highlight XAI problems beyond the current literature.
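To make the third contribution concrete, the sketch below illustrates how the two latent-space properties named in the abstract could be exploited: similarity (the nearest neighbours of an instance's latent code serve as example-based explanations) and linearity (a straight-line walk in latent space toward a differently-classified neighbour, decoded step by step, yields a counterfactual). This is a minimal sketch under assumed names: the encoder/decoder callables, the black-box predict function, and the helper names are illustrative, not the thesis's actual API.

    # Minimal sketch (assumed names, not the thesis API): exploiting
    # latent-space similarity and linearity for explanations.
    import numpy as np
    from sklearn.neighbors import NearestNeighbors

    def nearest_latent_examples(z_query, Z_train, k=5):
        # Similarity: the k training codes closest to the query's code
        # act as example-based explanations of the prediction.
        nn = NearestNeighbors(n_neighbors=k).fit(Z_train)
        _, idx = nn.kneighbors(z_query.reshape(1, -1))
        return idx[0]

    def linear_counterfactual(z_query, z_target, decode, predict, steps=20):
        # Linearity: walk the segment from the query's code toward a
        # differently-classified neighbour's code, decode each point,
        # and return the first decoded instance whose prediction flips.
        original = predict(decode(z_query))
        for alpha in np.linspace(0.0, 1.0, steps):
            z = (1.0 - alpha) * z_query + alpha * z_target
            x = decode(z)
            if predict(x) != original:
                return x, alpha  # latent-linear counterfactual
        return None, None

Here decode and predict are passed in as plain callables, so the sketch stays agnostic to the specific autoencoder and black-box model being explained.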
Defence date: 7 September 2023
Academic field: INF/01 - Computer Science
PhD programme: Data Science
Cycle: 35
Keywords: eXplainable Artificial Intelligence; XAI; Machine Learning; Latent Space
Institution: Scuola Normale Superiore
Supervisors: GIANNOTTI, Fosca; Pedreschi, Dino (external); Guidotti, Riccardo
Files in this record:

File: Bodria_Tesi.pdf (open access)
Type: PhD thesis
License: Read-only
Size: 14.77 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11384/136464