Explaining short text classification with diverse synthetic exemplars and counter-exemplars

Lampridis, Orestis; State, Laura; Guidotti, Riccardo; Ruggieri, Salvatore

doi:10.1007/s10994-022-06150-7

We present xspells, a model-agnostic local approach for explaining the decisions of black box models in the classification of short texts. The explanations consist of a set of exemplar sentences and a set of counter-exemplar sentences. The former are examples that the black box classifies with the same label as the text to explain. The latter are examples classified with a different label (a form of counter-factuals). Both are close in meaning to the text to explain, and both are meaningful sentences, although they are synthetically generated. xspells generates neighbors of the text to explain in a latent space, using Variational Autoencoders to encode text and to decode latent instances. A decision tree is learned from randomly generated neighbors and is used to drive the selection of the exemplars and counter-exemplars. Moreover, the diversity of counter-exemplars is modeled as an optimization problem, solved by a greedy algorithm with a theoretical guarantee. We report experiments on three datasets showing that xspells outperforms the well-known lime method in terms of quality of explanations, fidelity, diversity, and usefulness, and that it is comparable to lime in terms of stability.
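The selection step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the Variational Autoencoder and the decision tree are omitted, `toy_black_box` is a hypothetical stand-in classifier on 2-D latent vectors, and diversity is approximated with a farthest-point greedy heuristic, which is an assumption rather than the objective or the algorithm used in the paper.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def greedy_diverse_subset(candidates, query, k):
    """Return indices of up to k candidates.

    The first pick is the candidate closest to the query; each later pick is
    the candidate whose minimum distance to the already chosen ones is largest
    (farthest-point greedy, an assumed stand-in for the paper's method).
    """
    if k <= 0 or not candidates:
        return []
    remaining = list(range(len(candidates)))
    first = min(remaining, key=lambda i: euclidean(candidates[i], query))
    chosen = [first]
    remaining.remove(first)
    while remaining and len(chosen) < k:
        nxt = max(
            remaining,
            key=lambda i: min(
                euclidean(candidates[i], candidates[j]) for j in chosen
            ),
        )
        chosen.append(nxt)
        remaining.remove(nxt)
    return chosen


def toy_black_box(z):
    """Hypothetical classifier on latent vectors (illustration only)."""
    return 1 if z[0] + z[1] > 0 else 0


def sample_neighbors(z, n, scale, rng):
    """Draw n Gaussian perturbations of the latent point z."""
    return [[zi + rng.gauss(0.0, scale) for zi in z] for _ in range(n)]


rng = random.Random(0)
query = [0.2, 0.1]                      # latent code of the text to explain
neighbors = sample_neighbors(query, 200, 1.0, rng)
label = toy_black_box(query)

# Same label as the query: exemplars; different label: counter-exemplars.
exemplars = [z for z in neighbors if toy_black_box(z) == label]
counter_exemplars = [z for z in neighbors if toy_black_box(z) != label]

picked = greedy_diverse_subset(counter_exemplars, query, 3)
```

In a full pipeline, each selected latent vector would be decoded back into a sentence before being shown to the user; here the indices in `picked` only identify which latent neighbors were chosen.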

Publication year: 2022
Scientific Disciplinary Sector (valid until 24/06/2024): INF/01 - Computer Science (Informatica)
Journal title: Machine Learning
DOI: https://dx.doi.org/10.1007/s10994-022-06150-7
Keywords: Counter-factuals; Explainable AI; Model-agnostic explanation; Short text classification; Synthetic exemplars; Decision trees; Text processing

Projects funding the research:

	Project title: Artificial Intelligence without Bias
	Acronym: NoBIAS
	Funder name: European Commission
	Funding: Horizon 2020 Framework Programme
	Contract no.: 860630

	Project title: SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics
	Acronym: SoBigData-PlusPlus
	Funder name: European Commission
	Funding: Horizon 2020 Framework Programme
	Contract no.: 871042

Appears in types: 1.1 Journal article

Files in this item:

File	Size	Format
lampridis_explaining_short_text_classification_with_diverse_synthetic_exemplars_and_counter-exemplars.pdf (open access; description: Explaining short text classification with diverse synthetic exemplars and counter-exemplars; type: Published version; license: Creative Commons)	1.84 MB	Adobe PDF