Explainable authorship identification in cultural heritage applications

Setzu, Mattia; Corbara, Silvia; Monreale, Anna; Moreo, Alejandro; Sebastiani, Fabrizio

doi:10.1145/3654675

While a substantial amount of work has recently been devoted to improving the accuracy of computational Authorship Identification (AId) systems for textual data, little to no attention has been paid to endowing AId systems with the ability to explain the reasons behind their predictions. This substantially hinders the practical application of AId methods, since the predictions returned by such systems are hardly useful unless they are supported by suitable explanations. In this article, we explore the applicability of existing general-purpose eXplainable Artificial Intelligence (XAI) techniques to AId, with a focus on explanations addressed to scholars working in cultural heritage. In particular, we assess the relative merits of three different types of XAI techniques (feature ranking, probing, factual and counterfactual selection) on three different AId tasks (authorship attribution, authorship verification and same-authorship verification) by running experiments on real AId textual data. Our analysis shows that, while these techniques make important first steps towards XAI, more work remains to be done to provide tools that can be profitably integrated into the workflows of scholars.

Explainable authorship identification in cultural heritage applications

Setzu, Mattia;Corbara, Silvia;Monreale, Anna;Moreo, Alejandro;Sebastiani, Fabrizio

2024

Abstract

While a substantial amount of work has recently been devoted to improving the accuracy of computational Authorship Identification (AId) systems for textual data, little to no attention has been paid to endowing AId systems with the ability to explain the reasons behind their predictions. This substantially hinders the practical application of AId methods, since the predictions returned by such systems are hardly useful unless they are supported by suitable explanations. In this article, we explore the applicability of existing general-purpose eXplainable Artificial Intelligence (XAI) techniques to AId, with a focus on explanations addressed to scholars working in cultural heritage. In particular, we assess the relative merits of three different types of XAI techniques (feature ranking, probing, factual and counterfactual selection) on three different AId tasks (authorship attribution, authorship verification and same-authorship verification) by running experiments on real AId textual data. Our analysis shows that, while these techniques make important first steps towards XAI, more work remains to be done to provide tools that can be profitably integrated into the workflows of scholars.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Settore Scientifico Disciplinare (validi fino a 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Titolo Rivista
	
				ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE
			
	DOI
	
				https://dx.doi.org/10.1145/3654675
			
	Parole chiave
	
				Explainable Artificial Intelligence; Cultural Heritage; Authorship Identification
			
	Progetti che finanziano la ricerca
	
	Titolo Progetto
	
									SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics
								
	Acronimo
	
									SoBigData-PlusPlus
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									871042
								
	Titolo Progetto
	
									A European Excellence Centre for Media, Society and Democracy
								
	Acronimo
	
									AI4Media
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									951911
								
	Titolo Progetto
	
									Spoke 1 “Human-centered AI” project
								
	Acronimo
	
									FAIR
								
	Nome finanziatore
	
										MUR
									
	Finanziamento
	
									PNRR - M4C2
								
	N. Contratto
	
									PE00000013
								
	Titolo Progetto
	
									Science and technology for the explanation of AI decision making
								
	Acronimo
	
									XAI
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									834756
								
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Explainable_Authorship_Identification_in_Cultural_Heritage_Applications_final+.pdf accesso aperto Tipologia: Accepted version (post-print) Licenza: Solo Lettura Dimensione 1.25 MB Formato Adobe PDF	1.25 MB	Adobe PDF