Meaningful Explanations of Black Box AI Decision Systems

Pedreschi, Dino; Giannotti, Fosca; Guidotti, Riccardo; Monreale, Anna; Ruggieri, Salvatore; Turini, Franco

doi:10.1609/aaai.v33i01.33019780

Black box AI systems for automated decision making, often based on machine learning over (big) data, map a user’s features into a class or a score without exposing the reasons why. This is problematic not only for lack of transparency, but also for possible biases inherited by the algorithms from human prejudices and collection artifacts hidden in the training data, which may lead to unfair or wrong decisions. We focus on the urgent open challenge of how to construct meaningful explanations of opaque AI/ML systems, introducing the local-to-global framework for black box explanation, articulated along three lines: (i) the language for expressing explanations in terms of logic rules, with statistical and causal interpretation; (ii) the inference of local explanations for revealing the decision rationale for a specific case, by auditing the black box in the vicinity of the target instance; (iii) the bottom-up generalization of many local explanations into simple global ones, with algorithms that optimize for quality and comprehensibility. We argue that the local-first approach opens the door to a wide variety of alternative solutions along different dimensions: a variety of data sources (relational, text, images, etc.), a variety of learning problems (multi-label classification, regression, scoring, ranking), a variety of languages for expressing meaningful explanations, and a variety of means to audit a black box.
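Point (ii) of the abstract, auditing the black box in the vicinity of a target instance to obtain a rule-based local explanation, can be illustrated with a small sketch. This is not the authors' algorithm: the `black_box` function, the feature names, the Gaussian neighborhood, and the single-threshold rule (a decision stump) are all assumptions chosen only to keep the example self-contained.

```python
import random


def black_box(x):
    """Hypothetical opaque model: returns class 1 when a hidden score is high enough."""
    income, debt = x
    return 1 if income - 2.0 * debt > 10.0 else 0


def sample_neighborhood(x, n=500, scale=5.0, seed=0):
    """Draw synthetic instances around x (Gaussian perturbation is an assumption)."""
    rng = random.Random(seed)
    return [tuple(v + rng.gauss(0.0, scale) for v in x) for _ in range(n)]


def best_stump(points, labels):
    """Find the one-feature threshold rule that best mimics the black box labels.

    Returns (fidelity, feature index, threshold, class predicted above threshold).
    """
    best = None
    for j in range(len(points[0])):
        for t in sorted({p[j] for p in points}):
            for above in (0, 1):
                correct = sum(
                    (above if p[j] > t else 1 - above) == y
                    for p, y in zip(points, labels)
                )
                candidate = (correct / len(points), j, t, above)
                if best is None or candidate[0] > best[0]:
                    best = candidate
    return best


def explain(x, feature_names):
    """Query the black box near x and summarize its local behavior as a logic rule."""
    neighbors = sample_neighborhood(x)
    labels = [black_box(p) for p in neighbors]  # audit: only inputs and outputs are used
    fidelity, j, t, above = best_stump(neighbors, labels)
    rule = (
        f"IF {feature_names[j]} > {t:.2f} "
        f"THEN class={above} ELSE class={1 - above}"
    )
    return rule, fidelity


rule, fidelity = explain((30.0, 8.0), ("income", "debt"))
print(rule)
print(f"local fidelity: {fidelity:.2f}")
```

The reported fidelity is the fraction of sampled neighbors on which the rule agrees with the black box; a richer rule language or a smarter neighborhood generator would be natural refinements of this sketch.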

Year of publication: 2019
Scientific Disciplinary Sector (valid until 24/06/2024): Sector INF/01 - Computer Science
Conference title: AAAI Conference on Artificial Intelligence
Volume title: Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20
ISBN: 978-1-57735-809-1
DOI: https://dx.doi.org/10.1609/aaai.v33i01.33019780
Projects funding the research
Funding: Horizon 2020
Appears in types: 4.1 Contribution in conference proceedings

Files in this item:

File	Size	Format
5050-Article Text-8113-1-10-20190709 (4).pdf (open access; Type: Published version; License: Not specified)	186.04 kB	Adobe PDF