Local Rule-Based Explanations of Black Box Decision Systems

Guidotti, Riccardo; Monreale, Anna; Ruggieri, Salvatore; Pedreschi, Dino; Turini, Franco; Giannotti, Fosca

The recent years have witnessed the rise of accurate but obscure decision systems which hide the logic of their internal decision processes to the users. The lack of explanations for the decisions of black box systems is a key ethical issue, and a limitation to the adoption of machine learning components in socially sensitive and safety-critical contexts. %Therefore, we need explanations that reveals the reasons why a predictor takes a certain decision. In this paper we focus on the problem of black box outcome explanation, i.e., explaining the reasons of the decision taken on a specific instance. We propose LORE, an agnostic method able to provide interpretable and faithful explanations. LORE first leans a local interpretable predictor on a synthetic neighborhood generated by a genetic algorithm. Then it derives from the logic of the local interpretable predictor a meaningful explanation consisting of: a decision rule, which explains the reasons of the decision; and a set of counterfactual rules, suggesting the changes in the instance's features that lead to a different outcome. Wide experiments show that LORE outperforms existing methods and baselines both in the quality of explanations and in the accuracy in mimicking the black box.

Local Rule-Based Explanations of Black Box Decision Systems

Guidotti, Riccardo;Monreale, Anna;Ruggieri, Salvatore;Pedreschi, Dino;Turini, Franco;Giannotti, Fosca

2018

Abstract

The recent years have witnessed the rise of accurate but obscure decision systems which hide the logic of their internal decision processes to the users. The lack of explanations for the decisions of black box systems is a key ethical issue, and a limitation to the adoption of machine learning components in socially sensitive and safety-critical contexts. %Therefore, we need explanations that reveals the reasons why a predictor takes a certain decision. In this paper we focus on the problem of black box outcome explanation, i.e., explaining the reasons of the decision taken on a specific instance. We propose LORE, an agnostic method able to provide interpretable and faithful explanations. LORE first leans a local interpretable predictor on a synthetic neighborhood generated by a genetic algorithm. Then it derives from the logic of the local interpretable predictor a meaningful explanation consisting of: a decision rule, which explains the reasons of the decision; and a set of counterfactual rules, suggesting the changes in the instance's features that lead to a different outcome. Wide experiments show that LORE outperforms existing methods and baselines both in the quality of explanations and in the accuracy in mimicking the black box.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2018
			
	Settore Scientifico Disciplinare (validi fino a 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Parole chiave
	
				Computer Science - Artificial Intelligence; Computer Science - Artificial Intelligence
			
	Appare nelle tipologie:
	
				5.12 Altro

File in questo prodotto:

File	Dimensione	Formato
1805.10820.pdf accesso aperto Descrizione: Local Rule-Based Explanations of Black Box Decision Systems Tipologia: Submitted version (pre-print) Licenza: Creative Commons Dimensione 1.81 MB Formato Adobe PDF	1.81 MB	Adobe PDF