An Overview of Recent Approaches to Enable Diversity in Large Language Models through Aligning with Human Perspectives

The varied backgrounds and experiences of human annotators inject different opinions and potential biases into the data, inevitably leading to disagreements. Yet, traditional aggregation methods fail to capture individual judgments since they rely on the notion of a single ground truth. Our aim is to review prior contributions to pinpoint the shortcomings that might cause stereotypical content generation. As a preliminary study, our purpose is to investigate state-of-the-art approaches, primarily focusing on the following two research directions. First, we investigate how adding subjectivity aspects to LLMs might guarantee diversity. We then look into the alignment between humans and LLMs and discuss how to measure it. Considering existing gaps, our review explores possible methods to mitigate the perpetuation of biases targeting specific communities. However, we recognize the potential risk of disseminating sensitive information due to the utilization of socio-demographic data in the training process. These considerations underscore the inclusion of diverse perspectives while taking into account the critical importance of implementing robust safeguards to protect individuals’ privacy and prevent the inadvertent propagation of sensitive information.

An Overview of Recent Approaches to Enable Diversity in Large Language Models through Aligning with Human Perspectives

Muscato, Benedetta;Mala, Chandana Sree;Manerba M. M.;Gezici, Gizem;Giannotti, Fosca

2024

Abstract

The varied backgrounds and experiences of human annotators inject different opinions and potential biases into the data, inevitably leading to disagreements. Yet, traditional aggregation methods fail to capture individual judgments since they rely on the notion of a single ground truth. Our aim is to review prior contributions to pinpoint the shortcomings that might cause stereotypical content generation. As a preliminary study, our purpose is to investigate state-of-the-art approaches, primarily focusing on the following two research directions. First, we investigate how adding subjectivity aspects to LLMs might guarantee diversity. We then look into the alignment between humans and LLMs and discuss how to measure it. Considering existing gaps, our review explores possible methods to mitigate the perpetuation of biases targeting specific communities. However, we recognize the potential risk of disseminating sensitive information due to the utilization of socio-demographic data in the training process. These considerations underscore the inclusion of diverse perspectives while taking into account the critical importance of implementing robust safeguards to protect individuals’ privacy and prevent the inadvertent propagation of sensitive information.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Settore Scientifico Disciplinare (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Titolo del Convegno
	
				3rd Workshop on Perspectivist Approaches to NLP, NLPerspectives 2024
			
	Luogo del Convegno
	
				ita
			
	Periodo del Convegno
	
				2024
			
	Titolo del Volume
	
				In Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives)@ LREC-COLING 2024
			
	Editore
	
				European Language Resources Association (ELRA)
			
	ISBN
	
				9782493814234
			
	Parole chiave
	
				Bias; Diversity; Human Annotation; Minority Groups; Perspectivism; Text Generation;
			
	Progetti che finanziano la ricerca
	
	Titolo Progetto
	
									PNRR Partenariati Estesi - FAIR - Future artificial intelligence research.
								
	Nome finanziatore
	
										Ministero della pubblica istruzione, dell'università e della ricerca
									
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
An Overview of Recent Approaches to Enable Diversity in LLMs.pdf accesso aperto Tipologia: Published version Licenza: Creative Commons Dimensione 448.35 kB Formato Adobe PDF	448.35 kB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/149509

Citazioni

ND

1

ND

ND

social impact