This paper analyzes the problems of incorrect disambiguation of entities in Wikidata items, both in general and focusing on items regarding humans. The problem of incorrect disambiguation is categorized into two types, i.e. conflations and duplications. The paper subsequently treats the causes of conflations and duplications, the methods available for detecting them, the solutions applicable to them and the issues that constitute an obstacle to the aforementioned solutions; three proposals are finally made to mitigate these issues.

Conflations and Duplications in Wikidata Items : Causes, Detection, Solutions, and Issues

Pellizzari di San Girolamo, Camillo Carlo
2024

Abstract

This paper analyzes the problems of incorrect disambiguation of entities in Wikidata items, both in general and focusing on items regarding humans. The problem of incorrect disambiguation is categorized into two types, i.e. conflations and duplications. The paper subsequently treats the causes of conflations and duplications, the methods available for detecting them, the solutions applicable to them and the issues that constitute an obstacle to the aforementioned solutions; three proposals are finally made to mitigate these issues.
2024
Settore INF/01 - Informatica
Settore M-STO/08 - Archivistica, Bibliografia e Biblioteconomia
The 4th Wikidata Workshop: Workshop for the scientific Wikidata community @ISWC 2023
Athens, Greece
13-11-2023
Wikidata 2023 : the 4th Wikidata Workshop : Proceedings of the Wikidata Workshop 2023 co-located with 22nd International Semantic Web Conference (ISWC 2023), Athens, Greece, November 13, 2023
Aachen University
Wikidata; entity management; authority control
File in questo prodotto:
File Dimensione Formato  
Conflations and duplications in Wikidata items.pdf

accesso aperto

Tipologia: Published version
Licenza: Creative Commons
Dimensione 908.82 kB
Formato Adobe PDF
908.82 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/138302
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact