This paper analyzes the problems of incorrect disambiguation of entities in Wikidata items, both in general and focusing on items regarding humans. The problem of incorrect disambiguation is categorized into two types, i.e. conflations and duplications. The paper subsequently treats the causes of conflations and duplications, the methods available for detecting them, the solutions applicable to them and the issues that constitute an obstacle to the aforementioned solutions; three proposals are finally made to mitigate these issues.
Conflations and Duplications in Wikidata Items : Causes, Detection, Solutions, and Issues
Pellizzari di San Girolamo, Camillo Carlo
2024
Abstract
This paper analyzes the problems of incorrect disambiguation of entities in Wikidata items, both in general and focusing on items regarding humans. The problem of incorrect disambiguation is categorized into two types, i.e. conflations and duplications. The paper subsequently treats the causes of conflations and duplications, the methods available for detecting them, the solutions applicable to them and the issues that constitute an obstacle to the aforementioned solutions; three proposals are finally made to mitigate these issues.File in questo prodotto:
File | Dimensione | Formato | |
---|---|---|---|
Conflations and duplications in Wikidata items.pdf
accesso aperto
Tipologia:
Published version
Licenza:
Creative Commons
Dimensione
908.82 kB
Formato
Adobe PDF
|
908.82 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.