The representation of Italian libraries in Wikidata has grown through two major data imports. In 2020, under commission from Tuscany region, the Sistema Cultura Toscana dataset was uploaded, raising the number of Italian libraries from fewer than 500 to 1,322 and documenting the methodology of this first large-scale project. A second step followed in 2022 with the addition of 11,239 entries from the ICCU Italian Libraries Database. This process involved merging CC0 datasets, entity alignment, and addressing gaps between the web versions of databases and their dumps. Together, these efforts illustrate both achievements and challenges in enriching Wikidata’s coverage of Italian libraries, specifically highlighting the role of iterative, human-curated workflows in large-scale data imports.

Documenting Italian Libraries on Wikidata: From Local Projects to a Multilayered National Knowledge Graph

Marchetti, Alessandro;Pellizzari di San Girolamo, Camillo Carlo
2026

Abstract

The representation of Italian libraries in Wikidata has grown through two major data imports. In 2020, under commission from Tuscany region, the Sistema Cultura Toscana dataset was uploaded, raising the number of Italian libraries from fewer than 500 to 1,322 and documenting the methodology of this first large-scale project. A second step followed in 2022 with the addition of 11,239 entries from the ICCU Italian Libraries Database. This process involved merging CC0 datasets, entity alignment, and addressing gaps between the web versions of databases and their dumps. Together, these efforts illustrate both achievements and challenges in enriching Wikidata’s coverage of Italian libraries, specifically highlighting the role of iterative, human-curated workflows in large-scale data imports.
2026
Settore M-STO/08 - Archivistica, Bibliografia e Biblioteconomia
Settore INF/01 - Informatica
Settore HIST-04/C - Archivistica, bibliografia e biblioteconomia
Settore INFO-01/A - Informatica
Italian Libraries Database; Wikidata import; OpenRefine
File in questo prodotto:
File Dimensione Formato  
Documenting Italian Libraries on Wikidata.pdf

accesso aperto

Tipologia: Published version
Licenza: Creative Commons
Dimensione 4.41 MB
Formato Adobe PDF
4.41 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/161744
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact