Datasheets for Digital Cultural Heritage Datasets

被引:1
|
作者
Alkemade, Henk [1 ,2 ]
Claeyssens, Steven [3 ]
Colavizza, Giovanni [4 ]
Freire, Nuno [5 ,6 ]
Lehmann, Joerg [7 ]
Neudecker, Clemens [1 ,7 ]
Osti, Giulia [8 ]
Van Strien, Daniel [9 ]
机构
[1] Europeana Network Assoc, EuropeanaTech Community, The Hague, Netherlands
[2] ARARE, Dublin, Ireland
[3] Natl Lib Netherlands, KB, The Hague, Netherlands
[4] Univ Bologna, Dept Class & Italian Philol, Bologna, Italy
[5] NOVA Univ Lisbon, Sch Social Sci & Humanities, Lisbon, Portugal
[6] Europeana Fdn, The Hague, Netherlands
[7] Staatsbibliothek Berlin Berlin State Lib, Berlin, Germany
[8] Univ Coll Dublin, Sch Informat & Commun Studies, Dublin, Ireland
[9] Hugging Face, Glasgow City, Scotland
关键词
datasheets; datasets; digital cultural heritage; model cards; machine learning; GLAM institutions;
D O I
10.5334/johd.124
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Sparked by issues of quality and lack of proper documentation for datasets, the machine learning community has begun developing standardised processes for establishing datasheets for machine learning datasets, with the intent to provide context and information on provenance, purposes, composition, the collection process, recommended uses or societal biases reflected in training datasets. This approach fits well with practices and procedures established in GLAM institutions, such as establishing collections' descriptions. However, digital cultural heritage datasets are marked by specific characteristics. They are often the product of multiple layers of selection; they may have been created for different purposes than establishing a statistical sample according to a specific research question; they change over time and are heterogeneous. Punctuated by a series of recommendations to create datasheets for digital cultural heritage, the paper addresses the scope and characteristics of digital cultural heritage datasets; possible metrics and measures; lessons from concepts similar to datasheets and/or established workflows in the cultural heritage sector. This paper includes a proposal for a datasheet template that has been adapted for use in cultural heritage institutions, and which proposes to incorporate information on the motivation and selection criteria, digitisation pipeline, data provenance, the use of linked open data, and version information.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [41] Sustainable Restoration of Cultural Heritage in the digital era
    Cinquepalmi, Federico
    Tiburcio, Virginia Adele
    [J]. VITRUVIO-INTERNATIONAL JOURNAL OF ARCHITECTURAL TECHNOLOGY AND SUSTAINABILITY, 2023, 8 (02): : 76 - 87
  • [42] Theorizing digital cultural heritage: A critical discourse
    Jones, Katherine Burton
    [J]. MUSEUM MANAGEMENT AND CURATORSHIP, 2008, 23 (04) : 403 - 405
  • [43] Conceptualisation and Institutionalisation of the Concept "Digital Cultural Heritage"
    Gorlova, Irina I.
    Zorin, Alexander L.
    Kryukov, Anatoly V.
    [J]. TOMSK STATE UNIVERSITY JOURNAL, 2019, (449): : 102 - 108
  • [44] Cultural Heritage Digital Libraries on Data Grids
    Calanducci, Antonio
    Sevilla, Jorge
    Barbera, Roberto
    Andronico, Giuseppe
    Saso, Monica
    De Filippo, Alessandro
    Iannizzotto, Stefania
    Vicinanza, Domenico
    De Mattia, Francesco
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2009, 5714 : 469 - +
  • [45] Towards a Global Infrastructure for Digital Cultural Heritage
    Povroznik, Nadezhda
    [J]. DIGITAL HERITAGE: PROGRESS IN CULTURAL HERITAGE: DOCUMENTATION, PRESERVATION, AND PROTECTION, EUROMED 2018, PT I, 2018, 11196 : 607 - 615
  • [46] Theorizing Digital Cultural Heritage: A Critical Discourse
    Bruton, Dean
    [J]. INFORMATION COMMUNICATION & SOCIETY, 2011, 14 (07) : 1077 - 1078
  • [47] Introduction: Cultural Heritage and Digital Scholarship in China
    Ruan, Lian J.
    Xia, Shengping
    [J]. LIBRARY TRENDS, 2023, 71 (03) : 339 - 344
  • [48] meSch – Material encounters with digital cultural Heritage
    Petrelli, Daniela
    Not, Elena
    Damala, Areti
    van Dijk, Dick
    Lechner, Monika
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8740 : 536 - 545
  • [49] Intellectual Property Protection of Digital Cultural Heritage
    Todorov, Todor
    Lutfiu, Shpend
    [J]. DIGITAL PRESENTATION AND PRESERVATION OF CULTURAL AND SCIENTIFIC HERITAGE, 2023, 13 : 263 - 268
  • [50] meSch - Material Encounters with Digital Cultural Heritage
    Petrelli, Daniela
    Not, Elena
    Damala, Areti
    van Dijk, Dick
    Lechner, Monika
    [J]. DIGITAL HERITAGE: PROGRESS IN CULTURAL HERITAGE: DOCUMENTATION, PRESERVATION, AND PROTECTION, 2014, 8740 : 536 - 545