The "Collections as ML Data" checklist for machine learning and cultural heritage

Cited by: 8
Author
Lee, Benjamin Charles Germain [1, 2]
Affiliations
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA USA
[2] Univ Washington, Paul G Allen Sch Comp Sci & Engn, 185 East Stevens Way Northeast, Seattle, WA 98195 USA
DOI
10.1002/asi.24765
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Within cultural heritage, there has been a growing and concerted effort to consider a critical sociotechnical lens when applying machine learning techniques to digital collections. Though the cultural heritage community has collectively developed an emerging body of work detailing responsible operations for machine learning in galleries, museums, archives, and libraries at the organizational level, there remains a paucity of guidelines created for researchers embarking on machine learning projects with digital collections. The manifold stakes and sensitivities involved in applying machine learning to cultural heritage underscore the importance of developing such guidelines. This article contributes to this need by formulating a detailed checklist with guiding questions and practices that can be employed while developing a machine learning project that utilizes cultural heritage data. I call the resulting checklist the "Collections as ML Data" checklist, which, when completed, can be published with the deliverables of the project. By surveying existing projects, including my own project, Newspaper Navigator, I justify the "Collections as ML Data" checklist and demonstrate how the formulated guiding questions can be employed by researchers.
Pages: 375-396
Page count: 22