The "Collections as ML Data" checklist for machine learning and cultural heritage

被引:8
|
作者
Lee, Benjamin Charles Germain [1 ,2 ]
机构
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA USA
[2] Univ Washington, Paul GAllen Sch Comp Sci & Engn, 185 East Stevens Way Northeast, Seattle, WA 98195 USA
关键词
D O I
10.1002/asi.24765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Within cultural heritage, there has been a growing and concerted effort to consider a critical sociotechnical lens when applying machine learning techniques to digital collections. Though the cultural heritage community has collectively developed an emerging body of work detailing responsible operations for machine learning in galleries, museums, archives, and libraries at the organizational level, there remains a paucity of guidelines created for researchers embarking on machine learning projects with digital collections. The manifold stakes and sensitivities involved in applying machine learning to cultural heritage underscore the importance of developing such guidelines. This article contributes to this need by formulating a detailed checklist with guiding questions and practices that can be employed while developing a machine learning project that utilizes cultural heritage data. I call the resulting checklist the "Collections as ML Data" checklist, which, when completed, can be published with the deliverables of the project. By surveying existing projects, including my own project, Newspaper Navigator, I justify the "Collections as ML Data" checklist and demonstrate how the formulated guiding questions can be employed by researchers.
引用
收藏
页码:375 / 396
页数:22
相关论文
共 50 条
  • [31] Evaluating the success of vocabulary reconciliation for cultural heritage collections
    van Hooland, Seth
    Verborgh, Ruben
    De Wilde, Max
    Hercher, Johannes
    Mannens, Erik
    Van de Walle, Rik
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2013, 64 (03): : 464 - 479
  • [32] Restructuring Cultural Heritage Collections in the Basic Formal Ontology
    Zou, Qing
    Park, Eun G.
    2015 DIGITAL HERITAGE INTERNATIONAL CONGRESS, VOL 2: ANALYSIS & INTERPRETATION THEORY, METHODOLOGIES, PRESERVATION & STANDARDS DIGITAL HERITAGE PROJECTS & APPLICATIONS, 2015, : 483 - 484
  • [33] CULTURAL HERITAGE OF THE KHANTY IN KUNSTKAMMER COLLECTIONS: ECONOMIC ACTIVITY
    Nadezhda, Lukina V.
    TOMSK STATE UNIVERSITY JOURNAL, 2014, (387): : 91 - 97
  • [34] Exploring entity recognition and disambiguation for cultural heritage collections
    van Hooland, Seth
    De Wilde, Max
    Verborgh, Ruben
    Steiner, Thomas
    Van de Walle, Rik
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2015, 30 (02) : 262 - 279
  • [35] Cultural Heritage as "Shared Heritage"? (I) Colonial Collections and the Future of a European Idea
    Thiemeyer, Thomas
    MERKUR-DEUTSCHE ZEITSCHRIFT FUR EUROPAISCHES DENKEN, 2018, 72 (829): : 30 - +
  • [36] MACHINE LEARNING FOR THE DOCUMENTATION, PREDICTION, AND AUGMENTATION OF HERITAGE STRUCTURE DATA
    Rihal, Satwant
    Assal, Hisham
    29TH CIPA SYMPOSIUM DOCUMENTING, UNDERSTANDING, PRESERVING CULTURAL HERITAGE. HUMANITIES AND DIGITAL TECHNOLOGIES FOR SHAPING THE FUTURE, VOL. 48-M-2, 2023, : 1301 - 1307
  • [37] An Approach based on Machine Learning Algorithms for the Recommendation of Scientific Cultural Heritage Objects
    Nafis, Fouad
    Al Fararni, Khalid
    Yahyaouy, Ali
    Aghoutane, Badraddine
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 230 - 238
  • [38] Processing Historical Film Footage with Photogrammetry and Machine Learning for Cultural Heritage Documentation
    Condorelli, Francesca
    Rinaudo, Fulvio
    SUMAC'19: PROCEEDINGS OF THE 1ST WORKSHOP ON STRUCTURING AND UNDERSTANDING OF MULTIMEDIA HERITAGE CONTENTS, 2019, : 39 - 46
  • [39] Visitor assistant tools based on Machine learning approaches in Cultural Heritage contexts
    Cuomo, Salvatore
    Chirico, Ugo
    2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 485 - 489
  • [40] Data-Driven Metabolic Subgrouping of Obesity by Machine Learning (ML)
    Lin, Ziwei
    Hui, You
    Wu, Shandong
    Qu, Shen
    DIABETES, 2020, 69