Explaining digital humanities by aligning images and textual descriptions

被引:17
|
作者
Cornia, Marcella [1 ]
Stefanini, Matteo [1 ]
Baraldi, Lorenzo [1 ]
Corsini, Massimiliano [1 ]
Cucchiara, Rita [1 ]
机构
[1] Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Via P Vivarelli 10, I-41125 Modena, Italy
关键词
Visual-semantic retrieval; Semi-supervised learning; Cultural heritage;
D O I
10.1016/j.patrec.2019.11.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Replicating the human ability to connect Vision and Language has recently been gaining a lot of attention in the Computer Vision and the Natural Language Processing communities. This research effort has resulted in algorithms that can retrieve images from textual descriptions and vice versa, when realistic images and sentences with simple semantics are employed and when paired training data is provided. In this paper, we go beyond these limitations and tackle the design of visual-semantic algorithms in the domain of the Digital Humanities. This setting not only advertises more complex visual and semantic structures but also features a significant lack of training data which makes the use of fully-supervised approaches infeasible. With this aim, we propose a joint visual-semantic embedding that can automatically align illustrations and textual elements without paired supervision. This is achieved by transferring the knowledge learned on ordinary visual-semantic datasets to the artistic domain. Experiments, performed on two datasets specifically designed for this domain, validate the proposed strategies and quantify the domain shift between natural images and artworks. (C) 2019 Elsevier B.V. All rights reserved.
引用
下载
收藏
页码:166 / 172
页数:7
相关论文
共 50 条
  • [21] Aligning Spatial Perspective in Route Descriptions
    Andonova, Elena
    SPATIAL COGNITION VII, 2010, 6222 : 125 - 138
  • [22] Humanities Computing as Digital Humanities
    Svensson, Patrik
    DIGITAL HUMANITIES QUARTERLY, 2009, 3 (03):
  • [23] Engaging the humanities: the digital humanities
    O'Donnell, James J.
    DAEDALUS, 2009, 138 (01) : 99 - 104
  • [24] The digital humanities as a humanities project
    Svensson, Patrik
    ARTS AND HUMANITIES IN HIGHER EDUCATION, 2012, 11 (1-2) : 42 - 60
  • [25] Textual analysis with IRaMuTeQ of recent research in the History of mathematics education in Brazil: an example of Digital Humanities
    Taise Hoffmann, Yohana
    Bisset Alvarez, Edgar
    Marti-Lahera, Yohannis
    INVESTIGACION BIBLIOTECOLOGICA, 2020, 34 (84): : 103 - 133
  • [26] Aligning Images in theWild
    Lin, Wen-Yan
    Liu, Linlin
    Matsushita, Yasuyuki
    Low, Kok-Lim
    Liu, Siying
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1 - 8
  • [27] A NOVEL TECHNIQUE TO ACQUIRE PERCEIVED UTILITY SCORES FROM TEXTUAL DESCRIPTIONS OF DISTORTED NATURAL IMAGES
    Rouse, David M.
    Wang, Yiran
    Zhang, Fan
    Hemami, Sheila S.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2505 - 2508
  • [28] Digital Humanities
    Janndis F.
    Informatik-Spektrum, 2016, 39 (2) : 155 - 160
  • [29] Digital humanities
    Castro, Celso, I
    ESTUDOS HISTORICOS, 2020, 33 (69): : 1 - 2
  • [30] ALIGNING PICTORIAL DESCRIPTIONS - AN APPROACH TO OBJECT RECOGNITION
    ULLMAN, S
    COGNITION, 1989, 32 (03) : 193 - 254