Explaining digital humanities by aligning images and textual descriptions

被引：17

作者：

Cornia, Marcella ^{[1
]}

Stefanini, Matteo ^{[1
]}

Baraldi, Lorenzo ^{[1
]}

Corsini, Massimiliano ^{[1
]}

Cucchiara, Rita ^{[1
]}

机构：

[1] Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Via P Vivarelli 10, I-41125 Modena, Italy

来源：

PATTERN RECOGNITION LETTERS | 2020年 / 129卷

关键词：

Visual-semantic retrieval; Semi-supervised learning; Cultural heritage;

D O I：

10.1016/j.patrec.2019.11.018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Replicating the human ability to connect Vision and Language has recently been gaining a lot of attention in the Computer Vision and the Natural Language Processing communities. This research effort has resulted in algorithms that can retrieve images from textual descriptions and vice versa, when realistic images and sentences with simple semantics are employed and when paired training data is provided. In this paper, we go beyond these limitations and tackle the design of visual-semantic algorithms in the domain of the Digital Humanities. This setting not only advertises more complex visual and semantic structures but also features a significant lack of training data which makes the use of fully-supervised approaches infeasible. With this aim, we propose a joint visual-semantic embedding that can automatically align illustrations and textual elements without paired supervision. This is achieved by transferring the knowledge learned on ordinary visual-semantic datasets to the artistic domain. Experiments, performed on two datasets specifically designed for this domain, validate the proposed strategies and quantify the domain shift between natural images and artworks. (C) 2019 Elsevier B.V. All rights reserved.

引用

下载

页码：166 / 172

页数：7

共 50 条

[21] Aligning Spatial Perspective in Route Descriptions
Andonova, Elena
SPATIAL COGNITION VII, 2010, 6222 : 125 - 138
[22] Humanities Computing as Digital Humanities
Svensson, Patrik
DIGITAL HUMANITIES QUARTERLY, 2009, 3 (03):
[23] Engaging the humanities: the digital humanities
O'Donnell, James J.
DAEDALUS, 2009, 138 (01) : 99 - 104
[24] The digital humanities as a humanities project
Svensson, Patrik
ARTS AND HUMANITIES IN HIGHER EDUCATION, 2012, 11 (1-2) : 42 - 60
[25] Textual analysis with IRaMuTeQ of recent research in the History of mathematics education in Brazil: an example of Digital Humanities
Taise Hoffmann, Yohana
Bisset Alvarez, Edgar
Marti-Lahera, Yohannis
INVESTIGACION BIBLIOTECOLOGICA, 2020, 34 (84): : 103 - 133
[26] Aligning Images in theWild
Lin, Wen-Yan
Liu, Linlin
Matsushita, Yasuyuki
Low, Kok-Lim
Liu, Siying
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1 - 8
[27] A NOVEL TECHNIQUE TO ACQUIRE PERCEIVED UTILITY SCORES FROM TEXTUAL DESCRIPTIONS OF DISTORTED NATURAL IMAGES
Rouse, David M.
Wang, Yiran
Zhang, Fan
Hemami, Sheila S.
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2505 - 2508
[28] Digital Humanities
Janndis F.
Informatik-Spektrum, 2016, 39 (2) : 155 - 160
[29] Digital humanities
Castro, Celso, I
ESTUDOS HISTORICOS, 2020, 33 (69): : 1 - 2
[30] ALIGNING PICTORIAL DESCRIPTIONS - AN APPROACH TO OBJECT RECOGNITION
ULLMAN, S
COGNITION, 1989, 32 (03) : 193 - 254

← 1 2 3 4 5 →