One-Shot Object Detection in Heterogeneous Artwork Datasets

被引:1
|
作者
Madhu, Prathmesh [1 ]
Meyer, Anna [1 ]
Zinnen, Mathias [1 ]
Muhrenberg, Lara [2 ]
Suckow, Dirk [4 ]
Bendschus, Torsten [3 ]
Reinhardt, Corinna [3 ]
Bell, Peter [5 ]
Verstegen, Ute [2 ]
Kosti, Ronak [1 ]
Maier, Andreas [1 ]
Christlein, Vincent [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Pattern Recognit Lab, Erlangen, Germany
[2] Friedrich Alexander Univ Erlangen Nurnberg, Chair Christian Archaeol, Erlangen, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, Chair Class Archaeol, Erlangen, Germany
[4] Friedrich Alexander Univ Erlangen Nurnberg, Inst Art Hist, Erlangen, Germany
[5] Philipps Univ Marburg, German & Art Studies, Marburg, Germany
基金
欧盟地平线“2020”;
关键词
one-shot; object detection; digital humanities; computational humanities; data augmentation;
D O I
10.1109/IPTA54936.2022.9784141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Christian archeologists face many challenges in understanding visual narration through artwork images. This understanding is essential to access underlying semantic information. Therefore, narrative elements (objects) need to be labeled, compared, and contextualized by experts, which takes an enormous amount of time and effort. Our work aims to reduce labeling costs by using one-shot object detection to generate a labeled database from unannotated images. Novel object categories can be defined broadly and annotated using visual examples of narrative elements without training exclusively for such objects. In this work, we propose two ways of using contextual information as data augmentation to improve the detection performance. Furthermore, we introduce a multi-relation detector to our framework, which extracts global, local, and patch-based relations of the image. Additionally, we evaluate the use of contrastive learning. We use data from Christian archeology (CHA) and art history - IconArt-v2 (IA). Our context encoding approach improves the typical fine-tuning approach in terms of mean average precision (mAP) by about 3.5% (4 %) at 0.25 intersection over union (IoU) for UnSeen categories, and 6% (1.5%) for Seen categories in CHA (IA). To the best of our knowledge, our work is the first to explore few shot object detection on heterogeneous artistic data by investigating evaluation methods and data augmentation strategies. We will release the code and models after acceptance of the work.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] One-Shot Learning for Landmarks Detection
    Wang, Zihao
    Vandersteen, Clair
    Raffaelli, Charles
    Guevara, Nicolas
    Patou, Francois
    Delingette, Herve
    [J]. DEEP GENERATIVE MODELS, AND DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, 2021, 13003 : 163 - 172
  • [22] One-Shot Medical Landmark Detection
    Yao, Qingsong
    Quan, Quan
    Xiao, Li
    Zhou, S. Kevin
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 177 - 188
  • [23] DeepMark: One-Shot Clothing Detection
    Sidnev, Alexey
    Trushkov, Alexey
    Kazakov, Maxim
    Korolev, Ivan
    Sorokin, Vladislav
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3201 - 3204
  • [24] One-shot logo detection for large video datasets and live camera surveillance in criminal investigations
    Demertzis, Stefanos
    van Rooij, Sabina B.
    Lazaridis, Michalis
    Bouma, Henri
    Alvarez Fernandez, Manuel
    ten Hove, Johan-Martijn
    Sainz Mendez, Rodrigo
    Daras, Petros
    [J]. ARTIFICIAL INTELLIGENCE FOR SECURITY AND DEFENCE APPLICATIONS, 2023, 12742
  • [25] One-shot categorization of novel object classes in humans
    Morgenstern, Yaniv
    Schmidt, Filipp
    Fleming, Roland W.
    [J]. VISION RESEARCH, 2019, 165 : 98 - 108
  • [26] Object Detection and Classification on Heterogeneous Datasets
    Brosch, Tobias
    Elshaarany, Ahmed
    [J]. ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 307 - 312
  • [27] Deep Learning-Based Action Classification Using One-Shot Object Detection
    Yoo, Hyun
    Lee, Seo-El
    Chung, Kyungyong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 1343 - 1359
  • [28] One-Shot Multiple Object Tracking With Robust ID Preservation
    Lv, Weiyi
    Zhang, Ning
    Zhang, Junjie
    Zeng, Dan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4473 - 4488
  • [29] Make One-Shot Video Object Segmentation Efficient Again
    Meinhardt, Tim
    Leal-Taixe, Laura
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] A Bayesian approach to unsupervised one-shot learning of object categories
    Fei-Fei, L
    Fergus, R
    Perona, P
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1134 - 1141