A deep one-shot network for query-based logo retrieval

被引:15
|
作者
Bhunia, Ayan Kumar [1 ]
Bhunia, Ankan Kumar [2 ]
Ghose, Shuvozit [3 ]
Das, Abhirup [3 ]
Roy, Partha Pratim [4 ]
Pal, Umapada [5 ]
机构
[1] Univ Surrey, Guildford, Surrey, England
[2] Jadavpur Univ, Kolkata, W Bengal, India
[3] Inst Engn & Management, Kolkata, India
[4] Indian Inst Technol Roorkee, Roorkee, Uttar Pradesh, India
[5] Indian Stat Inst, Hyderabad, Telangana, India
关键词
Logo retrieval; One-shot learning; Multi-scale conditioning; Similarity matching; Query retrieval; IMAGE; IDENTIFICATION;
D O I
10.1016/j.patcog.2019.106965
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Logo detection in real-world scene images is an important problem with applications in advertisement and marketing. Existing general-purpose object detection methods require large training data with annotations for every logo class. These methods do not satisfy the incremental demand of logo classes necessary for practical deployment since it is practically impossible to have such annotated data for new unseen logo. In this work, we develop an easy-to-implement query-based logo detection and localization system by employing a one-shot learning technique using off the shelf neural network components. Given an image of a query logo, our model searches for logo within a given target image and predicts the possible location of the logo by estimating a binary segmentation mask. The proposed model consists of a conditional branch and a segmentation branch. The former gives a conditional latent representation of the given query logo which is combined with feature maps of the segmentation branch at multiple scales in order to obtain the matching location of the query logo in a target image. Feature matching between the latent query representation and multi-scale feature maps of segmentation branch using simple concatenation operation followed by 1 x 1 convolution layer makes our model scale-invariant. Despite its simplicity, our query-based logo retrieval framework achieved superior performance in FlickrLogos-32 and TopLogos-10 dataset over different existing baseline methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] One-shot logo detection for large video datasets and live camera surveillance in criminal investigations
    Demertzis, Stefanos
    van Rooij, Sabina B.
    Lazaridis, Michalis
    Bouma, Henri
    Alvarez Fernandez, Manuel
    ten Hove, Johan-Martijn
    Sainz Mendez, Rodrigo
    Daras, Petros
    ARTIFICIAL INTELLIGENCE FOR SECURITY AND DEFENCE APPLICATIONS, 2023, 12742
  • [42] Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions
    Peng, Kunyu
    Roitberg, Alina
    Yang, Kailun
    Zhang, Jiaming
    Stiefelhagen, Rainer
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1489 - 1504
  • [43] Query-based Policy Literature Retrieval Method Based on Semi-Supervised Framework
    Wei, Moji
    Guo, Yanyan
    Li, Chen
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 28 - 32
  • [44] One-Shot Phase Retrieval Method for Interferometry Using a Multi-Stage Phase-Shifting Network
    Zhao, Yan
    Hu, Ke
    Liu, Fengwei
    IEEE PHOTONICS TECHNOLOGY LETTERS, 2023, 35 (10) : 577 - 580
  • [45] SPIDERnet: ATTENTION NETWORK FOR ONE-SHOT ANOMALY DETECTION IN SOUNDS
    Koizumi, Yuma
    Yasuda, Masahiro
    Murata, Shin
    Saito, Shoichim
    Uematsu, Hisashi
    Harada, Noboru
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 281 - 285
  • [46] One-shot Action Localization by Learning Sequence Matching Network
    Yang, Hongtao
    He, Xuming
    Porikli, Fatih
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1450 - 1459
  • [47] One-Shot Reachability Analysis of Neural Network Dynamical Systems
    Chen, Shaoru
    Preciado, Victor M.
    Fazlyab, Mahyar
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10546 - 10552
  • [48] Column Generation Approach for One-Shot Virtual Network Embedding
    Jarray, Abdallah
    Karmouch, Ahmed
    2012 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2012, : 863 - 868
  • [49] Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
    Zhang, Zhu
    Lin, Zhijie
    Zhao, Zhou
    Xiao, Zhenxin
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 655 - 664
  • [50] Bilateral guidance network for one-shot metal defect segmentation
    Shan, Dexing
    Zhang, Yunzhou
    Liu, Xiaozheng
    Zhao, Jiaqi
    Coleman, Sonya
    Kerr, Dermot
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131