A multi-scale multi-level deep descriptor with saliency for image retrieval

被引:0
|
作者
Wu, Zebin [1 ]
Yu, Junqing [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Ctr Network & Computat, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Image retrieval; Convolutional Neural Network; Multi-scale multi-level image representation; OBJECT DETECTION;
D O I
10.1007/s11042-022-13658-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In image retrieval, deep features from the fully-connected layer of Convolutional Neural Network(CNN) lack the ability to discriminate similar images containing small objects or structures. To address this problem, this work propose a multi-scale image representation with multiple semantic levels. The framework of our GOS (Global-Object-Salient) descriptor has three streams forming multiple feature levels. The global stream extracts features from the whole image of multiple resolutions and can capture multi-scale details. In the object stream, an object detector is used to detect objects of different scales at different locations. Object-level patches are of rectangular shape and may just contain a part of the object, so the salient stream is integrated to capture the most salient part of the image. GOS can capture global-level,object-level and salient-level features simultaneously. Experiments show that the three streams work in a complementary way and GOS framework is capable of producing competitive retrieval accuracy on four public image retrieval datasets.Specifically,we achieve 0.939(mAP),3.91(score@4), 0.908(mAP) respectively on Paris6K, UKB and Holidays dataset.
引用
收藏
页码:37939 / 37958
页数:20
相关论文
共 50 条
  • [1] A multi-scale multi-level deep descriptor with saliency for image retrieval
    Zebin Wu
    Junqing Yu
    [J]. Multimedia Tools and Applications, 2023, 82 : 37939 - 37958
  • [2] Multi-level and multi-scale deep saliency network for salient object detection
    Zhang, Qing
    Lin, Jiajun
    Zhuge, Jingling
    Yuan, Wenhao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 415 - 424
  • [3] A multi-level descriptor using ultra-deep feature for image retrieval
    Wu, Zebin
    Yu, Junqing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25655 - 25672
  • [4] A multi-level descriptor using ultra-deep feature for image retrieval
    Zebin Wu
    Junqing Yu
    [J]. Multimedia Tools and Applications, 2019, 78 : 25655 - 25672
  • [5] Learning multi-level and multi-scale deep representations for privacy image classification
    Han, Yahui
    Huang, Yonggang
    Pan, Lei
    Zheng, Yunbo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (02) : 2259 - 2274
  • [6] Learning multi-level and multi-scale deep representations for privacy image classification
    Yahui Han
    Yonggang Huang
    Lei Pan
    Yunbo Zheng
    [J]. Multimedia Tools and Applications, 2022, 81 : 2259 - 2274
  • [7] Deep Multi-level Hashing Codes for Image Retrieval
    Dong, Zhenjiang
    Song, Ge
    Jia, Xia
    Tan, Xiaoyang
    [J]. INTELLIGENT VISUAL SURVEILLANCE (IVS 2016), 2016, 664 : 91 - 98
  • [8] Saliency detection via multi-level integration and multi-scale fusion neural networks
    Huang, Mengke
    Liu, Zhi
    Ye, Linwei
    Zhou, Xiaofei
    Wang, Yang
    [J]. NEUROCOMPUTING, 2019, 364 : 310 - 321
  • [9] Multi-scale and multi-level shape descriptor learning via a hybrid fusion network
    Huang, Xinwei
    Li, Nannan
    Xia, Qing
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    [J]. GRAPHICAL MODELS, 2022, 119
  • [10] A multi-scale layout descriptor based on Delaunay triangulation for image retrieval
    Borras Angosto, Agnes
    Llados Canet, Josep
    [J]. VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 139 - 144