A multi-scale multi-level deep descriptor with saliency for image retrieval

被引:0
|
作者
Wu, Zebin [1 ]
Yu, Junqing [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Ctr Network & Computat, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Image retrieval; Convolutional Neural Network; Multi-scale multi-level image representation; OBJECT DETECTION;
D O I
10.1007/s11042-022-13658-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In image retrieval, deep features from the fully-connected layer of Convolutional Neural Network(CNN) lack the ability to discriminate similar images containing small objects or structures. To address this problem, this work propose a multi-scale image representation with multiple semantic levels. The framework of our GOS (Global-Object-Salient) descriptor has three streams forming multiple feature levels. The global stream extracts features from the whole image of multiple resolutions and can capture multi-scale details. In the object stream, an object detector is used to detect objects of different scales at different locations. Object-level patches are of rectangular shape and may just contain a part of the object, so the salient stream is integrated to capture the most salient part of the image. GOS can capture global-level,object-level and salient-level features simultaneously. Experiments show that the three streams work in a complementary way and GOS framework is capable of producing competitive retrieval accuracy on four public image retrieval datasets.Specifically,we achieve 0.939(mAP),3.91(score@4), 0.908(mAP) respectively on Paris6K, UKB and Holidays dataset.
引用
收藏
页码:37939 / 37958
页数:20
相关论文
共 50 条
  • [21] Deep multi-level networks with multi-task learning for saliency detection
    Zhang, Lihe
    Fang, Xiang
    Bo, Hongguang
    Wang, Tiantian
    Lu, Huchuan
    [J]. NEUROCOMPUTING, 2018, 312 : 229 - 238
  • [22] Deep Multi-Scale Attention Hashing Network for Large-Scale Image Retrieval
    Feng, Hao
    Wang, Nian
    Tang, Jun
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (04): : 35 - 45
  • [23] An image multi-scale feature recognition method based on image saliency
    Yang, Chengzhi
    [J]. International Journal of Circuits, Systems and Signal Processing, 2021, 15 : 280 - 287
  • [24] Early Fusion of Multi-level Saliency Descriptors for Image Classification
    Fidalgo, E.
    Alegre, E.
    Fernandez-Robles, L.
    Gonzalez-Castro, V
    [J]. REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL, 2019, 16 (03): : 358 - 368
  • [25] Fast Image Matching Using Multi-level Texture Descriptor
    Ng, Hui-Fuang
    Lin, Chih-Yang
    Muindisi, Tatenda
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [26] Multi-focus image fusion using multi-scale image decomposition and saliency detection
    Bavirisetti, Durga Prasad
    Dhuli, Ravindra
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2018, 9 (04) : 1103 - 1117
  • [27] Image saliency detection via multi-scale iterative CNN
    Huang, Kun
    Gao, Shenghua
    [J]. VISUAL COMPUTER, 2020, 36 (07): : 1355 - 1367
  • [28] DeepAttent: Saliency Prediction with Deep Multi-scale Residual Network
    Dwivedi, Kshitij
    Singh, Nitin
    Shanmugham, Sabari R.
    Kumar, Manoj
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2, 2020, 1024 : 65 - 73
  • [29] Image saliency detection via multi-scale iterative CNN
    Kun Huang
    Shenghua Gao
    [J]. The Visual Computer, 2020, 36 : 1355 - 1367
  • [30] Deep Multi-Modal Metric Learning with Multi-Scale Correlation for Image-Text Retrieval
    Hua, Yan
    Yang, Yingyun
    Du, Jianhe
    [J]. ELECTRONICS, 2020, 9 (03)