Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval

被引:2
|
作者
Zhu, Yunquan [1 ]
Gao, Xinkai [1 ]
Ke, Bo [1 ]
Qiao, Ruizhi [1 ]
Sun, Xing [1 ]
机构
[1] Tencent, YouTu Lab, Shenzhen, Peoples R China
关键词
DESCRIPTORS; MODEL;
D O I
10.1109/ICCV51070.2023.01034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image retrieval targets to find images from a database that are visually similar to the query image. Two-stage methods following retrieve-and-rerank paradigm have achieved excellent performance, but their separate local and global modules are inefficient to real-world applications. To better trade-off retrieval efficiency and accuracy, some approaches fuse global and local feature into a joint representation to perform single-stage image retrieval. However, they are still challenging due to various situations to tackle, e.g., background, occlusion and viewpoint. In this work, we design a Coarse-to-Fine framework to learn Compact Discriminative representation (CFCD) for end-to-end single- stage image retrieval-requiring only imagelevel labels. Specifically, we first design a novel adaptive softmax-based loss which dynamically tunes its scale and margin within each mini-batch and increases them progressively to strengthen supervision during training and intraclass compactness. Furthermore, we propose a mechanism which attentively selects prominent local descriptors and infuse fine-grained semantic relations into the global representation by a hard negative sampling strategy to optimize inter-class distinctiveness at a global scale. Extensive experimental results have demonstrated the effectiveness of our method, which achieves state-of-the-art single-stage image retrieval performance on benchmarks such as Revisited Oxford and Revisited Paris. Code is available at https://github.com/bassyess/CFCD.
引用
下载
收藏
页码:11226 / 11235
页数:10
相关论文
共 50 条
  • [1] Coarse-to-Fine Deep Metric Learning for Remote Sensing Image Retrieval
    Yun, Min-Sub
    Nam, Woo-Jeoung
    Lee, Seong-Whan
    REMOTE SENSING, 2020, 12 (02)
  • [2] Coarse-to-Fine Learning for Single-Image Super-Resolution
    Zhang, Kaibing
    Tao, Dacheng
    Gao, Xinbo
    Li, Xuelong
    Li, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (05) : 1109 - 1122
  • [3] Learning based coarse-to-fine image registration
    Jiang, Jiayan
    Zheng, Songfeng
    Toga, Arthur W.
    Tu, Zhuowen
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 429 - +
  • [4] Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
    Tian, Kaibin
    Cheng, Yanhua
    Liu, Yi
    Hou, Xinglin
    Chen, Quan
    Li, Han
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5207 - 5214
  • [5] PALMPRINT RECOGNITION USING COARSE-TO-FINE STATISTICAL IMAGE REPRESENTATION
    Han, Yufei
    Sun, Zhenan
    Tan, Tieniu
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1969 - 1972
  • [6] Rethinking Coarse-to-Fine Approach in Single Image Deblurring
    Cho, Sung-Jin
    Ji, Seo-Won
    Hong, Jun-Pyo
    Jung, Seung-Won
    Ko, Sung-Jea
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4621 - 4630
  • [7] Pattern Retrieval in Large Image Databases Using Multiscale Coarse-to-Fine Cascaded Active Learning
    Blanchart, Pierre
    Ferecatu, Marin
    Cui, Shiyong
    Datcu, Mihai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (04) : 1127 - 1141
  • [8] A Coarse-to-Fine Instance Segmentation Network with Learning Boundary Representation
    Luo, Feng
    Gao, Bin-Bin
    Yan, Jiangpeng
    Li, Xiu
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] A Coarse-to-Fine Approach for Medical Hyperspectral Image Classification with Sparse Representation
    Chang, Lan
    Zhang, Mengmeng
    Li, Wei
    AOPC 2017: OPTICAL SPECTROSCOPY AND IMAGING, 2017, 10461
  • [10] Coarse-to-fine manifold learning
    Castro, R
    Willett, R
    Nowak, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 992 - 995