Semantic Reinforced Attention Learning for Visual Place Recognition

被引:29
|
作者
Peng, Guohao [1 ]
Yue, Yufeng [2 ]
Zhang, Jun [1 ]
Wu, Zhenyu [1 ]
Tang, Xiaoyu [1 ]
Wang, Danwei [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
关键词
LARGE-SCALE; FAB-MAP; LOCALIZATION; SLAM;
D O I
10.1109/ICRA48506.2021.9561812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale visual place recognition (VPR) is inherently challenging because not all visual cues in the image are beneficial to the task. In order to highlight the task-relevant visual cues in the feature embedding, the existing attention mechanisms are either based on artificial rules or trained in a thorough data-driven manner. To fill the gap between the two types, we propose a novel Semantic Reinforced Attention Learning Network (SRALNet), in which the inferred attention can benefit from both semantic priors and data-driven fine-tuning. The contribution lies in two-folds. (1) To suppress misleading local features, an interpretable local weighting scheme is proposed based on hierarchical feature distribution. (2) By exploiting the interpretability of the local weighting scheme, a semantic constrained initialization is proposed so that the local attention can be reinforced by semantic priors. Experiments demonstrate that our method outperforms state-of-the-art techniques on city-scale VPR benchmark datasets.
引用
收藏
页码:13415 / 13422
页数:8
相关论文
共 50 条
  • [1] Spatial attention based visual semantic learning for action recognition in still images
    Zheng, Yunpeng
    Zheng, Xiangtao
    Lu, Xiaoqiang
    Wu, Siyuan
    [J]. NEUROCOMPUTING, 2020, 413 : 383 - 396
  • [2] Learning Semantics for Visual Place Recognition Through Multi-scale Attention
    Paolicelli, Valerio
    Tavera, Antonio
    Masone, Carlo
    Berton, Gabriele
    Caputo, Barbara
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 454 - 466
  • [3] Robustifying Visual Place Recognition with Semantic Scene Categorization
    Arshad, Saba
    Kim, Gon-Woo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 467 - 469
  • [4] Vector Semantic Representations as Descriptors for Visual Place Recognition
    Neubert, Peer
    Schubert, Stefan
    Schlegel, Kenny
    Protzel, Peter
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [5] Semantic-guided de-attention with sharpened triplet marginal loss for visual place recognition
    Choi, Seung-Min
    Lee, Seung-Ik
    Lee, Jae-Yeong
    Kweon, In So
    [J]. PATTERN RECOGNITION, 2023, 141
  • [6] Learning Sequence Descriptor Based on Spatio-Temporal Attention for Visual Place Recognition
    Zhao, Junqiao
    Zhang, Fenglin
    Cai, Yingfeng
    Tian, Gengxuan
    Mu, Wenjie
    Ye, Chen
    Feng, Tiantian
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (03): : 2351 - 2358
  • [7] Learning Context Flexible Attention Model for Long-Term Visual Place Recognition
    Chen, Zetao
    Liu, Lingqiao
    Sa, Inkyu
    Ge, Zongyuan
    Chli, Margarita
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 4015 - 4022
  • [8] Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning
    Zhao, Rui-Wei
    Zhang, Qi
    Wu, Zuxuan
    Li, Jianguo
    Jiang, Yu-Gang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [9] An extended-HCT semantic description for visual place recognition
    Wang, Min-Liang
    Lin, Huei-Yung
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (11): : 1403 - 1420
  • [10] Self-learning Attention Global Pooling Based Image Representation for Visual Place Recognition
    Huang, Xiaoquan
    Zheng, Song
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 849 - 854