Explicit feature disentanglement for visual place recognition across appearance changes

被引:2
|
作者
Tang, Li [1 ]
Wang, Yue [1 ]
Tan, Qimeng [2 ]
Xiong, Rong [1 ]
机构
[1] Zhejiang Univ, Dept Control Sci & Engn, Hangzhou 30012, Peoples R China
[2] Beijing Inst Spacecraft Syst Engn, Beijing Key Lab Intelligent Space Robot Syst Tech, Beijing, Peoples R China
关键词
Place recognition; feature disentanglement; adversarial; self-supervised; changing appearance; SIMULTANEOUS LOCALIZATION; NAVIGATION; SLAM;
D O I
10.1177/17298814211037497
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the long-term deployment of mobile robots, changing appearance brings challenges for localization. When a robot travels to the same place or restarts from an existing map, global localization is needed, where place recognition provides coarse position information. For visual sensors, changing appearances such as the transition from day to night and seasonal variation can reduce the performance of a visual place recognition system. To address this problem, we propose to learn domain-unrelated features across extreme changing appearance, where a domain denotes a specific appearance condition, such as a season or a kind of weather. We use an adversarial network with two discriminators to disentangle domain-related features and domain-unrelated features from images, and the domain-unrelated features are used as descriptors in place recognition. Provided images from different domains, our network is trained in a self-supervised manner which does not require correspondences between these domains. Besides, our feature extractors are shared among all domains, making it possible to contain more appearance without increasing model complexity. Qualitative and quantitative results on two toy cases are presented to show that our network can disentangle domain-related and domain-unrelated features from given data. Experiments on three public datasets and one proposed dataset for visual place recognition are conducted to illustrate the performance of our method compared with several typical algorithms. Besides, an ablation study is designed to validate the effectiveness of the introduced discriminators in our network. Additionally, we use a four-domain dataset to verify that the network can extend to multiple domains with one model while achieving similar performance.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Visual Place Recognition: A Survey
    Lowry, Stephanie
    Suenderhauf, Niko
    Newman, Paul
    Leonard, John J.
    Cox, David
    Corke, Peter
    Milford, Michael J.
    IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (01) : 1 - 19
  • [32] Visual Place Recognition: A Tutorial
    Schubert, Stefan
    Neubert, Peer
    Garg, Sourav
    Milford, Michael
    Fischer, Tobias
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2024, 31 (03) : 139 - 153
  • [33] Variational Bayesian Approach to Condition-Invariant Feature Extraction for Visual Place Recognition
    Oh, Junghyun
    Eoh, Gyuho
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [34] APPEARANCE FEATURE EXTRACTION VERSUS IMAGE TRANSFORM-BASED APPROACH FOR VISUAL SPEECH RECOGNITION
    Sagheer, Alaa
    Tsuruta, Naoyuki
    Taniguchi, Rin-Ichiro
    Maeda, Sakashi
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (01) : 101 - 122
  • [35] PHROG: A Multimodal Feature for Place Recognition
    Bonardi, Fabien
    Ainouz, Samia
    Boutteau, Remi
    Dupuis, Yohan
    Savatier, Xavier
    Vasseur, Pascal
    SENSORS, 2017, 17 (05)
  • [36] Appearance and shape-based hybrid visual feature extraction: toward audio-visual automatic speech recognition
    Debnath, Saswati
    Roy, Pinki
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (01) : 25 - 32
  • [37] A characterization of visual feature recognition
    Mathew, B
    Davis, A
    Evans, R
    2003 IEEE INTERNATIONAL WORKSHOP ON WORKLOAD CHARACTERIZATION, 2003, : 3 - 11
  • [38] Vehicle appearance feature recognition based on image
    Shao, Deyang
    Xu, Chao
    Luo, Shixian
    Feng, Bo
    Jiao, Long
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS, ENVIRONMENT, BIOTECHNOLOGY AND COMPUTER (MMEBC), 2016, 88 : 859 - 863
  • [39] Evaluation of Clustering Methods in Compression of Topological Models and Visual Place Recognition Using Global Appearance Descriptors
    Cebollada, Sergio
    Paya, Luis
    Mayol, Walterio
    Reinoso, Oscar
    APPLIED SCIENCES-BASEL, 2019, 9 (03):
  • [40] DISENTANGLEMENT FOR AUDIO-VISUAL EMOTION RECOGNITION USING MULTITASK SETUP
    Peri, Raghuveer
    Parthasarathy, Srinivas
    Bradshaw, Charles
    Sundaram, Shiva
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6344 - 6348