Learning Task-Aligned Local Features for Visual Localization

Cited by: 2
Authors
Liu, Chuanjin [1, 2]
Liu, Hongmin [1, 2]
Zhang, Lixin [1, 2]
Zeng, Hui [1, 2]
Luo, Lufeng [3]
Fan, Bin [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100083, Peoples R China
[2] Minist Educ, Key Lab Intelligent Bion Unmanned Syst, Beijing 100083, Peoples R China
[3] Foshan Univ, Sch Mechatron Engn & Automat, Foshan 528000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Location awareness; Feature extraction; Visualization; Detectors; Three-dimensional displays; Task analysis; Cameras; Localization; deep learning for visual perception; computer vision for automation; SLAM;
DOI
10.1109/LRA.2023.3268015
Chinese Library Classification
TP24 [Robotics];
Subject classification code
080202 ; 1405 ;
Abstract
Visual localization plays a key role in various robot perception systems. Robust visual localization relies on reliable and repeatable local features to establish high-quality point correspondences among images. This letter addresses two limitations of jointly learning the detector and descriptor. First, existing methods use independent structures and separate loss functions for keypoint detection and description, which makes it difficult to detect keypoints that correspond to discriminative descriptors. Second, most existing approaches treat all triplet samples equally, which limits the ability of the learning algorithm to obtain highly discriminative descriptors. In this letter, we propose Task-aligned SuperPoint (TaSP) to mitigate these problems. First, we explicitly align descriptor and detector learning to increase the probability that distinctive points are detected. Second, we introduce a dynamic importance weighting module that calculates the weight of each triplet sample based on intrinsic and empirical importance, so that the network focuses on the most informative triplets throughout training. In addition, we resort to 3D space to seek negative samples when forming triplets, which avoids the risk of selecting negatives from repetitive structures. State-of-the-art results on a variety of visual localization benchmarks demonstrate the superiority of our method.
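Illustrative note (not part of the published abstract): the idea of weighting triplet samples unequally can be sketched as a weighted triplet margin loss. The abstract does not give the formulas for intrinsic or empirical importance, so the sketch below takes the per-triplet weights as an input; the function names, the margin value, and the normalization by the weight sum are all assumptions made for illustration, not the authors' implementation.

```python
import math


def l2(a, b):
    """Euclidean distance between two equal-length descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def weighted_triplet_loss(triplets, weights, margin=1.0):
    """Weighted mean of hinge triplet losses.

    triplets: list of (anchor, positive, negative) descriptor vectors.
    weights:  one non-negative weight per triplet (hypothetical input;
              how TaSP computes its weights is not stated in the abstract).
    """
    weight_sum = sum(weights)
    if weight_sum <= 0:
        raise ValueError("weights must sum to a positive value")
    total = 0.0
    for (a, p, n), w in zip(triplets, weights):
        # Standard triplet hinge: positive should be closer than negative by `margin`.
        total += w * max(0.0, margin + l2(a, p) - l2(a, n))
    return total / weight_sum


# One violated triplet (loss 5) and one satisfied triplet (loss 0).
triplets = [
    ([0.0, 0.0], [3.0, 4.0], [0.0, 1.0]),
    ([0.0, 0.0], [0.0, 1.0], [3.0, 4.0]),
]
uniform = weighted_triplet_loss(triplets, [1.0, 1.0])
focused = weighted_triplet_loss(triplets, [3.0, 1.0])
```

With uniform weights both triplets contribute equally; raising the weight of the violated triplet increases its share of the loss, which is the general effect that importance weighting is meant to achieve.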
Pages: 3366-3373
Number of pages: 8