Improved Scene Landmark Detection for Camera Localization

被引:0
|
作者
Do, Tien [1 ]
Sinha, Sudipta N. [2 ]
机构
[1] Tesla, Austin, TX 78725 USA
[2] Microsoft, Redmond, WA USA
关键词
D O I
10.1109/3DV62453.2024.00069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-specific 3D points or landmarks and computing camera pose from the associated 2D-3D correspondences. Although SLD outperformed existing learning-based approaches, it was notably less accurate than 3D structure-based methods. In this paper, we show that the accuracy gap was due to insufficient model capacity and noisy labels during training. To mitigate the capacity issue, we propose to split the landmarks into subgroups and train a separate network for each subgroup. To generate better training labels, we propose using dense reconstructions to estimate visibility of scene landmarks. Finally, we present a compact architecture to improve memory efficiency. Accuracy wise, our approach is on par with state of the art structure-based methods on the INDOOR- 6 dataset but runs significantly faster and uses less storage. Code and models can be found at https://github.com/microsoft/SceneLandmarkLocalization.
引用
收藏
页码:975 / 984
页数:10
相关论文
共 50 条
  • [31] A Comparison of Scene Change Localization Methods over the Open Video Scene Detection Dataset
    Panchenko, Taras
    Bieda, Igor
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 1 - 6
  • [32] Laplace Landmark Localization
    Robinson, Joseph P.
    Li, Yuncheng
    Zhang, Ning
    Fu, Yun
    Tulyakov, Sergey
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10102 - 10111
  • [33] IMPROVED HOURGLASS STRUCTURE FOR HIGH PERFORMANCE FACIAL LANDMARK DETECTION
    Lai, Shenqi
    Chai, Zhenhua
    Wei, Xiaoming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 669 - 672
  • [34] Consistency Loss for Improved Colonoscopy Landmark Detection with Vision Transformers
    Tamhane, Aniruddha
    Dobkin, Daniel
    Shtalrid, Ore
    Bouhnik, Moshe
    Posner, Erez
    Mida, Tse'ela
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II, 2024, 14349 : 124 - 133
  • [35] Indoor Camera Relocation Method Based on Improved Scene Coordinate Regression Network
    Wang, Jing
    Hu, Shaoyi
    Guo, Ping
    Jin, Yuchu
    Computer Engineering and Applications, 2023, 59 (15) : 160 - 168
  • [36] Camera-Sonar Combination for Improved Underwater Localization and Mapping
    Cardaillac, Alexandre
    Ludvigsen, Martin
    IEEE ACCESS, 2023, 11 : 123070 - 123079
  • [37] Online Detection of Map Discrepancies in Landmark-Based Robotic Localization
    Schwesinger, Dylan
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 412 - 415
  • [38] Detection of Scene Obstructions and Persistent View Changes in Transportation Camera Systems
    Raghavan, Ajay
    Price, Robert
    Liu, Juan
    2012 15TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2012, : 957 - 962
  • [39] Multi-view Face Detection and Landmark Localization Based on MTCNN
    Ma, Mei
    Wang, Jianji
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 4200 - 4205
  • [40] Automatic control of PTZ camera based on object detection and scene partition
    Wang, Shuai
    Tian, Yan
    Xu, Yiping
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2015, : 1 - 6