Improved Scene Landmark Detection for Camera Localization

被引:0
|
作者
Do, Tien [1 ]
Sinha, Sudipta N. [2 ]
机构
[1] Tesla, Austin, TX 78725 USA
[2] Microsoft, Redmond, WA USA
关键词
D O I
10.1109/3DV62453.2024.00069
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-specific 3D points or landmarks and computing camera pose from the associated 2D-3D correspondences. Although SLD outperformed existing learning-based approaches, it was notably less accurate than 3D structure-based methods. In this paper, we show that the accuracy gap was due to insufficient model capacity and noisy labels during training. To mitigate the capacity issue, we propose to split the landmarks into subgroups and train a separate network for each subgroup. To generate better training labels, we propose using dense reconstructions to estimate visibility of scene landmarks. Finally, we present a compact architecture to improve memory efficiency. Accuracy wise, our approach is on par with state of the art structure-based methods on the INDOOR- 6 dataset but runs significantly faster and uses less storage. Code and models can be found at https://github.com/microsoft/SceneLandmarkLocalization.
引用
收藏
页码:975 / 984
页数:10
相关论文
共 50 条
  • [41] Efficient scene change detection and camera motion annotation for video classification
    Xiong, W
    Lee, JCM
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 71 (02) : 166 - 181
  • [42] Explicit Occlusion Detection based Deformable Fitting for Facial Landmark Localization
    Yu, Xiang
    Yang, Fei
    Huang, Junzhou
    Metaxas, Dimitris N.
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [43] Map-Matching-Based Cascade Landmark Detection and Vehicle Localization
    Choi, Kyoungtaek
    Suhr, Jae Kyu
    Jung, Ho Gi
    IEEE ACCESS, 2019, 7 : 127874 - 127894
  • [44] FACE DETECTION AND LANDMARK LOCALIZATION USING BILAYER TREE STRUCTURED MODEL
    Hsu, Gee-Sern
    Chang, Kai-Hsiang
    Huang, Shih-Chieh
    Chung, Sheng-Luen
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4303 - 4307
  • [45] Scene conditional background update for moving object detection in a moving camera
    Yun, Kimin
    Lim, Jongin
    Choi, Jin Young
    PATTERN RECOGNITION LETTERS, 2017, 88 : 57 - 63
  • [46] Fingerprint localization using WLAN RSS and magnetic field with landmark detection
    Tang, Peng
    Huang, ZhiQing
    Lei, Jun
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [47] Single photon detection and localization accuracy with an ebCMOS camera
    Cajgfinger, T.
    Dominjon, A.
    Barbier, R.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2015, 787 : 176 - 181
  • [48] Improvement on Target Detection and Localization in Camera Sensor Network
    Hosseinkhani, Zohreh
    Karimi, Nader
    Hajabdollahi, Mohsen
    Samavi, Shadrokh
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 209 - 212
  • [49] Improved FCENet Algorithm for Natural Scene Text Detection
    Zhou, Yan
    Liao, Junwei
    Liu, Xiangyu
    Zhou, Yuexia
    Zeng, Fanzhi
    Computer Engineering and Applications, 2024, 60 (03) : 228 - 235
  • [50] An Improved Approach of Scene Change Detection in Archived Films
    Zhang Xiaona
    Qi Guoqing
    Wang Qiang
    Zhang Tao
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 825 - 828