Deep 6-DoF camera relocalization in variable and dynamic scenes by multitask learning

被引:4
|
作者
Wang, Junyi [1 ,2 ]
Qi, Yue [1 ,2 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Beihang Univ, Qingdao Res Inst, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-based localization; Deep learning; Dynamic localization; Multitask learning; LOCALIZATION; ROBUST; TRACKING;
D O I
10.1007/s00138-023-01388-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, direct visual localization with convolutional neural networks has attracted researchers' attention with achieving an end-to-end process. However, on the one side, the lack of using 3D information leads to imprecise accuracy. Meanwhile, the single input image confuses the relocalization in the scenes that keep similar views at different positions. On the other side, the relocalization problem in variable or dynamic scenes is still challenging. Concentrating on these concerns, we propose two multitask relocalization networks called MMLNet and MMLNet+ for obtaining the 6-DoF camera pose in static, variable and dynamic scenes. Firstly, addressing the dataset lack of variable scenes, we construct a variable scene dataset with a semiautomatic process combining SFM and MVS algorithms with a few manual labels. Based on the process, three scenes covering an office, a bedroom and a sitting room are gathered and generated. Secondly, to enhance the perception between 2D images and 3D poses, we design a multitask network called MMLNet that regresses both camera pose and scene point cloud. Meanwhile, the Chamfer distance is joined into the original pose loss to optimize MMLNet. Moreover, MMLNet learns the pose trajectory feature by using LSTM layers to the additional pose array input, which meanwhile breaks through the limitation of single image input. Based on the MMLNet, aiming at dynamic and variable scenes, MMLNet+ outputs the auxiliary segmentation branch that distinguishes fixed, changeable or dynamic parts of the input image. Furthermore, we define the feature fusion block to implement the feature sharing among three tasks, further promoting the performance in dynamic and variable environments. Finally, experiments on static, dynamic and our constructed variable datasets demonstrate state-of-the-art relocalization performances of MMLNet and MMLNet+. Simultaneously, the positive effects of the pose learning part, reconstruction branch and segmentation task are also illustrated.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Uncalibrated stereo vision with deep learning for 6-DOF pose estimation for a robot arm system
    Abdelaal, Mahmoud
    Farag, Ramy M. A.
    Saad, Mohamed S.
    Bahgat, Ahmed
    Emara, Hassan M.
    El-Dessouki, Ayman
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 145
  • [42] Deep 6-DoF Tracking of Unknown Objects for Reactive Grasping
    Tuscher, Marc
    Hoerz, Julian
    Driess, Danny
    Toussaint, Marc
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14185 - 14191
  • [43] 6-DoF Pose Estimation from Stereo LiDAR of Actual Machine using Deep Learning
    Hashimoto, Shintaro
    Nakajima, Yu
    Ishihama, Naoki
    2023 IEEE AEROSPACE CONFERENCE, 2023,
  • [44] The design and dynamic analysis of a novel 6-DOF parallel mechanism
    Xiuling Liu
    Qingquan Wang
    Alexander Malikov
    Hongrui Wang
    International Journal of Machine Learning and Cybernetics, 2012, 3 : 27 - 37
  • [45] Characteristics analysis of dynamic model of 6-DOF Stewart platform
    School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100083, China
    不详
    Beijing Hangkong Hangtian Daxue Xuebao, 2007, 8 (940-944):
  • [46] The design and dynamic analysis of a novel 6-DOF parallel mechanism
    Liu, Xiuling
    Wang, Qingquan
    Malikov, Alexander
    Wang, Hongrui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2012, 3 (01) : 27 - 37
  • [47] Dynamic Stability of Unguided Projectile with 6-DOF Trajectory Modeling
    Bashir, Musavir
    Khan, Sher Afghan
    Udayagiri, Leelanadh
    Noor, Asim
    2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 1002 - 1009
  • [48] A strategy to simplify the dynamic model of 6-DOF motion simulator
    Yin, Liyi
    Yang, Chifu
    Gao, Changhong
    Cong, Dacheng
    Han, Junwei
    PROCEEDINGS OF THE 2015 4TH INTERNATIONAL CONFERENCE ON SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS, 2016, 43 : 665 - 672
  • [49] Study on Dynamic and Accurate 6-DOF Measurement System and Approach
    Xin, Ruikai
    Lin, Jiarui
    Shi, Shendong
    Zhu, Jigui
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (06) : 6356 - 6365
  • [50] Vehicle global 6-DoF pose estimation under traffic surveillance camera
    Zhang, Shanxin
    Wang, Cheng
    He, Zijian
    Li, Qing
    Lin, Xiuhong
    Li, Xin
    Zhang, Juyong
    Yang, Chenhui
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 : 114 - 128