Self-supervised learning of monocular 3D geometry understanding with two- and three-view geometric constraints

被引:0
|
作者
Liu, Xiaoliang [1 ,2 ]
Shen, Furao [1 ,3 ]
Zhao, Jian [1 ,4 ]
Nie, Changhai [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[3] Nanjing Univ, Sch Artificial Intelligence, Nanjing, Peoples R China
[4] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 02期
基金
美国国家科学基金会;
关键词
3D geometry understanding; Optical flow estimation; Visual odometry; Depth estimation; Self-supervised learning; Dynamic scenes; VISUAL ODOMETRY; OPTICAL-FLOW;
D O I
10.1007/s00371-023-02840-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The 3D geometry understanding of dynamic scenes captured by moving cameras is one of the cornerstones of 3D scene understanding. Optical flow estimation, visual odometry, and depth estimation are the three most basic tasks in 3D geometry understanding. In this work, we present a unified framework for joint self-supervised learning of optical flow estimation, visual odometry, and depth estimation with two- and three-view geometric constraints. As we all know, visual odometry and depth estimation are more sensitive to dynamic objects, while optical flow estimation is more difficult to estimate the boundary area moved out of the image. To this end, we use estimated optical flow to help visual odometry and depth estimation process dynamic objects and use a rigid flow synthesized by the estimated pose and depth to help learn the optical flow of the area that moves out of the boundary due to camera motion. In order to further improve the consistency of cross-tasks, we introduce three-view geometric constraints and propose a three-view consistency loss. Finally, experiments on the KITTI data set show that our method can effectively improve the performance of the occluded boundary area and the dynamic object area. Moreover, our method achieves comparable or better performance than other monocular self-supervised state-of-the-art methods in these three subtasks.
引用
收藏
页码:1193 / 1204
页数:12
相关论文
共 50 条
  • [1] Self-supervised learning of monocular 3D geometry understanding with two- and three-view geometric constraints
    Xiaoliang Liu
    Furao Shen
    Jian Zhao
    Changhai Nie
    The Visual Computer, 2024, 40 (2) : 1193 - 1204
  • [2] DeepVIO: Self-supervised Deep Learning of Monocular Visual Inertial Odometry using 3D Geometric Constraints
    Han, Liming
    Lin, Yimin
    Du, Guoguang
    Lian, Shiguo
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6906 - 6913
  • [3] Self-Supervised Learning of 3D Human Pose using Multi-view Geometry
    Kocabas, Muhammed
    Karagoz, Salih
    Akbas, Emre
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1077 - 1086
  • [4] Monocular depth estimation using self-supervised learning with more effective geometric constraints
    Xiong, Mingkang
    Zhang, Zhenghong
    Liu, Jiyuan
    Zhang, Tao
    Xiong, Huilin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [5] Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera
    Chen, Yuhua
    Schmid, Cordelia
    Sminchisescu, Cristian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7062 - 7071
  • [6] Monocular depth estimation using self-supervised learning with more effective geometric constraints
    Xiong, Mingkang
    Zhang, Zhenghong
    Liu, Jiyuan
    Zhang, Tao
    Xiong, Huilin
    Engineering Applications of Artificial Intelligence, 2024, 128
  • [7] Crowdsourced 3D Mapping: A Combined Multi-View Geometry and Self-Supervised Learning Approach
    Chaw, Hemang
    Jukola, Matai
    Brouns, Terence
    Arani, Elahe
    Zonooz, Bahram
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4750 - 4757
  • [8] 3D Packing for Self-Supervised Monocular Depth Estimation
    Guizilini, Vitor
    Ambrus, Rares
    Pillai, Sudeep
    Raventos, Allan
    Gaidon, Adrien
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2482 - 2491
  • [9] Self-supervised Monocular Depth and Visual Odometry Learning with Scale-consistent Geometric Constraints
    Xiong, Mingkang
    Zhang, Zhenghong
    Zhong, Weilin
    Ji, Jinsheng
    Liu, Jiyuan
    Xiong, Huilin
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 963 - 969
  • [10] Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry
    Bouazizi, Arij
    Wiederer, Julian
    Kressel, Ulrich
    Belagiannis, Vasileios
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,