Self-supervised learning of monocular 3D geometry understanding with two- and three-view geometric constraints

被引：0

作者：

Liu, Xiaoliang ^{[1
,2
]}

Shen, Furao ^{[1
,3
]}

Zhao, Jian ^{[1
,4
]}

Nie, Changhai ^{[1
,2
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

[2] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China

[3] Nanjing Univ, Sch Artificial Intelligence, Nanjing, Peoples R China

[4] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 02期

基金：

美国国家科学基金会;

关键词：

3D geometry understanding; Optical flow estimation; Visual odometry; Depth estimation; Self-supervised learning; Dynamic scenes; VISUAL ODOMETRY; OPTICAL-FLOW;

D O I：

10.1007/s00371-023-02840-y

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The 3D geometry understanding of dynamic scenes captured by moving cameras is one of the cornerstones of 3D scene understanding. Optical flow estimation, visual odometry, and depth estimation are the three most basic tasks in 3D geometry understanding. In this work, we present a unified framework for joint self-supervised learning of optical flow estimation, visual odometry, and depth estimation with two- and three-view geometric constraints. As we all know, visual odometry and depth estimation are more sensitive to dynamic objects, while optical flow estimation is more difficult to estimate the boundary area moved out of the image. To this end, we use estimated optical flow to help visual odometry and depth estimation process dynamic objects and use a rigid flow synthesized by the estimated pose and depth to help learn the optical flow of the area that moves out of the boundary due to camera motion. In order to further improve the consistency of cross-tasks, we introduce three-view geometric constraints and propose a three-view consistency loss. Finally, experiments on the KITTI data set show that our method can effectively improve the performance of the occluded boundary area and the dynamic object area. Moreover, our method achieves comparable or better performance than other monocular self-supervised state-of-the-art methods in these three subtasks.

引用

页码：1193 / 1204

页数：12

共 50 条

[1] Self-supervised learning of monocular 3D geometry understanding with two- and three-view geometric constraints
Xiaoliang Liu
Furao Shen
Jian Zhao
Changhai Nie
The Visual Computer, 2024, 40 (2) : 1193 - 1204
[2] DeepVIO: Self-supervised Deep Learning of Monocular Visual Inertial Odometry using 3D Geometric Constraints
Han, Liming
Lin, Yimin
Du, Guoguang
Lian, Shiguo
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6906 - 6913
[3] Self-Supervised Learning of 3D Human Pose using Multi-view Geometry
Kocabas, Muhammed
Karagoz, Salih
Akbas, Emre
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1077 - 1086
[4] Monocular depth estimation using self-supervised learning with more effective geometric constraints
Xiong, Mingkang
Zhang, Zhenghong
Liu, Jiyuan
Zhang, Tao
Xiong, Huilin
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
[5] Self-supervised Learning with Geometric Constraints in Monocular Video Connecting Flow, Depth, and Camera
Chen, Yuhua
Schmid, Cordelia
Sminchisescu, Cristian
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7062 - 7071
[6] Monocular depth estimation using self-supervised learning with more effective geometric constraints
Xiong, Mingkang
Zhang, Zhenghong
Liu, Jiyuan
Zhang, Tao
Xiong, Huilin
Engineering Applications of Artificial Intelligence, 2024, 128
[7] Crowdsourced 3D Mapping: A Combined Multi-View Geometry and Self-Supervised Learning Approach
Chaw, Hemang
Jukola, Matai
Brouns, Terence
Arani, Elahe
Zonooz, Bahram
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4750 - 4757
[8] 3D Packing for Self-Supervised Monocular Depth Estimation
Guizilini, Vitor
Ambrus, Rares
Pillai, Sudeep
Raventos, Allan
Gaidon, Adrien
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2482 - 2491
[9] Self-supervised Monocular Depth and Visual Odometry Learning with Scale-consistent Geometric Constraints
Xiong, Mingkang
Zhang, Zhenghong
Zhong, Weilin
Ji, Jinsheng
Liu, Jiyuan
Xiong, Huilin
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 963 - 969
[10] Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry
Bouazizi, Arij
Wiederer, Julian
Kressel, Ulrich
Belagiannis, Vasileios
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,

← 1 2 3 4 5 →