SFGAN: Unsupervised Generative Adversarial Learning of 3D Scene Flow from the 3D Scene Self

被引:12
|
作者
Wang, Guangming [1 ]
Jiang, Chaokang [2 ]
Shen, Zehang [1 ]
Miao, Yanzi [2 ]
Wang, Hesheng [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manag, Key Lab Marine Intelligent Equipment & Syst,Minis, Key Lab Syst Control & Informat Proc,Dept Automat, Shanghai 200240, Peoples R China
[2] China Univ Min & Technol, Engn Res Ctr Intelligent Control Underground Spac, Minist Educ, Sch Informat & Control Engn,Adv Robot Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
关键词
3D point clouds; generative adversarial network; scene flow estimation; soft correspondence; unsupervised learning;
D O I
10.1002/aisy.202100197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene flow tracks the 3D motion of each point in adjacent point clouds. It provides fundamental 3D motion perception for autonomous driving and server robot. Although red green blue depth (RGBD) camera or light detection and ranging (LiDAR) capture discrete 3D points in space, the objects and motions usually are continuous in the macroworld. That is, the objects keep themselves consistent as they flow from the current frame to the next frame. Based on this insight, the generative adversarial networks (GAN) is utilized to self-learn 3D scene flow without ground truth. The fake point cloud is synthesized from the predicted scene flow and the point cloud of the first frame. The adversarial training of the generator and discriminator is realized through synthesizing indistinguishable fake point cloud and discriminating the real point cloud and the synthesized fake point cloud. The experiments on Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) dataset show that our method realizes promising results. Just as human, the proposed method can identify the similar local structures of two adjacent frames even without knowing the ground truth scene flow. Then, the local correspondence can be correctly estimated, and further the scene flow is correctly estimated. An interactive preprint version of the article can be found here: .
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Unsupervised Learning of 3D Scene Flow from Monocular Camera
    Wang, Guangming
    Tian, Xiaoyu
    Ding, Ruiqi
    Wang, Hesheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4325 - 4331
  • [2] Unsupervised Learning of 3D Scene Flow With LiDAR Odometry Assistance
    Wang, Guangming
    Feng, Zhiheng
    Jiang, Chaokang
    Liu, Jiuming
    Wang, Hesheng
    IEEE Transactions on Intelligent Transportation Systems, 2025, 26 (04) : 4557 - 4567
  • [3] Unsupervised Feature Learning for 3D Scene Labeling
    Lai, Kevin
    Bo, Liefeng
    Fox, Dieter
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 3050 - 3057
  • [4] Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
    Savkin, Artem
    Ellouze, Rachid
    Navab, Nassir
    Tombari, Federico
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1229 - 1235
  • [5] 3INGAN: Learning a 3D Generative Model from Images of a Self-similar Scene
    Karnewar, Animesh
    Wang, Oliver
    Ritschel, Tobias
    Mitra, Niloy J.
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 342 - 352
  • [6] SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene
    Son, Minjung
    Park, Jeong Joon
    Guihas, Leonidas
    Wetzstein, Gordon
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8507 - 8517
  • [7] Unsupervised Feature Learning for 3D Scene Reconstruction with Occupancy Maps
    Guizilini, Vitor
    Ramos, Fabio
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3827 - 3833
  • [8] Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
    Wald, Johanna
    Dhamo, Helisa
    Navab, Nassir
    Tombari, Federico
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3960 - 3969
  • [9] 3D Scene Generation by Learning from Examples
    Dema, Mesfin A.
    Sari-Sarraf, Hamed
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 58 - 64
  • [10] Weakly Supervised Learning of Rigid 3D Scene Flow
    Gojcic, Zan
    Litany, Or
    Wieser, Andreas
    Guibas, Leonidas J.
    Birdal, Tolga
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5688 - 5699