3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

Cited by: 6
Authors
Qi, Xiaojuan [1]
Liu, Zhengzhe [2]
Chen, Qifeng [3]
Jia, Jiaya [4, 5]
Affiliations
[1] University of Oxford, Oxford, England
[2] DJI, Shenzhen, Guangdong, People's Republic of China
[3] HKUST, Hong Kong, People's Republic of China
[4] CUHK, Hong Kong, People's Republic of China
[5] YouTu Lab, Hong Kong, People's Republic of China
DOI
10.1109/CVPR.2019.00786
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A future video is the 2D projection of a 3D scene with predicted camera and object motion. Accurate future video prediction therefore requires an understanding of the 3D motion and geometry of a scene. In this paper we propose an RGBD scene forecasting model with 3D motion decomposition. We predict ego-motion and foreground motion, which are combined to generate a future 3D dynamic scene; this scene is then projected onto a 2D image plane to synthesize future motion, RGB images, and depth maps. Optional semantic maps can be integrated. Experimental results on the KITTI and Driving datasets show that our model outperforms other state-of-the-art methods in forecasting future RGBD dynamic scenes.
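The geometric step described in the abstract (lift to 3D, apply ego-motion plus foreground motion, project back to 2D) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the pinhole intrinsics matrix `K`, the rigid ego-motion `(R, t)`, the per-pixel `fg_motion` field, and the order in which the two motions are combined are all assumptions made for illustration. In the paper these motions are predicted by a network; here they are simply given as inputs.

```python
import numpy as np

def backproject(depth, K):
    """Lift a depth map (H, W) to camera-space 3D points (H, W, 3)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)
    rays = pix @ np.linalg.inv(K).T          # K^-1 applied to each pixel
    return rays * depth[..., None]

def forecast_points(points, R, t, fg_motion, fg_mask):
    """Assumed composition: rigid ego-motion for every point,
    plus an extra 3D displacement on foreground pixels only."""
    moved = points @ R.T + t
    return moved + fg_motion * fg_mask[..., None]

def project(points, K):
    """Project 3D points to pixel coordinates; also return the future depth."""
    proj = points @ K.T
    z = proj[..., 2]
    uv = proj[..., :2] / z[..., None]
    return uv, z

# Hypothetical toy scene: a flat wall 10 units away, camera moving 1 unit forward.
K = np.array([[100.0, 0.0, 2.0], [0.0, 100.0, 2.0], [0.0, 0.0, 1.0]])
depth = np.full((4, 4), 10.0)
fg_motion = np.zeros((4, 4, 3))
fg_mask = np.zeros((4, 4))
pts = backproject(depth, K)
future = forecast_points(pts, np.eye(3), np.array([0.0, 0.0, -1.0]), fg_motion, fg_mask)
uv, future_depth = project(future, K)
```

Subtracting the original pixel grid from `uv` gives a 2D motion field, and `future_depth` is the forecast depth map. Synthesizing the future RGB image would additionally require warping the current image along that field, which this sketch omits.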
Pages: 7665 - 7674
Page count: 10