Sequential Fusion of Multi-view Video Frames for 3D Scene Generation

被引:4
|
作者
Sun, Weilin [1 ]
Li, Xiangxian [1 ]
Li, Manyi [1 ]
Wang, Yuqing [1 ]
Zheng, Yuze [1 ]
Meng, Xiangxu [1 ]
Meng, Lei [1 ]
机构
[1] Shandong Univ, Jinan, Shandong, Peoples R China
来源
关键词
3D scene generation; Multi-view fusion; Multi-view time series data;
D O I
10.1007/978-3-031-20497-5_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D scene understanding and generation are to reconstruct the layout of the scene and each object from an RGB image, estimate its semantic type in 3D space and generate a 3D scene. At present, the 3D scene generation algorithm based on deep learning mainly recovers the 3D scene from a single image. Due to the complexity of the real environment, the information provided by a single image is limited, and there are problems such as the lack of single-view information and the occlusion of objects in the scene. In response to the above problems, we propose a 3D scene generation framework SGMT, which realizes multi-view position information fusion and reconstructs the 3D scene from multi-view video time series data to compensate for the missing object position in existing methods. We demonstrated the effectiveness of multi-view scene generation of SGMT on the UrbanScene3D and SUNRGBD dataset and studied the influence of SGCN and joint fine-tuning. In addition, we further explored the transfer ability of the SGMT between datasets and discussed future improvements.
引用
收藏
页码:597 / 608
页数:12
相关论文
共 50 条
  • [21] 3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels
    Zhang, Qi
    Chan, Antoni B.
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12837 - 12844
  • [22] IMPROVED MULTI-VIEW DEPTH ESTIMATION FOR VIEW SYNTHESIS IN 3D VIDEO CODING
    Zhang, Qiuwen
    An, Ping
    Zhang, Yan
    Shen, Liquan
    Zhang, Zhaoyang
    [J]. 2011 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2011,
  • [23] A View-Level Rate Distortion Model for Multi-View/3D Video
    Roodaki, Hoda
    Iravani, Zahra
    Hashemi, Mahmoud Reza
    Shirmohammadi, Shervin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (01) : 14 - 24
  • [24] Color calibration of multi-view video plus depth for advanced 3D video
    Sid Ahmed Fezza
    Mohamed-Chaker Larabi
    [J]. Signal, Image and Video Processing, 2015, 9 : 177 - 191
  • [25] Color calibration of multi-view video plus depth for advanced 3D video
    Fezza, Sid Ahmed
    Larabi, Mohamed-Chaker
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2015, 9 : 177 - 191
  • [26] RGB-D Multi-View System Calibration for Full 3D Scene Reconstruction
    Afzal, Hassan
    Aouada, Djamila
    Fofi, David
    Mirbach, Bruno
    Ottersten, Bjoern
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2459 - 2464
  • [27] 3D-DCT Chip Design for 3D Multi-view Video Compression
    Fan, Yu-Cheng
    Chen, Shan-Ann
    Wu, Kuo-Gi
    You, Jun-Lin
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2012, 6 (02): : 567S - 572S
  • [28] Indoor Scene Understanding by Fusing Multi-View RGB-D Image Frames
    Li, Xiangpan
    Zhang, Biao
    Sun, Fengchi
    Liu, Jie
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (06): : 1218 - 1226
  • [29] Wide-Baseline Multi-View Video Segmentation for 3D Reconstruction
    Sarim, Muhammad
    Hilton, Adrian
    Guillemaut, Jean-Yves
    Kim, Hansung
    Takai, Takeshi
    [J]. PROCEEDINGS OF THE 2010 ACM WORKSHOP ON 3D VIDEO PROCESSING (3DVP'10), 2010, : 13 - 18
  • [30] Adaptive delivery of immersive 3D multi-view video over the Internet
    Cagri Ozcinar
    Erhan Ekmekcioglu
    Janko Ćalić
    Ahmet Kondoz
    [J]. Multimedia Tools and Applications, 2016, 75 : 12431 - 12461