Sequential Fusion of Multi-view Video Frames for 3D Scene Generation

被引:4
|
作者
Sun, Weilin [1 ]
Li, Xiangxian [1 ]
Li, Manyi [1 ]
Wang, Yuqing [1 ]
Zheng, Yuze [1 ]
Meng, Xiangxu [1 ]
Meng, Lei [1 ]
机构
[1] Shandong Univ, Jinan, Shandong, Peoples R China
来源
关键词
3D scene generation; Multi-view fusion; Multi-view time series data;
D O I
10.1007/978-3-031-20497-5_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D scene understanding and generation are to reconstruct the layout of the scene and each object from an RGB image, estimate its semantic type in 3D space and generate a 3D scene. At present, the 3D scene generation algorithm based on deep learning mainly recovers the 3D scene from a single image. Due to the complexity of the real environment, the information provided by a single image is limited, and there are problems such as the lack of single-view information and the occlusion of objects in the scene. In response to the above problems, we propose a 3D scene generation framework SGMT, which realizes multi-view position information fusion and reconstructs the 3D scene from multi-view video time series data to compensate for the missing object position in existing methods. We demonstrated the effectiveness of multi-view scene generation of SGMT on the UrbanScene3D and SUNRGBD dataset and studied the influence of SGCN and joint fine-tuning. In addition, we further explored the transfer ability of the SGMT between datasets and discussed future improvements.
引用
收藏
页码:597 / 608
页数:12
相关论文
共 50 条
  • [1] Generation of Multi-View Video Using a Fusion Camera System for 3D Displays
    Lee, Eun-Kyung
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2797 - 2805
  • [2] Multi-view PointNet for 3D Scene Understanding
    Jaritz, Maximilian
    Gu, Jiayuan
    Su, Hao
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3995 - 4003
  • [3] A COMPACT 3D REPRESENTATION FOR MULTI-VIEW VIDEO
    Salvador, Jordi
    Casas, Josep R.
    [J]. INTERNATIONAL CONFERENCE ON 3D IMAGING 2011 (IC3D 2011), 2011,
  • [4] Multi-view video compression for 3D displays
    Zwicker, Matthias
    Yea, Sehoon
    Vetro, Anthony
    Forlines, Clifton
    Matusik, Wojciech
    Pfister, Hanspeter
    [J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1506 - +
  • [5] Sequential selection and calibration of video frames for 3D outdoor scene reconstruction
    Sun, Weilin
    Li, Manyi
    Li, Peng
    Cao, Xiao
    Meng, Xiangxu
    Meng, Lei
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024,
  • [6] Neural 3D Video Synthesis from Multi-view Video
    Li, Tianye
    Slavcheva, Mira
    Zollhoefer, Michael
    Green, Simon
    Lassner, Christoph
    Kim, Changil
    Schmidt, Tanner
    Lovegrove, Steven
    Goesele, Michael
    Newcombe, Richard
    Lv, Zhaoyang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5511 - 5521
  • [7] Virtual View Adaptation for 3D Multi-View Video Streaming
    Petrovic, Goran
    Do, Luat
    Zinger, Sveta
    de With, Peter H. N.
    [J]. STEREOSCOPIC DISPLAYS AND APPLICATIONS XXI, 2010, 7524
  • [8] A Multi-view Projection 3D Video Processing Technology
    Dou, Yuchao
    Dai, Taotao
    Fu, Lei
    Huang, Ziqiang
    [J]. 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 1162 - 1165
  • [9] MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER
    Zhong, Yijie
    Sun, Zhengxing
    Sun, Yunhan
    Luo, Shoutong
    Wang, Yi
    Zhang, Wei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1661 - 1665
  • [10] Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception
    Wang, Li
    Li, Ruifeng
    Sun, Jingwen
    Liu, Xingxing
    Zhao, Lijun
    Seah, Hock Soon
    Quah, Chee Kwang
    Tandianus, Budianto
    [J]. SENSORS, 2019, 19 (19)