Sequential Fusion of Multi-view Video Frames for 3D Scene Generation

Cited by: 4

Authors:

Sun, Weilin [1]

Li, Xiangxian [1]

Li, Manyi [1]

Wang, Yuqing [1]

Zheng, Yuze [1]

Meng, Xiangxu [1]

Meng, Lei [1]

Affiliations:

[1] Shandong Univ, Jinan, Shandong, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I | 2022 / Vol. 13604

Keywords:

3D scene generation; Multi-view fusion; Multi-view time series data

DOI:

10.1007/978-3-031-20497-5_49

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

3D scene understanding and generation aim to reconstruct the layout of a scene and each object in it from an RGB image, estimate each object's semantic type in 3D space, and generate a 3D scene. At present, deep-learning-based 3D scene generation algorithms mainly recover the 3D scene from a single image. Because real environments are complex, the information provided by a single image is limited, which leads to problems such as missing single-view information and occlusion of objects in the scene. To address these problems, we propose SGMT, a 3D scene generation framework that fuses multi-view position information and reconstructs the 3D scene from multi-view video time series data, compensating for the missing object positions in existing methods. We demonstrate the effectiveness of SGMT's multi-view scene generation on the UrbanScene3D and SUNRGBD datasets and study the influence of SGCN and joint fine-tuning. In addition, we explore the transfer ability of SGMT between datasets and discuss future improvements.


Pages: 597-608

Page count: 12

50 entries in total

[1] Generation of Multi-View Video Using a Fusion Camera System for 3D Displays
Lee, Eun-Kyung
Ho, Yo-Sung
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2797 - 2805
[2] Multi-view PointNet for 3D Scene Understanding
Jaritz, Maximilian
Gu, Jiayuan
Su, Hao
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3995 - 4003
[3] A COMPACT 3D REPRESENTATION FOR MULTI-VIEW VIDEO
Salvador, Jordi
Casas, Josep R.
[J]. INTERNATIONAL CONFERENCE ON 3D IMAGING 2011 (IC3D 2011), 2011,
[4] Multi-view video compression for 3D displays
Zwicker, Matthias
Yea, Sehoon
Vetro, Anthony
Forlines, Clifton
Matusik, Wojciech
Pfister, Hanspeter
[J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1506 - +
[5] Sequential selection and calibration of video frames for 3D outdoor scene reconstruction
Sun, Weilin
Li, Manyi
Li, Peng
Cao, Xiao
Meng, Xiangxu
Meng, Lei
[J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024,
[6] Neural 3D Video Synthesis from Multi-view Video
Li, Tianye
Slavcheva, Mira
Zollhoefer, Michael
Green, Simon
Lassner, Christoph
Kim, Changil
Schmidt, Tanner
Lovegrove, Steven
Goesele, Michael
Newcombe, Richard
Lv, Zhaoyang
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5511 - 5521
[7] Virtual View Adaptation for 3D Multi-View Video Streaming
Petrovic, Goran
Do, Luat
Zinger, Sveta
de With, Peter H. N.
[J]. STEREOSCOPIC DISPLAYS AND APPLICATIONS XXI, 2010, 7524
[8] A Multi-view Projection 3D Video Processing Technology
Dou, Yuchao
Dai, Taotao
Fu, Lei
Huang, Ziqiang
[J]. 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 1162 - 1165
[9] MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER
Zhong, Yijie
Sun, Zhengxing
Sun, Yunhan
Luo, Shoutong
Wang, Yi
Zhang, Wei
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1661 - 1665
[10] Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception
Wang, Li
Li, Ruifeng
Sun, Jingwen
Liu, Xingxing
Zhao, Lijun
Seah, Hock Soon
Quah, Chee Kwang
Tandianus, Budianto
[J]. SENSORS, 2019, 19 (19)
