Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field

被引：0

作者：

Li, Leheng ^{[1
,3
]}

Lian, Qing ^{[2
]}

Wang, Luozhou ^{[1
]}

Ma, Ningning ^{[3
]}

Chen, Ying-Cong ^{[1
,2
]}

机构：

[1] HKUST GZ, Hong Kong, Peoples R China

[2] HKUST, Hong Kong, Peoples R China

[3] NIO Autonomous Driving, Shanghai, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

VISION;

D O I：

10.1109/CVPR52729.2023.00040

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work explores the use of 3D generative models to synthesize training data for 3D vision tasks. The key requirements of the generative models are that the generated data should be photorealistic to match the real-world scenarios, and the corresponding 3D attributes should be aligned with given sampling labels. However, we find that the recent NeRF-based 3D GANs hardly meet the above requirements due to their designed generation pipeline and the lack of explicit 3D supervision. In this work, we propose Lift3D, an inverted 2D-to-3D generation framework to achieve the data generation objectives. Lift3D has several merits compared to prior methods: (1) Unlike previous 3D GANs that the output resolution is fixed after training, Lift3D can generalize to any camera intrinsic with higher resolution and photorealistic output. (2) By lifting well-disentangled 2D GAN to 3D object NeRF, Lift3D provides explicit 3D information of generated objects, thus offering accurate 3D annotations for downstream tasks. We evaluate the effectiveness of our framework by augmenting autonomous driving datasets. Experimental results demonstrate that our data generation framework can effectively improve the performance of 3D object detectors. Code: len-li.github.io/lift3d-web

引用

下载

页码：332 / 341

页数：10

共 50 条

[21] Progressive Learning of 3D Reconstruction Network From 2D GAN Data
Dundar, Aysegul
Gao, Jun
Tao, Andrew
Catanzaro, Bryan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 793 - 804
[22] 2D,3D,4D
数学大王(中高年级), 2011, (09) : 2 - 5
[23] A Unified 3D Mapping Framework Using a 3D or 2D LiDAR
Zhen, Weikun
Scherer, Sebastian
PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON EXPERIMENTAL ROBOTICS, 2020, 11 : 702 - 711
[24] Converting 2D Video to 3D: An Efficient Path to a 3D Experience
Cao, Xun
Bovik, Alan C.
Wang, Yao
Dai, Qionghai
IEEE MULTIMEDIA, 2011, 18 (04) : 12 - 17
[25] An analysis of the 2D demultiple and the 3D demultiple for a 3D complex model
Ikelle, LT
JOURNAL OF SEISMIC EXPLORATION, 2005, 13 (04): : 303 - 321
[26] A "LEARN 2D, APPLY 3D" METHOD FOR 3D DECONVOLUTION MICROSCOPY
Soulez, Ferreol
2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI), 2014, : 1075 - 1078
[27] Monocular 3D Face Reconstruction with Joint 2D and 3D Constraints
Cui, Huili
Yang, Jing
Lai, Yu-Kun
Li, Kun
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 129 - 141
[28] 3D versus 2D/3D shape descriptors:: A comparative study
Zaharia, T
Prêteux, F
IMAGE PROCESSING: ALGORITHMS AND SYSTEMS III, 2004, 5298 : 47 - 58
[29] Simulations of 3D silicon radiation detector structures in 2D and 3D
Kalliopuska, Juha
Eranen, Simo
Orava, Risto
2005 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2005, : 803 - 807
[30] Directional BLU for full resolution field alternative auto-stereoscopic 3D/2D and 2D/3D LCDs
Käläntär, K.K., 1600, Blackwell Publishing Ltd (45):

← 1 2 3 4 5 →