Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field

被引:0
|
作者
Li, Leheng [1 ,3 ]
Lian, Qing [2 ]
Wang, Luozhou [1 ]
Ma, Ningning [3 ]
Chen, Ying-Cong [1 ,2 ]
机构
[1] HKUST GZ, Hong Kong, Peoples R China
[2] HKUST, Hong Kong, Peoples R China
[3] NIO Autonomous Driving, Shanghai, Peoples R China
关键词
VISION;
D O I
10.1109/CVPR52729.2023.00040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work explores the use of 3D generative models to synthesize training data for 3D vision tasks. The key requirements of the generative models are that the generated data should be photorealistic to match the real-world scenarios, and the corresponding 3D attributes should be aligned with given sampling labels. However, we find that the recent NeRF-based 3D GANs hardly meet the above requirements due to their designed generation pipeline and the lack of explicit 3D supervision. In this work, we propose Lift3D, an inverted 2D-to-3D generation framework to achieve the data generation objectives. Lift3D has several merits compared to prior methods: (1) Unlike previous 3D GANs that the output resolution is fixed after training, Lift3D can generalize to any camera intrinsic with higher resolution and photorealistic output. (2) By lifting well-disentangled 2D GAN to 3D object NeRF, Lift3D provides explicit 3D information of generated objects, thus offering accurate 3D annotations for downstream tasks. We evaluate the effectiveness of our framework by augmenting autonomous driving datasets. Experimental results demonstrate that our data generation framework can effectively improve the performance of 3D object detectors. Code: len-li.github.io/lift3d-web
引用
下载
收藏
页码:332 / 341
页数:10
相关论文
共 50 条
  • [21] Progressive Learning of 3D Reconstruction Network From 2D GAN Data
    Dundar, Aysegul
    Gao, Jun
    Tao, Andrew
    Catanzaro, Bryan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 793 - 804
  • [23] A Unified 3D Mapping Framework Using a 3D or 2D LiDAR
    Zhen, Weikun
    Scherer, Sebastian
    PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON EXPERIMENTAL ROBOTICS, 2020, 11 : 702 - 711
  • [24] Converting 2D Video to 3D: An Efficient Path to a 3D Experience
    Cao, Xun
    Bovik, Alan C.
    Wang, Yao
    Dai, Qionghai
    IEEE MULTIMEDIA, 2011, 18 (04) : 12 - 17
  • [25] An analysis of the 2D demultiple and the 3D demultiple for a 3D complex model
    Ikelle, LT
    JOURNAL OF SEISMIC EXPLORATION, 2005, 13 (04): : 303 - 321
  • [26] A "LEARN 2D, APPLY 3D" METHOD FOR 3D DECONVOLUTION MICROSCOPY
    Soulez, Ferreol
    2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI), 2014, : 1075 - 1078
  • [27] Monocular 3D Face Reconstruction with Joint 2D and 3D Constraints
    Cui, Huili
    Yang, Jing
    Lai, Yu-Kun
    Li, Kun
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 129 - 141
  • [28] 3D versus 2D/3D shape descriptors:: A comparative study
    Zaharia, T
    Prêteux, F
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS III, 2004, 5298 : 47 - 58
  • [29] Simulations of 3D silicon radiation detector structures in 2D and 3D
    Kalliopuska, Juha
    Eranen, Simo
    Orava, Risto
    2005 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2005, : 803 - 807
  • [30] Directional BLU for full resolution field alternative auto-stereoscopic 3D/2D and 2D/3D LCDs
    Käläntär, K.K., 1600, Blackwell Publishing Ltd (45):