Towards real-time photorealistic 3D holography with deep neural networks

被引:376
|
作者
Shi, Liang [1 ,2 ]
Li, Beichen [1 ,2 ]
Kim, Changil [1 ,2 ]
Kellnhofer, Petr [1 ,2 ]
Matusik, Wojciech [1 ,2 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Elect Engn & Comp Sci Dept, Cambridge, MA 02139 USA
关键词
COMPUTER-GENERATED HOLOGRAMS; ALGORITHM; DISPLAY; FIELD;
D O I
10.1038/s41586-020-03152-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The ability to present three-dimensional (3D) scenes with continuous depth sensation has a profound impact on virtual and augmented reality, human-computer interaction, education and training. Computer-generated holography (CGH) enables high-spatio-angular-resolution 3D projection via numerical simulation of diffraction and interference(1). Yet, existing physically based methods fail to produce holograms with both per-pixel focal control and accurate occlusion(2,3). The computationally taxing Fresnel diffraction simulation further places an explicit trade-off between image quality and runtime, making dynamic holography impractical(4). Here we demonstrate a deep-learning-based CGH pipeline capable of synthesizing a photorealistic colour 3D hologram from a single RGB-depth image in real time. Our convolutional neural network (CNN) is extremely memory efficient (below 620 kilobytes) and runs at 60 hertz for a resolution of 1,920 x 1,080 pixels on a single consumer-grade graphics processing unit. Leveraging low-power on-device artificial intelligence acceleration chips, our CNN also runs interactively on mobile (iPhone 11 Pro at 1.1 hertz) and edge (Google Edge TPU at 2.0 hertz) devices, promising real-time performance in future-generation virtual and augmented-reality mobile headsets. We enable this pipeline by introducing a large-scale CGH dataset (MIT-CGH-4K) with 4,000 pairs of RGB-depth images and corresponding 3D holograms. Our CNN is trained with differentiable wave-based loss functions(5) and physically approximates Fresnel diffraction. With an anti-aliasing phase-only encoding method, we experimentally demonstrate speckle-free, natural-looking, high-resolution 3D holograms. Our learning-based approach and the Fresnel hologram dataset will help to unlock the full potential of holography and enable applications in metasurface design(6,7), optical and acoustic tweezer-based microscopic manipulation(8-10), holographic microscopy(11) and single-exposure volumetric 3D printing(12,13).
引用
收藏
页码:234 / +
页数:20
相关论文
共 50 条
  • [21] Real-Time Detection of Events in Soccer Videos using 3D Convolutional Neural Networks
    Rongved, Olav A. Norgard
    Hicks, Steven A.
    Thambawita, Vajira
    Stensland, Hakon K.
    Zouganeli, Evi
    Johansen, Dag
    Riegler, Michael A.
    Halvorsen, Pal
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 135 - 144
  • [22] RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
    Niu, Wei
    Sun, Mengshu
    Li, Zhengang
    Chen, Jou-An
    Guan, Jiexiong
    Shen, Xipeng
    Wang, Yanzhi
    Liu, Sijia
    Lin, Xue
    Ren, Bin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9179 - 9187
  • [23] Real-time photorealistic 3D map building using mobile multiple stereo cameras setup
    Netramai, C.
    Roth, H.
    2007 3DTV CONFERENCE, 2007, : 305 - 309
  • [24] Large holographic 3D display for real-time computer-generated holography
    Haeussler, R.
    Leister, N.
    Stolle, H.
    DIGITAL OPTICAL TECHNOLOGIES 2017, 2017, 10335
  • [25] Towards Real-time Underwater 3D Reconstruction with Plenoptic Cameras
    Skinner, Katherine A.
    Johnson-Roberson, Matthew
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2014 - 2021
  • [26] Real-time 3D Scene Layout from a Single Image Using Convolutional Neural Networks
    Yang, Shichao
    Maturana, Daniel
    Scherer, Sebastian
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 2183 - 2189
  • [27] Real-time significant wave height estimation from raw ocean images based on 2D and 3D deep neural networks
    Choi, Heejeong
    Park, Minsik
    Son, Gyubin
    Jeong, Jaeyun
    Park, Jaesun
    Mo, Kyounghyun
    Kang, Pilsung
    OCEAN ENGINEERING, 2020, 201
  • [28] TR3D: TOWARDS REAL-TIME INDOOR 3D OBJECT DETECTION
    Rukhovich, Danila
    Vorontsova, Anna
    Konushin, Anton
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 281 - 285
  • [29] Deep D-Bar: Real-Time Electrical Impedance Tomography Imaging With Deep Neural Networks
    Hamilton, S. J.
    Hauptmann, A.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (10) : 2367 - 2377
  • [30] A real-time 3D multi-view rendering from a real-time 3D capture
    1600, Blackwell Publishing Ltd (44):