Towards real-time photorealistic 3D holography with deep neural networks

被引:376
|
作者
Shi, Liang [1 ,2 ]
Li, Beichen [1 ,2 ]
Kim, Changil [1 ,2 ]
Kellnhofer, Petr [1 ,2 ]
Matusik, Wojciech [1 ,2 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Elect Engn & Comp Sci Dept, Cambridge, MA 02139 USA
关键词
COMPUTER-GENERATED HOLOGRAMS; ALGORITHM; DISPLAY; FIELD;
D O I
10.1038/s41586-020-03152-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The ability to present three-dimensional (3D) scenes with continuous depth sensation has a profound impact on virtual and augmented reality, human-computer interaction, education and training. Computer-generated holography (CGH) enables high-spatio-angular-resolution 3D projection via numerical simulation of diffraction and interference(1). Yet, existing physically based methods fail to produce holograms with both per-pixel focal control and accurate occlusion(2,3). The computationally taxing Fresnel diffraction simulation further places an explicit trade-off between image quality and runtime, making dynamic holography impractical(4). Here we demonstrate a deep-learning-based CGH pipeline capable of synthesizing a photorealistic colour 3D hologram from a single RGB-depth image in real time. Our convolutional neural network (CNN) is extremely memory efficient (below 620 kilobytes) and runs at 60 hertz for a resolution of 1,920 x 1,080 pixels on a single consumer-grade graphics processing unit. Leveraging low-power on-device artificial intelligence acceleration chips, our CNN also runs interactively on mobile (iPhone 11 Pro at 1.1 hertz) and edge (Google Edge TPU at 2.0 hertz) devices, promising real-time performance in future-generation virtual and augmented-reality mobile headsets. We enable this pipeline by introducing a large-scale CGH dataset (MIT-CGH-4K) with 4,000 pairs of RGB-depth images and corresponding 3D holograms. Our CNN is trained with differentiable wave-based loss functions(5) and physically approximates Fresnel diffraction. With an anti-aliasing phase-only encoding method, we experimentally demonstrate speckle-free, natural-looking, high-resolution 3D holograms. Our learning-based approach and the Fresnel hologram dataset will help to unlock the full potential of holography and enable applications in metasurface design(6,7), optical and acoustic tweezer-based microscopic manipulation(8-10), holographic microscopy(11) and single-exposure volumetric 3D printing(12,13).
引用
收藏
页码:234 / +
页数:20
相关论文
共 50 条
  • [1] Towards real-time photorealistic 3D holography with deep neural networks
    Liang Shi
    Beichen Li
    Changil Kim
    Petr Kellnhofer
    Wojciech Matusik
    Nature, 2021, 591 : 234 - 239
  • [2] Author Correction: Towards real-time photorealistic 3D holography with deep neural networks
    Liang Shi
    Beichen Li
    Changil Kim
    Petr Kellnhofer
    Wojciech Matusik
    Nature, 2021, 593 : E13 - E13
  • [3] Towards real-time photorealistic 3D holography with deep neural networks (vol 591, pg 234, 2021)
    Shi, Liang
    Li, Beichen
    Kim, Changil
    Kellnhofer, Petr
    Matusik, Wojciech
    NATURE, 2021, 593 (7858) : E13 - E13
  • [4] Towards real-time EPID-based 3D in vivo dosimetry for IMRT with Deep Neural Networks: A feasibility study
    Martins, Juliana Cristina
    Maier, Joscha
    Gianoli, Chiara
    Neppl, Sebastian
    Dedes, George
    Alhazmi, Abdulaziz
    Veloza, Stella
    Reiner, Michael
    Belka, Claus
    Kachelriess, Marc
    Parodi, Katia
    PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2023, 114
  • [5] Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) : 956 - 970
  • [6] A 3D Convolutional Neural Network Towards Real-time Amodal 3D Object Detection
    Sun, Hao
    Meng, Zehui
    Du, Xinxin
    Ang, Marcelo H., Jr.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8331 - 8338
  • [7] Towards Real-Time Drone Detection Using Deep Neural Networks
    Pulido, Cristhiam
    Ceron, Alexander
    DEVELOPMENTS AND ADVANCES IN DEFENSE AND SECURITY, MICRADS 2021, 2022, 255 : 149 - 159
  • [8] Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
    Hedegaard, Lukas
    Iosifidis, Alexandros
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 369 - 385
  • [9] Towards Real-time Speech Emotion Recognition using Deep Neural Networks
    Fayek, H. M.
    Lech, M.
    Cavedon, L.
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
  • [10] 3DFCNN: real-time action recognition using 3D deep neural networks with raw depth information
    Sanchez-Caballero, Adrian
    de Lopez-Diz, Sergio
    Fuentes-Jimenez, David
    Losada-Gutierrez, Cristina
    Marron-Romera, Marta
    Casillas-Perez, David
    Sarker, Mohammad Ibrahim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24119 - 24143