LiveNVS: Neural View Synthesis on Live RGB-D Streams

被引:1
|
作者
Fink, Laura [1 ,2 ]
Rueckert, Darius [1 ,3 ]
Franke, Linus [1 ]
Keinert, Joachim [2 ]
Stamminger, Marc [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
[2] Fraunhofer IIS, Erlangen, Germany
[3] Voxray GmbH, Erlangen, Germany
关键词
Novel viewsynthesis; Neural rendering; Live preview; RGB-D Stream;
D O I
10.1145/3610548.3618213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing real-time RGB-D reconstruction approaches, like Kinect Fusion, lack real-time photo-realistic visualization. This is due to noisy, oversmoothed or incomplete geometry and blurry textures which are fused from imperfect depth maps and camera poses. Recent neural rendering methods can overcome many of such artifacts but are mostly optimized for offline usage, hindering the integration into a live reconstruction pipeline. In this paper, we present LiveNVS, a system that allows for neural novel view synthesis on a live RGB-D input stream with very low latency and real-time rendering. Based on the RGB-D input stream, novel views are rendered by projecting neural features into the target view via a densely fused depth map and aggregating the features in image-space to a target feature map. A generalizable neural network then translates the target feature map into a high-quality RGB image. LiveNVS achieves state-of-the-art neural rendering quality of unknown scenes during capturing, allowing users to virtually explore the scene and assess reconstruction quality in real-time.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Live Weight Prediction of Cattle Based on Deep Regression of RGB-D Images
    Ruchay, Alexey
    Kober, Vitaly
    Dorofeev, Konstantin
    Kolpakov, Vladimir
    Gladkov, Alexey
    Guo, Hao
    AGRICULTURE-BASEL, 2022, 12 (11):
  • [32] 3D Background Modeling in Multi-view RGB-D Video
    Huang, Yung-Lin
    Wei, Ku-Chu
    Chien, Shao-Yi
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1051 - 1054
  • [33] Single View 3D Reconstruction Based on Improved RGB-D Image
    Cao, Mingwei
    Zheng, Liping
    Liu, Xiaoping
    IEEE SENSORS JOURNAL, 2020, 20 (20) : 12049 - 12056
  • [34] Pose Depth Volume extraction from RGB-D streams for frontal gait recognition
    Chattopadhyay, Pratik
    Roy, Aditi
    Sural, Shamik
    Mukhopadhyay, Jayanta
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (01) : 53 - 63
  • [35] Multimodal Neural Networks: RGB-D for Semantic Segmentation and Object Detection
    Schneider, Lukas
    Jasch, Manuel
    Froehlich, Bjoern
    Weber, Thomas
    Franke, Uwe
    Pollefeys, Marc
    Raetsch, Matthias
    IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 98 - 109
  • [36] RGB-D SLAM综述
    王旒军
    陈家斌
    余欢
    朱汇申
    导航定位与授时, 2017, 4 (06) : 9 - 18
  • [37] A Real-Time Scene Recognition System Based on RGB-D Video Streams
    Hua, Yuyun
    Zhang, Sixian
    Song, Xinhang
    Li, Jia'ning
    Jiang, Shuqiang
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 496 - 498
  • [38] A survey on RGB-D datasets
    Lopes, Alexandre
    Souza, Roberto
    Pedrini, Helio
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [39] RGB-D static gesture recognition based on convolutional neural network
    Xie, Bin
    He, Xiaoyu
    Li, Yi
    JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1515 - 1520
  • [40] RGB-D CAMERA POSE ESTIMATION USING DEEP NEURAL NETWORK
    Guo, Fei
    He, Yifeng
    Guan, Ling
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 408 - 412