Scene-Constrained Neural Radiance Fields for High-Quality Sports Scene Rendering Based on Visual Sensor Network

Cited by: 0
Authors
Dai, Yanran [1]
Li, Jing [1]
Zhang, Yong [2, 3]
Jiang, Yuqi [1]
Qin, Haidong [2, 3]
Zhou, Xiaoshi [2, 3]
Zhang, Yidan [1]
Yang, Tao [2, 3]
Affiliations
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Xian 710129, Peoples R China
[3] Northwestern Polytech Univ, SAIIP, Xian 710129, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Depth estimation; large-scale scenes; neural radiance field (NeRF); weak texture; MULTIVIEW STEREO; 3D GAUSSIANS; SCALE
DOI
10.1109/JSEN.2024.3452436
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
Free-viewpoint videos offer audiences a more immersive and liberated way to watch sports. The rendering of sports scenes encompasses two essential elements: dynamic targets and static scenes. While much current research focuses on achieving high-quality rendering of human bodies, rendering large-scale sports scenes presents various challenges. Sports arenas are characterized by large spatial extents, restricted camera placement, uncontrollable lighting, weak textures, and repetitive patterns, all of which pose significant obstacles to achieving high-quality scene rendering. In this work, we propose a neural radiance field rendering method based on scene-prior geometric constraints. We introduce prior 3-D geometric dimensions and 2-D semantic masks to derive high-precision ground plane depth maps from camera imaging parameters. This is a geometry-based method that does not rely on visual features, and thus it is unaffected by insufficient textures, repetition, and reflections. Subsequently, we apply ground depth maps as geometric consistency constraints to optimize the neural rendering network, thereby reducing the impact of color inconsistencies across viewpoints. The visual sensor network we build can synchronously capture static fields and dynamic targets in sports scenes. Based on the visual sensor network, we collected multiviewpoint datasets of large-scale sports scenes at Invengo and Xidian Gymnasium for performance evaluation. Experimental results demonstrate that our method can generate high-precision and cross-viewpoint scale-consistent depth constraints and helps reduce holes and artifacts in synthesized views. Our method outperforms the state of the art (SOTA) in novel view rendering of challenging large-scale sports scenes.
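The abstract describes deriving ground plane depth from camera parameters and a semantic mask instead of from visual features. A minimal sketch of that general idea follows, as plain ray-plane intersection. It is not the authors' implementation. It assumes a pinhole camera, a flat ground at z = 0 in the world frame, and world-to-camera extrinsics of the form X_cam = R X_world + t. The function name `ground_plane_depth` and all parameters are hypothetical.

```python
import numpy as np

def ground_plane_depth(K, R, t, mask, plane_n=(0.0, 0.0, 1.0), plane_d=0.0):
    """Camera-frame z-depth of the plane n.X + d = 0 at masked pixels.

    Assumptions (not from the paper): pinhole intrinsics K (3x3),
    extrinsics with X_cam = R @ X_world + t, boolean ground mask (HxW).
    Pixels outside the mask or with no valid intersection get NaN.
    """
    h, w = mask.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)
    # Back-project pixels to camera-frame rays with unit z component.
    rays = pix @ np.linalg.inv(K).T
    # Express the world plane in the camera frame.
    n_cam = R @ np.asarray(plane_n, dtype=np.float64)
    d_cam = plane_d - n_cam @ t
    with np.errstate(divide="ignore", invalid="ignore"):
        # Ray X_cam = s * ray meets the plane at s = -d_cam / (n_cam . ray);
        # because ray z is 1, s is the z-depth.
        s = -d_cam / (rays @ n_cam)
        valid = mask & np.isfinite(s) & (s > 0)
    return np.where(valid, s, np.nan)

# Usage: a camera 10 units above the ground, looking straight down.
K = np.array([[100.0, 0.0, 2.0], [0.0, 100.0, 2.0], [0.0, 0.0, 1.0]])
R = np.diag([1.0, -1.0, -1.0])
t = np.array([0.0, 0.0, 10.0])
mask = np.ones((4, 4), dtype=bool)
mask[0, 0] = False  # pretend this pixel is not ground
depth = ground_plane_depth(K, R, t, mask)
```

Because the depth comes only from geometry, it does not depend on texture. In a setup like the one the abstract outlines, a map of this kind could serve as a per-pixel depth target for ground pixels during training.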
Pages: 35900-35913
Page count: 14