Scene-Constrained Neural Radiance Fields for High-Quality Sports Scene Rendering Based on Visual Sensor Network

Cited by: 0
Authors
Dai, Yanran [1]
Li, Jing [1]
Zhang, Yong [2,3]
Jiang, Yuqi [1]
Qin, Haidong [2,3]
Zhou, Xiaoshi [2,3]
Zhang, Yidan [1]
Yang, Tao [2,3]
Affiliations
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Xian 710129, Peoples R China
[3] Northwestern Polytech Univ, SAIIP, Xian 710129, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Depth estimation; large-scale scenes; neural radiance field (NeRF); weak texture; MULTIVIEW STEREO; 3D GAUSSIANS; SCALE
DOI
10.1109/JSEN.2024.3452436
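Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809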
Abstract
Free-viewpoint videos offer audiences a more immersive and less constrained way to watch sports. The rendering of sports scenes encompasses two essential elements: dynamic targets and static scenes. While much current research focuses on achieving high-quality rendering of human bodies, rendering large-scale sports scenes presents various challenges. Sports arenas are characterized by large spatial extents, restricted camera placement, uncontrollable lighting, weak textures, and repetitive patterns, all of which pose significant obstacles to achieving high-quality scene rendering. In this work, we propose a neural radiance field rendering method based on scene-prior geometric constraints. We introduce prior 3-D geometric dimensions and 2-D semantic masks to derive high-precision ground plane depth maps from camera imaging parameters. This is a geometry-based method that does not rely on visual features, and thus it is unaffected by weak textures, repetitive patterns, and reflections. Subsequently, we apply the ground depth maps as geometric consistency constraints to optimize the neural rendering network, thereby reducing the impact of color inconsistencies across viewpoints. The visual sensor network we build can synchronously capture static fields and dynamic targets in sports scenes. Based on the visual sensor network, we collected multiviewpoint datasets of large-scale sports scenes at Invengo and Xidian Gymnasium for performance evaluation. Experimental results demonstrate that our method can generate high-precision and cross-viewpoint scale-consistent depth constraints and helps reduce holes and artifacts in synthesized views. Our method outperforms the state of the art (SOTA) in novel view rendering of challenging large-scale sports scenes.
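The abstract's idea of deriving ground-plane depth purely from camera parameters and a 2-D mask can be illustrated with a ray-plane intersection. The sketch below is not the authors' implementation; it assumes an undistorted pinhole camera, a world frame in which the ground is the plane z = 0, and a world-to-camera convention x_cam = R x_world + t. The function name `ground_plane_depth` and all parameter names are hypothetical.

```python
import numpy as np

def ground_plane_depth(K, R, t, mask):
    """Per-pixel camera-frame z-depth of the plane z = 0 (illustrative only).

    K    : (3, 3) pinhole intrinsics (assumed, no lens distortion)
    R, t : world-to-camera pose, x_cam = R @ x_world + t (assumed convention)
    mask : (H, W) boolean ground mask, e.g. from a semantic segmentation
    """
    h, w = mask.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)

    # Back-project pixels to camera-frame rays whose z-component is 1,
    # so the ray parameter s equals the z-depth in the camera frame.
    d_cam = pix @ np.linalg.inv(K).T
    # Rotate rays into the world frame: (R^T d) written for row vectors.
    d_world = d_cam @ R
    # Camera center in world coordinates.
    center = -R.T @ t

    # Solve center_z + s * d_world_z = 0 for s.
    with np.errstate(divide="ignore", invalid="ignore"):
        s = -center[2] / d_world[..., 2]

    # Keep only masked pixels whose rays hit the plane in front of the camera.
    valid = mask & np.isfinite(s) & (s > 0)
    return np.where(valid, s, np.nan)

# Toy setup: a camera 5 units above the ground, looking straight down.
K = np.array([[100.0, 0.0, 2.0], [0.0, 100.0, 2.0], [0.0, 0.0, 1.0]])
R = np.diag([1.0, -1.0, -1.0])
t = np.array([0.0, 0.0, 5.0])
mask = np.ones((4, 4), dtype=bool)
mask[0, 0] = False  # pretend this pixel is not ground
depth = ground_plane_depth(K, R, t, mask)
```

Because the depth comes only from geometry, it does not depend on image texture; in a pipeline like the one described, such a map could serve as a supervision target for rendered depth on ground pixels.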
Pages: 35900-35913
Page count: 14