Scene-Constrained Neural Radiance Fields for High-Quality Sports Scene Rendering Based on Visual Sensor Network

被引：0

作者：

Dai, Yanran ^{[1
]}

Li, Jing ^{[1
]}

Zhang, Yong ^{[2
,3
]}

Jiang, Yuqi ^{[1
]}

Qin, Haidong ^{[2
,3
]}

Zhou, Xiaoshi ^{[2
,3
]}

Zhang, Yidan ^{[1
]}

Yang, Tao ^{[2
,3
]}

机构：

[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China

[2] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Xian 710129, Peoples R China

[3] Northwestern Polytech Univ, SAIIP, Xian 710129, Peoples R China

来源：

IEEE SENSORS JOURNAL | 2024年 / 24卷 / 21期

基金：

中国国家自然科学基金;

关键词：

Depth estimation; large-scale scenes; neural radiation field (NeRF); weak texture; MULTIVIEW STEREO; 3D GAUSSIANS; SCALE;

D O I：

10.1109/JSEN.2024.3452436

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Free-viewpoint videos offer audiences a more immersive and liberated way to watch sports. The rendering of sports scenes encompasses two essential elements: dynamic targets and static scenes. While much current research focuses on achieving high-quality rendering of human bodies, rendering large-scale sports scenes presents various challenges. Sports arenas are characterized by large spatial extents, restricted camera placement, uncontrollable lighting, weak textures, and repetitive patterns, all of which pose significant obstacles to achieving high-quality scene rendering. In this work, we propose a neural radiance field rendering method based on scene-prior geometric constraints. We introduce prior 3-D geometric dimensions and 2-D semantic masks, to derive high-precision ground plane depth maps from camera imaging parameters. This is a geometry-based method that does not rely on visual features, and thus, it is unaffected by insufficient textures, repetition, and reflections. Subsequently, we apply ground depth maps as geometric consistency constraints to optimize the neural rendering network, thereby reducing the impact of color inconsistencies across viewpoints. The visual sensor network we build can synchronously capture static fields and dynamic targets in sports scenes. Based on the visual sensor network, we collected multiviewpoint datasets of large-scale sports scenes at Invengo and Xidian Gymnasium for performance evaluation. Experimental results demonstrate that our method can generate high-precision and cross-viewpoint scale-consistent depth constraints and helps reduce holes and artifacts in synthesized views. Our method outperforms the state of the art (SOTA) for novel view rendering for challenging large-scale sports scenes.

引用

页码：35900 / 35913

页数：14

共 38 条

[31] A two-dimensional MoS2 array based on artificial neural network learning for high-quality imaging
Chen, Long
Chen, Siyuan
Wu, Jinchao
Chen, Luhua
Yang, Shuai
Chu, Jian
Jiang, Chengming
Bi, Sheng
Song, Jinhui
NANO RESEARCH, 2023, 16 (07) : 10139 - 10147
[32] StereoEngine: An FPGA-Based Accelerator for Real-Time High-Quality Stereo Estimation With Binary Neural Network
Chen, Gang
Ling, Yehua
He, Tao
Meng, Haitao
He, Shengyu
Zhang, Yu
Huang, Kai
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 4179 - 4190
[33] Rapid high-quality PET Patlak parametric image generation based on direct reconstruction and temporal nonlocal neural network
Xie, Nuobei
Gong, Kuang
Guo, Ning
Qin, Zhixing
Wu, Zhifang
Liu, Huafeng
Li, Quanzheng
NEUROIMAGE, 2021, 240
[34] High-quality steganography scheme using hybrid edge detector and Vernam algorithm based on hybrid fuzzy neural network
Dhawan, Sachin
Gupta, Rashmi
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (24):
[35] A two-dimensional MoS2 array based on artificial neural network learning for high-quality imaging
Long Chen
Siyuan Chen
Jinchao Wu
Luhua Chen
Shuai Yang
Jian Chu
Chengming Jiang
Sheng Bi
Jinhui Song
Nano Research, 2023, 16 : 10139 - 10147
[36] High-quality photoacoustic image reconstruction based on deep convolutional neural network: towards intra-operative photoacoustic imaging
Farnia, Parastoo
Mohammadi, Mohammad
Najafzadeh, Ebrahim
Alimohamadi, Maysam
Makkiabadi, Bahador
Ahmadian, Alireza
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2020, 6 (04)
[37] Direct Filter Learning From Iterative Reconstructed Images for High-Quality Analytical CBCT Reconstruction Using FDK-Based Neural Network
Qi, Hongliang
Long, Chao
Li, Hanwei
Huang, Shuang
Hu, Debin
Xu, Yuan
Chen, Hongwen
IEEE ACCESS, 2024, 12 : 121495 - 121506
[38] Comprehensive High-Quality Three-Dimensional Display System Based on a Simplified Light-Field Image Acquisition Method and a Full-Connected Deep Neural Network
Erdenebat, Munkh-Uchral
Amgalan, Tuvshinjargal
Khuderchuluun, Anar
Nam, Oh-Seung
Jeon, Seok-Hee
Kwon, Ki-Chul
Kim, Nam
SENSORS, 2023, 23 (14)

← 1 2 3 4 →