Sound field reconstruction using neural processes with dynamic kernels

Cited by: 0

Authors:

Zining Liang

Wen Zhang

Thushara D. Abhayapala

Affiliations:

[1] Northwestern Polytechnical University, Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology

[2] The Australian National University, Audio and Acoustic Signal Processing Group, College of Engineering and Computer Science

Source:

EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2024

Keywords:

Sound field reconstruction; Gaussian processes; Kernels; Neural processes

DOI:

Not available

Abstract:

Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. Recent studies have placed notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods using Gaussian processes (GPs) with a covariance function to model spatial correlations have been proposed. However, current methods rely on pre-defined kernels for modeling, requiring the manual identification of optimal kernels and their parameters for different sound fields. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method has the advantage of dynamically learning kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that our proposed approach outperforms current methods in reconstruction accuracy, providing a promising alternative for sound field reconstruction.
