Sound field reconstruction using neural processes with dynamic kernels

被引：0

作者：

Zining Liang

Wen Zhang

Thushara D. Abhayapala

机构：

[1] Northwestern Polytechnical University,Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology

[2] The Australian National University,Audio and Acoustic Signal Processing Group, College of Engineering and Computer Science

来源：

EURASIP Journal on Audio, Speech, and Music Processing | / 2024卷

关键词：

Sound field reconstruction; Gaussian processes; Kernels; Neural processes;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. In recent studies, there has been a notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods using Gaussian processes (GPs) with a covariance function to model spatial correlations have been proposed. However, the current methods rely on pre-defined kernels for modeling, requiring the manual identification of optimal kernels and their parameters for different sound fields. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method has the advantage of dynamically learning kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that our proposed approach outperforms current methods in reconstructing accuracy, providing a promising alternative for sound field reconstruction.

引用

共 50 条

[11] Striking the right balance: Three-dimensional ocean sound speed field reconstruction using tensor neural networks
Li, Siyuan
Cheng, Lei
Zhang, Ting
Zhao, Hangfang
Li, Jianlong
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (02): : 1106 - 1123
[12] Physics-informed neural network for volumetric sound field reconstruction of speech signals
Olivieri, Marco
Karakonstantis, Xenofon
Pezzoli, Mirco
Antonacci, Fabio
Sarti, Augusto
Fernandez-Grande, Efren
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
[13] An intelligent dynamic reconstruction filter for audio signal reconstruction using neural networks
Najafi, HL
Moses, DW
Hustig, CH
Kinne, J
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 1998, 11 (01) : 49 - 53
[14] ROBUST ROOM EQUALIZATION USING SPARSE SOUND-FIELD RECONSTRUCTION
Mazur, Radoslaw
Katzberg, Fabrice
Mertins, Alfred
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4230 - 4234
[15] Sound Field Reconstruction Technology Using a Three Dimensional Loudspeaker Array
Seo, Jeongil
Kano, Kyeongok
Fazi, Filippo M.
Nelson, Philip A.
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (08): : 723 - 731
[16] Investigation on holographic reconstruction of sound field using wave superposition approach
Yu, F
Chen, XZ
Li, WB
Chen, J
ACTA PHYSICA SINICA, 2004, 53 (08) : 2607 - 2613
[17] Generative models for sound field reconstruction
Fernandez-Grande, Efren
Karakonstantis, Xenofon
Caviedes-Nozal, Diego
Gerstoft, Peter
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (02): : 1179 - 1190
[18] Reconstruction of sound pressure field by IFEM
Anderssohn, R.
Marburg, St.
Hardtke, H. -J.
Grossmann, Chr.
THEORETICAL AND COMPUTATIONAL ACOUSTICS 2005, 2006, : 1 - +
[19] Towards Gridless Sound Field Reconstruction
van der Werf, Ids
Martinez-Nuevo, Pablo
Moller, Martin
Hendriks, Richard
Martinez, Jorge
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 862 - 866
[20] Dynamic Fluid Surface Reconstruction Using Deep Neural Network
Thapa, Simron
Li, Nianyi
Ye, Jinwei
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 21 - 30

← 1 2 3 4 5 →