Sound field reconstruction using neural processes with dynamic kernels

被引:0
|
作者
Zining Liang
Wen Zhang
Thushara D. Abhayapala
机构
[1] Northwestern Polytechnical University,Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology
[2] The Australian National University,Audio and Acoustic Signal Processing Group, College of Engineering and Computer Science
关键词
Sound field reconstruction; Gaussian processes; Kernels; Neural processes;
D O I
暂无
中图分类号
学科分类号
摘要
Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. In recent studies, there has been a notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods using Gaussian processes (GPs) with a covariance function to model spatial correlations have been proposed. However, the current methods rely on pre-defined kernels for modeling, requiring the manual identification of optimal kernels and their parameters for different sound fields. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method has the advantage of dynamically learning kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that our proposed approach outperforms current methods in reconstructing accuracy, providing a promising alternative for sound field reconstruction.
引用
收藏
相关论文
共 50 条
  • [11] Striking the right balance: Three-dimensional ocean sound speed field reconstruction using tensor neural networks
    Li, Siyuan
    Cheng, Lei
    Zhang, Ting
    Zhao, Hangfang
    Li, Jianlong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (02): : 1106 - 1123
  • [12] Physics-informed neural network for volumetric sound field reconstruction of speech signals
    Olivieri, Marco
    Karakonstantis, Xenofon
    Pezzoli, Mirco
    Antonacci, Fabio
    Sarti, Augusto
    Fernandez-Grande, Efren
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [13] An intelligent dynamic reconstruction filter for audio signal reconstruction using neural networks
    Najafi, HL
    Moses, DW
    Hustig, CH
    Kinne, J
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 1998, 11 (01) : 49 - 53
  • [14] ROBUST ROOM EQUALIZATION USING SPARSE SOUND-FIELD RECONSTRUCTION
    Mazur, Radoslaw
    Katzberg, Fabrice
    Mertins, Alfred
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4230 - 4234
  • [15] Sound Field Reconstruction Technology Using a Three Dimensional Loudspeaker Array
    Seo, Jeongil
    Kano, Kyeongok
    Fazi, Filippo M.
    Nelson, Philip A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (08): : 723 - 731
  • [16] Investigation on holographic reconstruction of sound field using wave superposition approach
    Yu, F
    Chen, XZ
    Li, WB
    Chen, J
    ACTA PHYSICA SINICA, 2004, 53 (08) : 2607 - 2613
  • [17] Generative models for sound field reconstruction
    Fernandez-Grande, Efren
    Karakonstantis, Xenofon
    Caviedes-Nozal, Diego
    Gerstoft, Peter
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (02): : 1179 - 1190
  • [18] Reconstruction of sound pressure field by IFEM
    Anderssohn, R.
    Marburg, St.
    Hardtke, H. -J.
    Grossmann, Chr.
    THEORETICAL AND COMPUTATIONAL ACOUSTICS 2005, 2006, : 1 - +
  • [19] Towards Gridless Sound Field Reconstruction
    van der Werf, Ids
    Martinez-Nuevo, Pablo
    Moller, Martin
    Hendriks, Richard
    Martinez, Jorge
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 862 - 866
  • [20] Dynamic Fluid Surface Reconstruction Using Deep Neural Network
    Thapa, Simron
    Li, Nianyi
    Ye, Jinwei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 21 - 30