Sound field reconstruction using neural processes with dynamic kernels

被引:0
|
作者
Zining Liang
Wen Zhang
Thushara D. Abhayapala
机构
[1] Northwestern Polytechnical University,Center of Intelligent Acoustics and Immersive Communications, School of Marine Science and Technology
[2] The Australian National University,Audio and Acoustic Signal Processing Group, College of Engineering and Computer Science
关键词
Sound field reconstruction; Gaussian processes; Kernels; Neural processes;
D O I
暂无
中图分类号
学科分类号
摘要
Accurately representing the sound field with high spatial resolution is crucial for immersive and interactive sound field reproduction technology. In recent studies, there has been a notable emphasis on efficiently estimating sound fields from a limited number of discrete observations. In particular, kernel-based methods using Gaussian processes (GPs) with a covariance function to model spatial correlations have been proposed. However, the current methods rely on pre-defined kernels for modeling, requiring the manual identification of optimal kernels and their parameters for different sound fields. In this work, we propose a novel approach that parameterizes GPs using a deep neural network based on neural processes (NPs) to reconstruct the magnitude of the sound field. This method has the advantage of dynamically learning kernels from data using an attention mechanism, allowing for greater flexibility and adaptability to the acoustic properties of the sound field. Numerical experiments demonstrate that our proposed approach outperforms current methods in reconstructing accuracy, providing a promising alternative for sound field reconstruction.
引用
收藏
相关论文
共 50 条
  • [21] Neural Fields as Learnable Kernels for 3D Reconstruction
    Williams, Francis
    Gojcic, Zan
    Khamis, Sameh
    Zorin, Denis
    Bruna, Joan
    Fidler, Sanja
    Litany, Or
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18479 - 18489
  • [22] Near field to far field transformation using neural networks and source reconstruction
    Ayestaran, R. G.
    Las-Heras, F.
    JOURNAL OF ELECTROMAGNETIC WAVES AND APPLICATIONS, 2006, 20 (15) : 2201 - 2213
  • [23] Sound Event Detection Using Multiple Optimized Kernels
    Xia, Xianjun
    Tognerie, Roberto
    Sohel, Ferdous
    Zhaoe, Yuanjun
    Huang, Defeng
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1745 - 1754
  • [24] Reconstruction of the sound field above a reflecting plane using the equivalent source method
    Bi, Chuan-Xing
    Jing, Wen-Qian
    Zhang, Yong-Bin
    Lin, Wang-Lin
    JOURNAL OF SOUND AND VIBRATION, 2017, 386 : 149 - 162
  • [25] Dynamic network selection using kernels
    van den Berg, Eric
    Gopalakrishnan, Praveen
    Kim, Byungsuk
    Lyles, Bryan
    Kim, Won-Ik
    Shin, Yeon Seung
    Kim, Yeong Jin
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 6049 - +
  • [26] Sound field reconstruction using compressed modal equivalent point source method
    Bi, Chuan-Xing
    Liu, Yuan
    Xu, Liang
    Zhang, Yong-Bin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (01): : 73 - 79
  • [27] Polar Codes Using Dynamic Kernels
    Ye, Min
    Barg, Alexander
    2015 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2015, : 231 - 235
  • [28] Sound Field Reconstruction Using Prolate Spheroidal Wave Functions and Sparse Regularization
    Zhang, Xuxin
    Lou, Jingjun
    Zhu, Shijian
    Lu, Jinfang
    Li, Ronghua
    SENSORS, 2023, 23 (19)
  • [29] Optically visualized sound field reconstruction using Kirchhoff-Helmholtz equation
    Yatabe, Kohei
    Oikawa, Yasuhiro
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2015, 36 (04) : 351 - 354
  • [30] Sound field reconstruction using compressed modal equivalent point source method
    Bi, Chuan-Xing (cxbi@hfut.edu.cn), 1600, Acoustical Society of America (141):