Modeling individual head-related transfer functions from sparse measurements using a convolutional neural network

被引:6
|
作者
Jiang, Ziran [1 ,3 ]
Sang, Jinqiu [2 ]
Zheng, Chengshi [1 ,3 ]
Li, Andong [1 ,3 ]
Li, Xiaodong [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai 200062, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
来源
关键词
INTERPOLATION; RESOLUTION; HRTFS; LOCALIZATION;
D O I
10.1121/10.0016854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Individual head-related transfer functions (HRTFs) are usually measured with high spatial resolution or modeled with anthropometric parameters. This study proposed an HRTF individualization method using only spatially sparse measurements using a convolutional neural network (CNN). The HRTFs were represented by two-dimensional images, in which the horizontal and vertical ordinates indicated direction and frequency, respectively. The CNN was trained by using the HRTF images measured at specific sparse directions as input and using the corresponding images with a high spatial resolution as output in a prior HRTF database. The HRTFs of a new subject can be recovered by the trained CNN with the sparsely measured HRTFs. Objective experiments showed that, when using 23 directions to recover individual HRTFs at 1250 directions, the spectral distortion (SD) is around 4.4 dB; when using 105 directions, the SD reduced to around 3.8 dB. Subjective experiments showed that the individualized HRTFs recovered from 105 directions had smaller discrimination proportion than the baseline method and were perceptually undistinguishable in many directions. This method combines the spectral and spatial characteristics of HRTF for individualization, which has potential for improving virtual reality experience. (c) 2023 Acoustical Society of America.
引用
收藏
页码:248 / 259
页数:12
相关论文
共 50 条
  • [1] Recovery of individual head-related transfer functions from a small set of measurements
    Xie, Bo-Sun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (01): : 282 - 294
  • [2] Modeling of Individual Head-Related Transfer Functions (HRTFs) Based on Spatiotemporal and Anthropometric Features Using Deep Neural Networks
    Alotaibi, Saif S.
    Wickert, Mark
    IEEE ACCESS, 2024, 12 : 14620 - 14635
  • [3] APPLIED METHODS FOR SPARSE SAMPLING OF HEAD-RELATED TRANSFER FUNCTIONS
    Arbel, Lior
    Ben-Hur, Zamir
    Alon, David Lou
    Rafaely, Boaz
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 446 - 450
  • [4] Modeling head-related transfer functions with spherical wavelets
    Hu, Shichao
    Trevino, Jorge
    Salvador, Cesar
    Sakamoto, Shuichi
    Suzuki, Yoiti
    APPLIED ACOUSTICS, 2019, 146 : 81 - 88
  • [5] The perceptual relevance of individual differences in head-related transfer functions
    Wightman, FL
    Kistler, DJ
    ACUSTICA, 1996, 82 : S92 - S92
  • [6] Identification of Anthropometric Measurements for Individualization of Head-Related Transfer Functions
    Xu, Song
    Li, Zhizhong
    Salvendy, Gavriel
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2009, 95 (01) : 168 - 177
  • [7] Numerical modeling of head-related transfer functions using the boundary source representation
    Bai, Mingsian R.
    Tsao, Teng-Chieh
    JOURNAL OF VIBRATION AND ACOUSTICS-TRANSACTIONS OF THE ASME, 2006, 128 (05): : 594 - 603
  • [8] Modeling and Customization of Head-Related Transfer Functions using Principal Component Analysis
    Hwang, Sungmok
    Park, Youngjin
    Park, Youn-sik
    2008 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-4, 2008, : 205 - 209
  • [9] DISTANCE-DEPENDENT MODELING OF HEAD-RELATED TRANSFER FUNCTIONS
    Zhang, Mengfan
    Qiao, Yue
    Wu, Xihong
    Qu, Tianshu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 276 - 280
  • [10] Smoothing individual head-related transfer functions in the frequency and spatial domains
    Rasumow, Eugen
    Blau, Matthias
    Hansen, Martin
    van de Par, Steven
    Doclo, Simon
    Mellert, Volker
    Pueschel, Dirk
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (04): : 2012 - 2025