Modeling individual head-related transfer functions from sparse measurements using a convolutional neural network

被引:6
|
作者
Jiang, Ziran [1 ,3 ]
Sang, Jinqiu [2 ]
Zheng, Chengshi [1 ,3 ]
Li, Andong [1 ,3 ]
Li, Xiaodong [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai 200062, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
来源
关键词
INTERPOLATION; RESOLUTION; HRTFS; LOCALIZATION;
D O I
10.1121/10.0016854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Individual head-related transfer functions (HRTFs) are usually measured with high spatial resolution or modeled with anthropometric parameters. This study proposed an HRTF individualization method using only spatially sparse measurements using a convolutional neural network (CNN). The HRTFs were represented by two-dimensional images, in which the horizontal and vertical ordinates indicated direction and frequency, respectively. The CNN was trained by using the HRTF images measured at specific sparse directions as input and using the corresponding images with a high spatial resolution as output in a prior HRTF database. The HRTFs of a new subject can be recovered by the trained CNN with the sparsely measured HRTFs. Objective experiments showed that, when using 23 directions to recover individual HRTFs at 1250 directions, the spectral distortion (SD) is around 4.4 dB; when using 105 directions, the SD reduced to around 3.8 dB. Subjective experiments showed that the individualized HRTFs recovered from 105 directions had smaller discrimination proportion than the baseline method and were perceptually undistinguishable in many directions. This method combines the spectral and spatial characteristics of HRTF for individualization, which has potential for improving virtual reality experience. (c) 2023 Acoustical Society of America.
引用
收藏
页码:248 / 259
页数:12
相关论文
共 50 条
  • [21] Head-Related Transfer Function Selection Using Neural Networks
    Yao, Shu-Nung
    Collins, Tim
    Liang, Chaoyun
    ARCHIVES OF ACOUSTICS, 2017, 42 (03) : 365 - 373
  • [22] Efficient modelling of head-related transfer functions
    Hartung, K
    Raab, A
    ACUSTICA, 1996, 82 : S88 - S88
  • [23] Measurement of Head-Related Transfer Functions: A Review
    Li, Song
    Peissig, Juergen
    APPLIED SCIENCES-BASEL, 2020, 10 (14):
  • [24] A bottle model for head-related transfer functions
    Ferguson, BS
    Bogner, RE
    Wawryk, S
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3533 - 3536
  • [25] On the generalization of accommodation to head-related transfer functions
    Meyer, Julie
    Picinali, Lorenzo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2025, 157 (01): : 420 - 432
  • [26] On the spatial symmetry of head-related transfer functions
    Zhong, Xiao-Li
    Zhang, Feng-Chun
    Xie, Bo-Sun
    APPLIED ACOUSTICS, 2013, 74 (06) : 856 - 864
  • [27] Estimating head-related transfer functions of human subjects from pressure-velocity measurements
    Hiipakka, Marko
    Kinnari, Teemu
    Pulkki, Ville
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : 4051 - 4061
  • [28] Low-order modeling of head-related transfer functions using balanced model truncation
    Mackenzie, J
    Huopaniemi, J
    Valimaki, V
    Kale, I
    IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (02) : 39 - 41
  • [29] Wavelet analysis of head-related transfer functions
    Lo, TF
    Wu, LY
    Chan, FHY
    Lam, FK
    Chan, JCK
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 20, PTS 1-6: BIOMEDICAL ENGINEERING TOWARDS THE YEAR 2000 AND BEYOND, 1998, 20 : 1549 - 1552
  • [30] Head-related transfer functions of the Rhesus monkey
    Spezio, ML
    Keller, CH
    Marrocco, RT
    Takahashi, TT
    HEARING RESEARCH, 2000, 144 (1-2) : 73 - 88