Modeling individual head-related transfer functions from sparse measurements using a convolutional neural network

被引:6
|
作者
Jiang, Ziran [1 ,3 ]
Sang, Jinqiu [2 ]
Zheng, Chengshi [1 ,3 ]
Li, Andong [1 ,3 ]
Li, Xiaodong [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai 200062, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
来源
关键词
INTERPOLATION; RESOLUTION; HRTFS; LOCALIZATION;
D O I
10.1121/10.0016854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Individual head-related transfer functions (HRTFs) are usually measured with high spatial resolution or modeled with anthropometric parameters. This study proposed an HRTF individualization method using only spatially sparse measurements using a convolutional neural network (CNN). The HRTFs were represented by two-dimensional images, in which the horizontal and vertical ordinates indicated direction and frequency, respectively. The CNN was trained by using the HRTF images measured at specific sparse directions as input and using the corresponding images with a high spatial resolution as output in a prior HRTF database. The HRTFs of a new subject can be recovered by the trained CNN with the sparsely measured HRTFs. Objective experiments showed that, when using 23 directions to recover individual HRTFs at 1250 directions, the spectral distortion (SD) is around 4.4 dB; when using 105 directions, the SD reduced to around 3.8 dB. Subjective experiments showed that the individualized HRTFs recovered from 105 directions had smaller discrimination proportion than the baseline method and were perceptually undistinguishable in many directions. This method combines the spectral and spatial characteristics of HRTF for individualization, which has potential for improving virtual reality experience. (c) 2023 Acoustical Society of America.
引用
收藏
页码:248 / 259
页数:12
相关论文
共 50 条
  • [41] CONSIDERATIONS REGARDING INDIVIDUALIZATION OF HEAD-RELATED TRANSFER FUNCTIONS
    Jin, C. T.
    Zolfaghari, R.
    Long, X.
    Sebastian, A.
    Hossain, S.
    Glaunes, J.
    Tew, A.
    Shahnawaz, M.
    Sarti, A.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6787 - 6791
  • [42] A study of morphological influence on head-related transfer functions
    Xu, S.
    Li, Z. Z.
    Zeng, L.
    Salvendy, G.
    2007 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1-4, 2007, : 472 - 476
  • [43] A localization algorithm based on head-related transfer functions
    MacDonald, Justin A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (06): : 4290 - 4296
  • [44] A localization algorithm based on head-related transfer functions
    MacDonald, Justin A.
    Journal of the Acoustical Society of America, 2008, 123 (06): : 4290 - 4296
  • [45] Perceptual attributes for the comparison of head-related transfer functions
    Simon, Laurent S. R.
    Zacharov, Nick
    Katz, Brian F. G.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (05): : 3623 - 3632
  • [46] Loudness stability of binaural sound with spherical harmonic representation of sparse head-related transfer functions
    Ben-Hur, Zamir
    Alon, David Lou
    Rafaely, Boaz
    Mehra, Ravish
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (1)
  • [47] EFFICIENT REPRESENTATION OF HEAD-RELATED TRANSFER FUNCTIONS IN SUBBANDS
    Marenlli, Damian
    Baumgartner, Robert
    Majdak, Piotr
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 915 - 919
  • [48] Anthropometric Parameters Influencing Head-Related Transfer Functions
    Fels, Janina
    Vorlaender, Michael
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2009, 95 (02) : 331 - 342
  • [49] Deep Learning for Synthesis of Head-related Transfer Functions
    Bharitkar, Sunil
    146TH AES CONVENTION, 2019,
  • [50] Head-related transfer function measurements in a compartment fire
    Abbasi, Mustafa Z.
    Wilson, Preston S.
    Ezekoye, Ofodike A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (03): : 1730 - 1740