Modeling individual head-related transfer functions from sparse measurements using a convolutional neural network

被引:6
|
作者
Jiang, Ziran [1 ,3 ]
Sang, Jinqiu [2 ]
Zheng, Chengshi [1 ,3 ]
Li, Andong [1 ,3 ]
Li, Xiaodong [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai 200062, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
来源
关键词
INTERPOLATION; RESOLUTION; HRTFS; LOCALIZATION;
D O I
10.1121/10.0016854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Individual head-related transfer functions (HRTFs) are usually measured with high spatial resolution or modeled with anthropometric parameters. This study proposed an HRTF individualization method using only spatially sparse measurements using a convolutional neural network (CNN). The HRTFs were represented by two-dimensional images, in which the horizontal and vertical ordinates indicated direction and frequency, respectively. The CNN was trained by using the HRTF images measured at specific sparse directions as input and using the corresponding images with a high spatial resolution as output in a prior HRTF database. The HRTFs of a new subject can be recovered by the trained CNN with the sparsely measured HRTFs. Objective experiments showed that, when using 23 directions to recover individual HRTFs at 1250 directions, the spectral distortion (SD) is around 4.4 dB; when using 105 directions, the SD reduced to around 3.8 dB. Subjective experiments showed that the individualized HRTFs recovered from 105 directions had smaller discrimination proportion than the baseline method and were perceptually undistinguishable in many directions. This method combines the spectral and spatial characteristics of HRTF for individualization, which has potential for improving virtual reality experience. (c) 2023 Acoustical Society of America.
引用
收藏
页码:248 / 259
页数:12
相关论文
共 50 条
  • [31] A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions
    Qi, Xiaoke
    Tao, Jianhua
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 540 - 544
  • [32] Obtaining an Optimal Set of Head-Related Transfer Functions with a Small Amount of Measurements
    Parviainen, Mikko
    Pertila, Pasi
    2017 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2017,
  • [33] Frequency and amplitude estimation of the first peak of head-related transfer functions from individual pinna anthropometry
    Mokhtari, Parham
    Takemoto, Hironori
    Nishimura, Ryouichi
    Kato, Hiroaki
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (02): : 690 - 701
  • [34] Common-acoustical-pole and zero modeling of head-related transfer functions
    Haneda, Y
    Makino, S
    Kaneda, Y
    Kitawaki, N
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02): : 188 - 196
  • [35] INTERPOLATION OF HEAD-RELATED TRANSFER FUNCTIONS USING SPHERICAL FOURIER EXPANSION
    Huang Qinghua Fang Yong (The Key Lab of Specialty Fiber Optics and Optical Access Network
    Journal of Electronics(China), 2009, 26 (04) : 571 - 576
  • [36] AN IMPROVED ANTHROPOMETRY-BASED CUSTOMIZATION METHOD OF INDIVIDUAL HEAD-RELATED TRANSFER FUNCTIONS
    Liu, Xuejie
    Zhong, Xiaoli
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 336 - 339
  • [37] Head movement during head-related transfer function measurements
    Hirahara, Tatsuya
    Sagara, Hiroyuki
    Toshima, Iwaki
    Otani, Makoto
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2010, 31 (02) : 165 - 171
  • [38] Loudness stability of binaural sound with spherical harmonic representation of sparse head-related transfer functions
    Zamir Ben-Hur
    David Lou Alon
    Boaz Rafaely
    Ravish Mehra
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [39] A Hybrid Algorithm for Predicting Median-Plane Head-Related Transfer Functions from Anthropometric Measurements
    Liu, Xuejie
    Song, Hao
    Zhong, Xiaoli
    APPLIED SCIENCES-BASEL, 2019, 9 (11):
  • [40] Audibility of Differences in Adjacent Head-Related Transfer Functions
    Hoffmann, Pablo F.
    Moller, Henrik
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2008, 94 (06) : 945 - 954