Exploring redundancy of HRTFs for fast training DNN-based HRTF personalization

Cited by: 0
Authors
Chen, Tzu-Yu [1 ]
Hsiao, Po-Wen [1 ]
Chi, Tai-Shih [1 ]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan
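Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification codes
0808; 0809;
Abstract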
A deep neural network (DNN) is constructed to predict the magnitude response of a user's head-related transfer function (HRTF) for a specific direction and a specific ear. Using the CIPIC HRTF database (25 azimuth angles and 50 elevation angles for both ears), we trained 2500 DNNs to predict the magnitude responses of all HRTFs of a user. Because the magnitude responses of HRTFs change smoothly across nearby directions, we propose to reduce training time by using the final weights of the trained DNN of a nearby direction as the initial weights of the DNN currently under training. Analysis of variance (ANOVA) was performed to show that, in terms of the log-spectral distortion (LSD) measure, the proposed training scheme produces HRTF magnitude responses equivalent to those of the standard training scheme with random initial weights. Meanwhile, the proposed training scheme reduces training time by more than 95%.
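The warm-start idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: a linear model trained by gradient descent stands in for each per-direction DNN, a zero initialization stands in for random initial weights, and the features, weights, learning rate, and tolerance are all hypothetical values chosen for illustration. The LSD function uses the common RMS-in-dB definition, which is assumed here since the abstract does not spell out the formula.

```python
import numpy as np

def log_spectral_distortion(h_true, h_pred):
    """RMS difference in dB between two magnitude responses (assumed LSD definition)."""
    h_true = np.asarray(h_true, dtype=float)
    h_pred = np.asarray(h_pred, dtype=float)
    diff_db = 20.0 * np.log10(h_true / h_pred)
    return float(np.sqrt(np.mean(diff_db ** 2)))

def train(features, targets, init_weights, lr=0.1, tol=1e-6, max_steps=10_000):
    """Gradient descent on mean squared error; returns (weights, steps used)."""
    w = init_weights.copy()
    n = len(features)
    for step in range(max_steps):
        residual = features @ w - targets
        if np.mean(residual ** 2) < tol:
            return w, step
        w -= lr * (2.0 / n) * features.T @ residual
    return w, max_steps

rng = np.random.default_rng(0)
features = rng.standard_normal((200, 4))   # hypothetical per-subject input features
base = np.array([1.0, -2.0, 0.5, 3.0])     # hypothetical ideal weights, direction 0
drift = np.array([1.0, 1.0, -1.0, 0.5])    # small change between neighbouring directions
true_weights = [base + 0.01 * i * drift for i in range(10)]

cold_steps, warm_steps = 0, 0
previous = None
for w_true in true_weights:
    targets = features @ w_true
    # Standard scheme: every direction starts from scratch.
    cold_w, n_cold = train(features, targets, np.zeros(4))
    # Proposed scheme: start from the neighbouring direction's final weights.
    init = np.zeros(4) if previous is None else previous
    previous, n_warm = train(features, targets, init)
    cold_steps += n_cold
    warm_steps += n_warm

print("total steps, cold start:", cold_steps)
print("total steps, warm start:", warm_steps)
```

In this toy setting both schemes reach the same tolerance, and the warm-started chain needs fewer gradient steps because each model begins close to its solution. The size of the saving here says nothing about the figure reported in the abstract.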
Pages: 1929-1933
Page count: 5
Related papers
34 items in total
  • [21] A KL Divergence and DNN-based Approach to Voice Conversion without Parallel Training Sentences
    Xie, Feng-Long
    Soong, Frank K.
    Li, Haifeng
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 287 - 291
  • [22] FAST DNN TRAINING BASED ON AUXILIARY FUNCTION TECHNIQUE
    Tran, Dung T.
    Ono, Nobutaka
    Vincent, Emmanuel
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2160 - 2164
  • [23] A DNN-BASED ACOUSTIC MODELING OF TONAL LANGUAGE AND ITS APPLICATION TO MANDARIN PRONUNCIATION TRAINING
    Hu, Wenping
    Qian, Yao
    Soong, Frank K.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [24] Pre-Training of DNN-Based Speech Synthesis Based on Bidirectional Conversion between Text and Speech
    Sone, Kentaro
    Nakashika, Toru
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (08) : 1546 - 1553
  • [25] DNN-based QoT Estimation Using Topological Inputs and Training with Synthetic-Physical Data
    Mayer, Kayol S.
    dos Santos, Luan C. M.
    Pinto, Rossano P.
    Dal Maso, Marcos P. A.
    Rothenberg, Christian E.
    Arantes, Dalton S.
    Mello, Darli A. A.
    2023 IEEE PHOTONICS CONFERENCE, IPC, 2023,
  • [26] DNN-based Feature Enhancement using Joint Training Framework for Robust Multichannel Speech Recognition
    Lee, Kang Hyun
    Kang, Tae Gyoon
    Kang, Woo Hyun
    Kim, Nam Soo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3027 - 3031
  • [27] SOFT-TARGET TRAINING WITH AMBIGUOUS EMOTIONAL UTTERANCES FOR DNN-BASED SPEECH EMOTION CLASSIFICATION
    Ando, Atsushi
    Kobashikawa, Satoshi
    Kamiyama, Hosana
    Masumura, Ryo
    Ijima, Yusuke
    Aono, Yushi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4964 - 4968
  • [28] DNN-based feature enhancement using joint training framework for robust multichannel speech recognition
    Lee, Kang Hyun
    Kang, Tae Gyoon
    Kang, Woo Hyun
    Kim, Nam Soo
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016 : 3027 - 3031
  • [29] JOINT NOISE AND MASK AWARE TRAINING FOR DNN-BASED SPEECH ENHANCEMENT WITH SUB-BAND FEATURES
    Wang, Qing
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 101 - 105
  • [30] Fast and Accurate DNN-Based Approach in Maximizing Ultra-Wideband Fiber-Optic Systems Throughput
    Gan, Zelin
    Shevchenko, Mykyta
    Herzberg, Sam Nallaperuma
    Savory, Seb J.
    2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,