Exploring redundancy of HRTFs for fast training DNN-based HRTF personalization

Cited by: 0

Authors:

Chen, Tzu-Yu [1]

Hsiao, Po-Wen [1]

Chi, Tai-Shih [1]

Affiliations:

[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan

Source:

2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2018

Keywords:

DOI:

Not available

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

A deep neural network (DNN) is constructed to predict the magnitude response of a user's head-related transfer function (HRTF) for a specific direction and a specific ear. Using the CIPIC HRTF database (25 azimuth angles and 50 elevation angles for both ears), we trained 2500 DNNs to predict the magnitude responses of all HRTFs of a user. To reduce training time, we propose using the final weights of the trained DNN of a nearby direction as the initial weights of the DNN currently under training, since the magnitude responses of HRTFs change smoothly across nearby directions. Analysis of variance (ANOVA) was performed to show that, in terms of the log-spectral distortion (LSD) measure, the proposed training scheme produces HRTF magnitude responses equivalent to those of the standard training scheme with random initial weights. Meanwhile, the proposed training scheme reduces training time by more than 95%.
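The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The LSD formula used here (root-mean-square of the dB difference across frequency bins) is a commonly used definition and is assumed, since the record does not state the exact form. The function names `log_spectral_distortion` and `initial_weights`, the `eps` guard, and the 0.01 scale for random initialization are all hypothetical choices made for illustration.

```python
import numpy as np

def log_spectral_distortion(h_true, h_pred, eps=1e-12):
    """LSD in dB between two magnitude responses.

    Assumed definition: RMS over frequency bins of 20*log10(|H| / |H_hat|).
    eps is a hypothetical guard against division by zero and log of zero.
    """
    h_true = np.asarray(h_true, dtype=float)
    h_pred = np.asarray(h_pred, dtype=float)
    ratio_db = 20.0 * np.log10((h_true + eps) / (h_pred + eps))
    return float(np.sqrt(np.mean(ratio_db ** 2)))

def initial_weights(shape, neighbor_weights=None, rng=None):
    """Pick initial weights for the DNN of one direction.

    If the trained weights of a nearby direction are given, copy them
    (the warm start described in the abstract); otherwise draw small
    random values (the standard scheme). The 0.01 scale is an assumption.
    """
    if neighbor_weights is not None:
        return np.array(neighbor_weights, copy=True)
    rng = np.random.default_rng() if rng is None else rng
    return 0.01 * rng.standard_normal(shape)

# One DNN per direction and ear, using the grid sizes stated in the abstract.
n_azimuth, n_elevation, n_ears = 25, 50, 2
n_networks = n_azimuth * n_elevation * n_ears
```

The product of the grid sizes gives the 2500 networks mentioned in the abstract. In a full training loop, `initial_weights` would be called once per layer, passing the corresponding layer of an already trained neighboring direction.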

Pages: 1929-1933

Page count: 5
