NOISE ROBUST ESTIMATION OF THE VOICE SOURCE USING A DEEP NEURAL NETWORK

被引:0
|
作者
Airaksinen, Manu [1 ]
Raitio, Tuomo [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
基金
芬兰科学院;
关键词
Voice source estimation; glottal inverse filtering; deep neural network; noise robustness; INVERSE FILTERING ANALYSIS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the analysis of speech production, information about the voice source can be obtained non-invasively with glottal inverse filtering (GIF) methods. Current state-of-the-art GIF methods are capable of producing high-quality estimates in suitable conditions (e.g. low noise and reverberation), but their performance deteriorates in non-ideal conditions because they require noise-sensitive parameter estimation. This study proposes a method for noise robust estimation of the voice source by creating a mapping using a deep neural network (DNN) between robust low-level speech features and the desired reference, a time-domain glottal flow computed by a GIF method. The method was evaluated with two GIF methods, of which one (quasi closed phase analysis, QCP) requires additional parameter estimation and the other (iterative adaptive inverse filtering, IAIF) does not. The results show that the proposed method outperforms the QCP method with SNRs less than 50-20 dB, but the simple IAIF method only with very low SNRs.
引用
收藏
页码:5137 / 5141
页数:5
相关论文
共 50 条
  • [1] PHASE AWARE DEEP NEURAL NETWORK FOR NOISE ROBUST VOICE ACTIVITY DETECTION
    Wang, Longbiao
    Phapatanaburi, Khomdet
    Oo, Zeyan
    Nakagawa, Seiichi
    Iwahashi, Masahiro
    Dang, Jianwu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1087 - 1092
  • [2] Image operator forensics and sequence estimation using robust deep neural network
    Agarwal, Saurabh
    Jung, Ki-Hyun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 47431 - 47454
  • [3] Image operator forensics and sequence estimation using robust deep neural network
    Saurabh Agarwal
    Ki-Hyun Jung
    [J]. Multimedia Tools and Applications, 2024, 83 : 47431 - 47454
  • [4] Deep Neural Network-Based Noise Estimation for Robust ASR in Dual-Microphone Smartphones
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    Gomez, Angel M.
    Martin-Donas, Juan M.
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 117 - 127
  • [5] Glottal source estimation from coded telephone speech using a deep neural network
    Narendra, N. P.
    Airaksinen, Manu
    Alku, Paavo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3931 - 3935
  • [6] Robust direction of arrival (DOA) estimation using RBF neural network in impulsive noise enviromnent
    Tang, H
    Qiu, TS
    Li, S
    Guo, Y
    Zhang, WR
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 332 - 337
  • [7] Binaural Deep Neural Network for Noise Robust Automatic Speech Recognition
    Jiang, Yi
    Zu, Yuan-Yuan
    [J]. INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND AUTOMATION (ICCEA 2014), 2014, : 512 - 517
  • [8] DOA estimation based on a deep neural network under impulsive noise
    Ruiyan Cai
    Quan Tian
    Yang Luo
    [J]. Signal, Image and Video Processing, 2024, 18 : 785 - 792
  • [9] DOA estimation based on a deep neural network under impulsive noise
    Cai, Ruiyan
    Tian, Quan
    Luo, Yang
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 785 - 792
  • [10] Deep Neural Network for Robust Modulation Classification Under Uncertain Noise Conditions
    Hu, Shisheng
    Pei, Yiyang
    Liang, Paul Pu
    Liang, Ying-Chang
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (01) : 564 - 577