Single-channel Speech Enhancement Student under Multi-channel Speech Enhancement Teacher

被引:0
|
作者
Zhang, Yuzhu [1 ]
Zhang, Hui [1 ]
Zhang, Xueliang [1 ]
机构
[1] Inner Mongolia Univ, Dept Comp Sci, Hohhot, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, significant success has been made in single-channel speech enhancement using the deep neural networks. These approaches trained a model on synthetic noisy speech corpus, which was created by adding noise to clean speech. Because there is a mismatch between synthetic training data and the actual application environment, the model's performance is not guaranteed. This paper proposes to use a multi-channel speech enhancement teacher model to guide a single-channel noise suppression student model. We set the multi-channel teacher's processed signal as the single-channel student's training target. With our proposed approach, the single-channel speech enhancement model can be trained using real noisy speech and performed as well as a multi-channel speech enhancement model. Experimental results on CHIME-3 demonstrate that our proposed approach can achieve competitive performance both in speech enhancement and automatic speech recognition tasks, even without ground truth signals.
引用
收藏
页码:372 / 377
页数:6
相关论文
共 50 条
  • [1] Robust Speaker Recognition Based on Single-Channel and Multi-Channel Speech Enhancement
    Taherian, Hassan
    Wang, Zhong-Qiu
    Chang, Jorge
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1293 - 1302
  • [2] INCORPORATING MULTI-CHANNEL WIENER FILTER WITH SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM
    Yong, Pei Chee
    Nordholm, Sven
    Dam, Hai Huyen
    Leung, Yee Hong
    Lai, Chiong Ching
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7284 - 7288
  • [3] Weak Speech Recovery for Single-Channel Speech Enhancement
    Wong, Arthur
    Ming, Kok
    Low, Siow Yong
    [J]. 2012 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS), VOLS 1-2, 2012, : 627 - 631
  • [4] Single-Channel Speech Enhancement Techniques for Distant Speech Recognition
    Ashwini, Jaya
    Kumaraswamy, Ramaswamy
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (02) : 81 - 93
  • [5] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66
  • [6] Multi-channel psychoacoustically motivated speech enhancement
    Rosca, J
    Balan, R
    Beaugeant, C
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 217 - 220
  • [7] Multi-channel psychoacoustically motivated speech enhancement
    Rosca, J
    Balan, R
    Beaugeant, C
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 84 - 87
  • [8] Multi-channel Speech Enhancement in Driving Environment
    Jin, Weiyun
    Wei, Jie
    Zhong, Xiaofeng
    [J]. 2017 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2017,
  • [9] Single-channel speech enhancement by subspace affinity minimization
    Tran, Dung N.
    Koishida, Kazuhito
    [J]. INTERSPEECH 2020, 2020, : 2447 - 2451
  • [10] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
    Zhou, Tingting
    Zeng, Yumin
    Wang, Rongrong
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 272 - 284