Single-channel Speech Enhancement Student under Multi-channel Speech Enhancement Teacher

Cited by: 0

Authors:

Zhang, Yuzhu [1]

Zhang, Hui [1]

Zhang, Xueliang [1]

Affiliations:

[1] Inner Mongolia Univ, Dept Comp Sci, Hohhot, Peoples R China

Source:

PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2022

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, deep neural networks have brought significant success to single-channel speech enhancement. These approaches train a model on a synthetic noisy speech corpus, which is created by adding noise to clean speech. Because there is a mismatch between synthetic training data and the actual application environment, the model's performance is not guaranteed. This paper proposes to use a multi-channel speech enhancement teacher model to guide a single-channel noise suppression student model. We set the multi-channel teacher's processed signal as the single-channel student's training target. With our proposed approach, the single-channel speech enhancement model can be trained using real noisy speech and perform as well as a multi-channel speech enhancement model. Experimental results on CHiME-3 demonstrate that our proposed approach can achieve competitive performance in both speech enhancement and automatic speech recognition tasks, even without ground-truth signals.
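The training idea in the abstract (the teacher's processed signal becomes the student's target, so no clean reference is needed) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the teacher is replaced by simple channel averaging, the student is a short FIR filter rather than a neural network, and the names `make_toy_recording`, `teacher_enhance`, `student_enhance`, and `train_student` are hypothetical.

```python
import math
import random


def make_toy_recording(n_channels=4, n_samples=300, noise_std=0.3, seed=0):
    """Hypothetical data: one sinusoid seen by several mics, each with its own noise."""
    rng = random.Random(seed)
    source = [math.sin(0.2 * t) for t in range(n_samples)]
    return [[s + rng.gauss(0.0, noise_std) for s in source] for _ in range(n_channels)]


def teacher_enhance(channels):
    """Stand-in teacher: average the time-aligned channels.
    (Assumption: the paper's teacher is a trained multi-channel model.)"""
    n = len(channels[0])
    return [sum(ch[t] for ch in channels) / len(channels) for t in range(n)]


def student_enhance(weights, x):
    """Toy single-channel student: a causal FIR filter over one channel."""
    return [
        sum(w * x[t - k] for k, w in enumerate(weights) if t - k >= 0)
        for t in range(len(x))
    ]


def mse(a, b):
    """Mean squared error between two equal-length signals."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


def train_student(channels, ref=0, taps=4, lr=0.2, epochs=300):
    """Fit the student on one channel, using the teacher output as the target."""
    x = channels[ref]
    target = teacher_enhance(channels)  # pseudo-label; no clean speech involved
    weights = [0.0] * taps
    for _ in range(epochs):
        est = student_enhance(weights, x)
        err = [e - t for e, t in zip(est, target)]
        for k in range(taps):
            # Gradient of the MSE with respect to tap k.
            grad = 2.0 * sum(err[t] * x[t - k] for t in range(k, len(x))) / len(x)
            weights[k] -= lr * grad
    return weights, target


if __name__ == "__main__":
    channels = make_toy_recording()
    weights, target = train_student(channels)
    print("unprocessed channel vs teacher:", mse(channels[0], target))
    print("student output vs teacher:", mse(student_enhance(weights, channels[0]), target))
```

At inference time only `student_enhance` and a single channel are needed. The teacher and the remaining microphones are used during training alone, which is the property that lets a single-channel model learn from real multi-channel recordings.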

Citation

Pages: 372-377

Number of pages: 6

