DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation

Cited by: 0

Authors:

Feng, Xinyang [1]

Li, Nuo [1]

He, Zunwen [1]

Zhang, Yan [1]

Zhang, Wancheng [1]

Affiliations:

[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

Source:

2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2021

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China;

Keywords:

speech dereverberation; linear prediction residual; deep neural network; REVERBERANT; NOISY; INTELLIGIBILITY;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

In daily-life scenarios, reverberation inevitably degrades speech recognizability and speech quality. Methods that eliminate reverberation benefit both human perception and speech technology applications such as identity authentication and speech recognition. This paper proposes a speech dereverberation algorithm based on linear prediction (LP) residual processing using a deep neural network (DNN). The amplitude spectrum of the LP residual of short-term speech is used as the speech feature to train the DNN, which learns the mapping between the LP residual of the reverberant speech and that of the clean speech. Comparative experiments under different reverberation conditions verify the effectiveness and robustness of the algorithm.
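
The feature described in the abstract, the amplitude spectrum of the short-term LP residual, can be illustrated with a minimal sketch. This is not the authors' implementation: the frame length, hop size, LP order, Hann window, and the function names `lp_coefficients`, `lp_residual`, and `residual_amplitude_features` are all assumptions chosen for illustration, and the DNN that maps reverberant features to clean ones is not shown.

```python
import numpy as np


def lp_coefficients(frame, order):
    """Autocorrelation-method LP analysis (Levinson-Durbin recursion).

    Returns the prediction-error filter a, with a[0] = 1, so that
    e[n] = sum_j a[j] * x[n - j].
    """
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-12  # small floor keeps silent frames from dividing by zero
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def lp_residual(frame, a):
    """Inverse-filter the frame with A(z) to obtain the LP residual."""
    return np.convolve(frame, a)[:len(frame)]


def residual_amplitude_features(signal, frame_len=512, hop=256, order=12):
    """Amplitude spectrum of the LP residual for each short-term frame.

    frame_len, hop, and order are assumed values, not taken from the paper.
    Returns an array of shape (n_frames, frame_len // 2 + 1).
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    feats = np.empty((n_frames, frame_len // 2 + 1))
    for t in range(n_frames):
        frame = signal[t * hop:t * hop + frame_len] * window
        a = lp_coefficients(frame, order)
        feats[t] = np.abs(np.fft.rfft(lp_residual(frame, a)))
    return feats
```

In a setup like the one the abstract outlines, features computed this way from reverberant speech would serve as network input and those from the corresponding clean speech as the training target.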

Citation

Pages: 541-545

Number of pages: 5

50 items in total

[21] A NEW COST FUNCTION FOR DNN-BASED SPEECH ENHANCEMENT COMBINING NMF AND CASA
Yan, Bofang
Bao, Changchun
Bai, Zhigang
[J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 255 - 259
[22] Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement
Cui, Zihao
Bao, Changchun
[J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 618 - 622
[23] INVERTIBLE DNN-BASED NONLINEAR TIME-FREQUENCY TRANSFORM FOR SPEECH ENHANCEMENT
Takeuchi, Daiki
Yatabe, Kohei
Koizumi, Yuma
Oikawa, Yasuhiro
Harada, Noboru
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6644 - 6648
[24] Speech enhancement using linear prediction residual
Yegnanarayana, B
Avendano, C
Hermansky, H
Murthy, PS
[J]. SPEECH COMMUNICATION, 1999, 28 (01) : 25 - 42
[25] Concatenated Identical DNN (CI-DNN) to Reduce Noise-Type Dependence in DNN-Based Speech Enhancement
Xu, Ziyi
Strake, Maximilian
Fingscheidt, Tim
[J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[26] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
Furnon, Nicolas
Serizel, Romain
Illina, Irina
Essid, Slim
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
[27] DNN-BASED SPEECH RECOGNITION FOR GLOBALPHONE LANGUAGES
Tachbelie, Martha Yifiru
Abulimiti, Ayimunishagu
Abate, Solomon Teferra
Schultz, Tanja
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8269 - 8273
[28] DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
Furnon, Nicolas
Serizel, Romain
Essid, Slim
Illina, Irina
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2310 - 2323
[29] SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
Rehr, Robert
Gerkmann, Timo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1937 - 1949
