Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation

被引：3

作者：

Nakatani, Tomohiro ^{[1
]}

Ikeshita, Rintaro ^{[1
]}

Kinoshita, Keisuke ^{[1
]}

Sawada, Hiroshi ^{[1
]}

Araki, Shoko ^{[1
]}

机构：

[1] NTT Corp, Tokyo, Japan

来源：

INTERSPEECH 2020 | 2020年

关键词：

Blind source separation; dereverberation; automatic speech recognition; INDEPENDENT COMPONENT ANALYSIS; MIXTURES;

D O I：

10.21437/Interspeech.2020-2138

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

This paper proposes new blind signal processing techniques for optimizing a multi-input multi-output (MIMO) convolutional beamformer (CBF) in a computationally efficient way to simultaneously perform dereverberation and source separation. For effective CBF optimization, a conventional technique factorizes it into a multiple-target weighted prediction error (WPE) based dereverberation filter and a separation matrix. However, this technique requires the calculation of a huge spatio-temporal covariance matrix that reflects the statistics of all the sources, which makes the computational cost very high. For computationally efficient optimization, this paper introduces two techniques: one that decomposes the huge covariance matrix into ones for individual sources, and another that decomposes the CBF into sub-filters for estimating individual sources. Both techniques effectively and substantively reduce the size of the covariance matrices that must calculated, and allow us to greatly reduce the computational cost without loss of optimality.

引用

下载

页码：91 / 95

页数：5

共 50 条

[21] A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising
XinTang
JunDu
LiChai
Wang, Yannan
Wang, Qing
Lee, Chin-Hui
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 274 - 278
[22] SEMI-BLIND SPEECH ENHANCEMENT BASED ON RECURRENT NEURAL NETWORK FOR SOURCE SEPARATION AND DEREVERBERATION
Wake, Masaya
Bando, Yoshiaki
Mimura, Masato
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
Kawahara, Tatsuya
2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
[23] BLIND AND NEURAL NETWORK-GUIDED CONVOLUTIONAL BEAMFORMER FOR JOINT DENOISING, DEREVERBERATION, AND SOURCE SEPARATION
Nakatani, Tomohiro
Ikeshita, Rintaro
Kinoshita, Keisuke
Sawada, Hiroshi
Araki, Shoko
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6129 - 6133
[24] A COMPUTATIONALLY CONSTRAINED OPTIMIZATION FRAMEWORK FOR IMPLEMENTATION AND TUNING OF SPEECH ENHANCEMENT SYSTEMS
Giacobello, Daniele
Wung, Jason
Pichevar, Ramin
Atkins, Joshua
2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 159 - 163
[25] EFFICIENT BLIND SPEECH SEPARATION SUITABLE FOR EMBEDDED DEVICES
Kondo, Kazunobu
Takahashi, Yu
Hashimoto, Seiichi
Saruwatari, Hiroshi
Nishino, Takanori
Takeda, Kazuya
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2319 - 2323
[26] An Efficient Activation Function for Blind Separation of Speech Signals
Sun, Shouyu
PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED ICT AND EDUCATION, 2013, 33 : 410 - 413
[27] JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION
Togami, Masahito
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3032 - 3036
[28] Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation
Ueda, Tetsuya
Nakatani, Tomohiro
Ikeshita, Rintaro
Kinoshita, Keisuke
Araki, Shoko
Makino, Shoji
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1000 - 1004
[29] A COMPUTATIONALLY CHEAPER METHOD FOR BLIND SPEECH SEPARATION BASED ON AUXIVA AND INCOMPLETE DEMIXING TRANSFORM
Jansky, Jakub
Koldovsky, Zbynek
Ono, Nobutaka
2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[30] A Novel Approach for Blind Separation and Dereverberation of Speech Mixtures using Multiple step Linear Predictive Coding
Ehsan, Wajeeha
Jan, Tariqullah
2015 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2015,

← 1 2 3 4 5 →