Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation

被引:3
|
作者
Nakatani, Tomohiro [1 ]
Ikeshita, Rintaro [1 ]
Kinoshita, Keisuke [1 ]
Sawada, Hiroshi [1 ]
Araki, Shoko [1 ]
机构
[1] NTT Corp, Tokyo, Japan
来源
关键词
Blind source separation; dereverberation; automatic speech recognition; INDEPENDENT COMPONENT ANALYSIS; MIXTURES;
D O I
10.21437/Interspeech.2020-2138
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper proposes new blind signal processing techniques for optimizing a multi-input multi-output (MIMO) convolutional beamformer (CBF) in a computationally efficient way to simultaneously perform dereverberation and source separation. For effective CBF optimization, a conventional technique factorizes it into a multiple-target weighted prediction error (WPE) based dereverberation filter and a separation matrix. However, this technique requires the calculation of a huge spatio-temporal covariance matrix that reflects the statistics of all the sources, which makes the computational cost very high. For computationally efficient optimization, this paper introduces two techniques: one that decomposes the huge covariance matrix into ones for individual sources, and another that decomposes the CBF into sub-filters for estimating individual sources. Both techniques effectively and substantively reduce the size of the covariance matrices that must calculated, and allow us to greatly reduce the computational cost without loss of optimality.
引用
下载
收藏
页码:91 / 95
页数:5
相关论文
共 50 条
  • [21] A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising
    XinTang
    JunDu
    LiChai
    Wang, Yannan
    Wang, Qing
    Lee, Chin-Hui
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 274 - 278
  • [22] SEMI-BLIND SPEECH ENHANCEMENT BASED ON RECURRENT NEURAL NETWORK FOR SOURCE SEPARATION AND DEREVERBERATION
    Wake, Masaya
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [23] BLIND AND NEURAL NETWORK-GUIDED CONVOLUTIONAL BEAMFORMER FOR JOINT DENOISING, DEREVERBERATION, AND SOURCE SEPARATION
    Nakatani, Tomohiro
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Sawada, Hiroshi
    Araki, Shoko
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6129 - 6133
  • [24] A COMPUTATIONALLY CONSTRAINED OPTIMIZATION FRAMEWORK FOR IMPLEMENTATION AND TUNING OF SPEECH ENHANCEMENT SYSTEMS
    Giacobello, Daniele
    Wung, Jason
    Pichevar, Ramin
    Atkins, Joshua
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 159 - 163
  • [25] EFFICIENT BLIND SPEECH SEPARATION SUITABLE FOR EMBEDDED DEVICES
    Kondo, Kazunobu
    Takahashi, Yu
    Hashimoto, Seiichi
    Saruwatari, Hiroshi
    Nishino, Takanori
    Takeda, Kazuya
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2319 - 2323
  • [26] An Efficient Activation Function for Blind Separation of Speech Signals
    Sun, Shouyu
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED ICT AND EDUCATION, 2013, 33 : 410 - 413
  • [27] JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION
    Togami, Masahito
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3032 - 3036
  • [28] Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation
    Ueda, Tetsuya
    Nakatani, Tomohiro
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Araki, Shoko
    Makino, Shoji
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1000 - 1004
  • [29] A COMPUTATIONALLY CHEAPER METHOD FOR BLIND SPEECH SEPARATION BASED ON AUXIVA AND INCOMPLETE DEMIXING TRANSFORM
    Jansky, Jakub
    Koldovsky, Zbynek
    Ono, Nobutaka
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [30] A Novel Approach for Blind Separation and Dereverberation of Speech Mixtures using Multiple step Linear Predictive Coding
    Ehsan, Wajeeha
    Jan, Tariqullah
    2015 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2015,