Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation

被引:3
|
作者
Nakatani, Tomohiro [1 ]
Ikeshita, Rintaro [1 ]
Kinoshita, Keisuke [1 ]
Sawada, Hiroshi [1 ]
Araki, Shoko [1 ]
机构
[1] NTT Corp, Tokyo, Japan
来源
关键词
Blind source separation; dereverberation; automatic speech recognition; INDEPENDENT COMPONENT ANALYSIS; MIXTURES;
D O I
10.21437/Interspeech.2020-2138
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper proposes new blind signal processing techniques for optimizing a multi-input multi-output (MIMO) convolutional beamformer (CBF) in a computationally efficient way to simultaneously perform dereverberation and source separation. For effective CBF optimization, a conventional technique factorizes it into a multiple-target weighted prediction error (WPE) based dereverberation filter and a separation matrix. However, this technique requires the calculation of a huge spatio-temporal covariance matrix that reflects the statistics of all the sources, which makes the computational cost very high. For computationally efficient optimization, this paper introduces two techniques: one that decomposes the huge covariance matrix into ones for individual sources, and another that decomposes the CBF into sub-filters for estimating individual sources. Both techniques effectively and substantively reduce the size of the covariance matrices that must calculated, and allow us to greatly reduce the computational cost without loss of optimality.
引用
下载
收藏
页码:91 / 95
页数:5
相关论文
共 50 条
  • [31] An Efficient Analytic Solution for Joint Blind Source Separation
    Gabrielson, Ben
    Baker Siddique Akhonda, Mohammad Abu
    Lehmann, Isabell
    Adali, Tulay
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 2436 - 2449
  • [32] Blind Separation of the Joint Algorithm Based on the Cyclostationarity Optimization
    Jing-hong, Xue
    Min, Li
    2016 IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY ICEICT 2016 PROCEEDINGS, 2016, : 266 - 270
  • [33] A stochastic framework for computationally efficient fail-safe topology optimization
    Zhang, Yiming
    Zhang, Hongyi
    Qiu, Lemiao
    Wang, Zili
    Zhang, Shuyou
    Qiu, Na
    Fang, Jianguang
    ENGINEERING STRUCTURES, 2023, 283
  • [34] A convolutive blind signal separation method for joint speech signal separation and echo cancellation
    Pan, QF
    Aboulnasr, T
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 752 - 755
  • [35] Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering
    Du, Yicheng
    Scheibler, Robin
    Togami, Masahito
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 927 - 931
  • [36] Computationally Efficient Blind Source Separation-Based MIMO-PA Linearization
    Shipra
    Rawat, Meenakshi
    IEEE ACCESS, 2024, 12 : 3126 - 3139
  • [37] BLIND SPEECH SEPARATION EMPLOYING DIRECTIONAL STATISTICS IN AN EXPECTATION MAXIMIZATION FRAMEWORK
    Dang Hai Tran Vu
    Haeb-Umbach, Reinhold
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 241 - 244
  • [38] Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model
    Liu S.
    Yang F.
    Chen R.
    Yang J.
    Applied Acoustics, 2024, 224
  • [39] Riemannian Optimization and Approximate Joint Diagonalization for Blind Source Separation
    Bouchard, Florent
    Malick, Jerome
    Congedo, Marco
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (08) : 2041 - 2054
  • [40] OPTIMIZED JOINT NOISE SUPPRESSION AND DEREVERBERATION BASED ON BLIND SIGNAL EXTRACTION FOR HANDS-FREE SPEECH RECOGNITION SYSTEM
    Aprilyanti, Fine D.
    Saruwatari, Hiroshi
    Nakamura, Satoshi
    Takatani, Tomoya
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 182 - 186