LINEAR DEMIXED DOMAIN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR SPEECH ENHANCEMENT

被引:0
|
作者
Taniguchi, Toru [1 ]
Masuda, Taro [1 ]
机构
[1] Toshiba Co Ltd, Corp Res & Dev Ctr, Saiwai Ku, 1 Komukai Toshiba Cho, Kawasaki, Kanagawa 2128582, Japan
关键词
multichannel NMF; ICA; diagonal spatial co-variance; speech enhancement; source separation; AUDIO SOURCE SEPARATION; MIXTURES;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate blind source separation for audio signals based on multichannel non negative matrix factorization (MNMF) of magnitude spectrograms in a linear demixed domain. The original magnitude MNMF by itself is less effective in general acoustic situations because it discards mutual information between input channels, which is represented by non-diagonal complex elements of the spatial covariance matrices of them. To deal with this problem, several linear transformations of the multichannel input have been proposed in order to diagonalize the covariance matrices without loss of the mutual information. However, when the number of microphones is small, it is difficult for static transformations to work well for various combinations of source positions. For this problem, we first prove that general linear transformations (linear demixing) can be applied as preprocessing of the magnitude MNMF, and then confirm that a transformation adaptive to source positions, such as using frequency domain independent component analysis, is better than the conventional static transformation by experimental comparison of 2- and 4-channel noisy speech enhancement tasks.
引用
收藏
页码:476 / 480
页数:5
相关论文
共 50 条
  • [1] On microphone arrangement for multichannel speech enhancement based on nonnegative matrix factorization in time-channel domain
    Murase, Yoshikazu
    Chiba, Hironobu
    Ono, Nobutaka
    Miyabe, Shigeki
    Yamada, Takeshi
    Makino, Shoji
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [2] SPEECH ENHANCEMENT USING SEGMENTAL NONNEGATIVE MATRIX FACTORIZATION
    Fan, Hao-Teng
    Hung, Jeih-weih
    Lu, Xugang
    Wang, Syu-Siang
    Tsao, Yu
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization
    Wang, Syu-Siang
    Chern, Alan
    Tsao, Yu
    Hung, Jeih-weih
    Lu, Xugang
    Lai, Ying-Hui
    Su, Borching
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1101 - 1105
  • [4] Speech enhancement based on nonnegative matrix factorization in constant-Q frequency domain
    Xu, Longting
    Wei, Zhilin
    Zaidi, Syed Faham Ali
    Ren, Bo
    Yang, Jichen
    [J]. APPLIED ACOUSTICS, 2021, 174
  • [5] Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Yoshii, Kazuyoshi
    [J]. INTERSPEECH 2020, 2020, : 4541 - 4545
  • [6] Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    [J]. INTERSPEECH 2021, 2021, : 661 - 665
  • [7] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365
  • [8] Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
    Mohammadiha, Nasser
    Smaragdis, Paris
    Leijon, Arne
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2140 - 2151
  • [9] SPEECH ENHANCEMENT USING NONNEGATIVE MATRIX FACTORIZATION WITH TEMPORAL CONTINUITY
    Nam, Seung-Hyon
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (03): : 240 - 246
  • [10] A NEW LINEAR MMSE FILTER FOR SINGLE CHANNEL SPEECH ENHANCEMENT BASED ON NONNEGATIVE MATRIX FACTORIZATION
    Mohammadiha, Nasser
    Gerkmann, Timo
    Leijon, Arne
    [J]. 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 45 - 48