Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network

被引:0
|
作者
GE Wanying [1 ]
ZHANG Tianqi [1 ]
FAN Congcong [1 ]
ZHANG Tian [1 ]
机构
[1] School of Communication and Information Engineering,Chongqing University of Posts and Telecommunications
基金
中国国家自然科学基金;
关键词
D O I
10.15949/j.cnki.0217-9776.2021.02.008
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The performance of the monaural speech separation method is limited when the speech mixture is disordered by background noise.To obtain the enhanced separated speech from the noisy mixture,a monaural noisy speech separation method combining sparse nonnegative matrix factorization(SNMF) and deep attractor network(DANet) is proposed.This method firstly decomposes the noisy mixture into coefficients of speech and noise respectively.Then the speech coefficient is projected to a high-dimensional embedding space and a DANet is trained to force the embeddings to move to different clusters.The attractor points are used to separate the speech coefficients by masking method,and finally the enhanced separated speeches are reconstructed by the speech basis and their corresponding coefficients.Experimental results in various background noise environments show that the proposed algorithm effectively suppress the noises without decreasing the quality of reconstructed speech by comparison with different baseline methods.
引用
收藏
页码:266 / 280
页数:15
相关论文
共 50 条
  • [21] Speech Enhancement Using Sparse Convolutive Non-negative Matrix Factorization with Basis Adaptation
    Carlin, Michael A.
    Malyska, Nicolas
    Quatieri, Thomas F.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 582 - 585
  • [22] Algorithms for Non-Negative Matrix Factorization on Noisy Data With Negative Values
    Green, Dylan
    Bailey, Stephen
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 5187 - 5197
  • [23] SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION
    Nakamura, Tomohiko
    Kameoka, Hirokazu
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 489 - 493
  • [24] A Novel Signal Separation Method Based on Improved Sparse Non-Negative Matrix Factorization
    Wang, Huaqing
    Wang, Mengyang
    Li, Junlin
    Song, Liuyang
    Hao, Yansong
    ENTROPY, 2019, 21 (05)
  • [25] SOURCE SEPARATION WITH SCATTERING NON-NEGATIVE MATRIX FACTORIZATION
    Bruna, Joan
    Sprechmann, Pablo
    LeCun, Yann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1876 - 1880
  • [26] Vocal Separation by Constrained Non-Negative Matrix Factorization
    Ochiai, Eri
    Fujisawa, Takanori
    Ikehara, Masaaki
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 480 - 483
  • [27] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
    Nandakumar, Mona M.
    Bijoy, Edet K.
    2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
  • [28] Combining Non-negative Matrix Factorization and Sparse Coding for Functional Brain Overlapping Community Detection
    X. Li
    Z. Hu
    H. Wang
    Cognitive Computation, 2018, 10 : 991 - 1005
  • [29] Speech/Music Separation Using Non-negative Matrix Factorization with Combination of Cost Functions
    Nasersharif, Babak
    Abdali, Sara
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 107 - 111
  • [30] Combining Non-negative Matrix Factorization and Sparse Coding for Functional Brain Overlapping Community Detection
    Li, X.
    Hu, Z.
    Wang, H.
    COGNITIVE COMPUTATION, 2018, 10 (06) : 991 - 1005