Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network

Authors
GE Wanying [1]
ZHANG Tianqi [1]
FAN Congcong [1]
ZHANG Tian [1]
Affiliations
[1] School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications
Funding
National Natural Science Foundation of China
DOI
10.15949/j.cnki.0217-9776.2021.02.008
Chinese Library Classification number
TN912.3 [Speech signal processing]
Discipline classification code
0711
Abstract
The performance of monaural speech separation methods is limited when the speech mixture is corrupted by background noise. To obtain enhanced separated speech from a noisy mixture, a monaural noisy speech separation method combining sparse non-negative matrix factorization (SNMF) and a deep attractor network (DANet) is proposed. The method first decomposes the noisy mixture into speech coefficients and noise coefficients. The speech coefficients are then projected into a high-dimensional embedding space, and a DANet is trained to force the embeddings to move toward different clusters. The attractor points are used to separate the speech coefficients by masking, and finally the enhanced separated speech signals are reconstructed from the speech basis and their corresponding coefficients. Experimental results in various background noise environments show that, compared with different baseline methods, the proposed algorithm effectively suppresses noise without degrading the quality of the reconstructed speech.
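The two core steps named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no cost function, update rule, or network details, so the sketch assumes a fixed basis `W` (speech and noise dictionaries stacked column-wise), a Euclidean cost with an L1 penalty on the coefficients solved by multiplicative updates, and softmax masks computed from dot products between embeddings and attractors. The function names, the `sparsity` weight, and the random embeddings standing in for a trained network's output are all hypothetical.

```python
import numpy as np


def sparse_nmf_coefficients(V, W, sparsity=0.1, n_iter=200, eps=1e-12, seed=0):
    """Estimate non-negative coefficients H such that V is approximately W @ H.

    Assumption: W is a fixed, pre-trained basis, and the objective is
    0.5 * ||V - W H||^2 + sparsity * sum(H), minimised with multiplicative updates.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        # The L1 penalty appears as a constant added to the denominator.
        H *= WtV / (WtW @ H + sparsity + eps)
    return H


def attractor_masks(embeddings, assignments, eps=1e-12):
    """Compute attractors and soft masks from embeddings.

    embeddings:  (N, K) array, one K-dimensional embedding per coefficient bin.
    assignments: (N, C) one-hot array giving the dominant source of each bin.
    Returns attractors of shape (C, K) and masks of shape (N, C).
    """
    counts = assignments.sum(axis=0)[:, None]
    # Each attractor is the mean embedding of the bins assigned to that source.
    attractors = (assignments.T @ embeddings) / (counts + eps)
    logits = embeddings @ attractors.T
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    masks = np.exp(logits)
    masks /= masks.sum(axis=1, keepdims=True)
    return attractors, masks


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    n_freq, n_frames, n_speech, n_noise = 32, 20, 8, 4

    # Hypothetical dictionaries; in practice these would be learned from data.
    W_speech = rng.random((n_freq, n_speech))
    W_noise = rng.random((n_freq, n_noise))
    W = np.hstack([W_speech, W_noise])
    V = W @ rng.random((n_speech + n_noise, n_frames))  # synthetic magnitudes

    H = sparse_nmf_coefficients(V, W)
    H_speech = H[:n_speech]  # speech coefficients; noise rows are discarded

    # Random embeddings stand in for the output of a trained network.
    n_bins = H_speech.size
    emb = rng.standard_normal((n_bins, 5))
    Y = np.eye(2)[np.arange(n_bins) % 2]
    _, masks = attractor_masks(emb, Y)

    # Mask the speech coefficients per speaker and rebuild with the speech basis.
    for c in range(2):
        H_c = H_speech * masks[:, c].reshape(H_speech.shape)
        print(c, (W_speech @ H_c).shape)
```

Keeping `W` fixed and updating only `H` mirrors the description of decomposing the mixture into speech and noise coefficients. Discarding the noise rows before masking is what lets the reconstruction use the speech basis alone.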
Pages: 266-280
Number of pages: 15