Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization

Authors
Schmidt, Mikkel N. [1]
Olsson, Rasmus K. [1]
Affiliations
[1] Tech Univ Denmark, Lyngby, Denmark
Keywords
Single-channel source separation; sparse non-negative matrix factorization
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We apply machine learning techniques to the problem of separating multiple speech sources from a single microphone recording. The method of choice is a sparse non-negative matrix factorization algorithm, which can learn sparse representations of the data in an unsupervised manner. This is applied to the learning of personalized dictionaries from a speech corpus, which in turn are used to separate the audio stream into its components. We show that computational savings can be achieved by segmenting the training data on a phoneme level. To split the data, a conventional speech recognizer is used. Both the unsupervised and the supervised adaptation schemes yield significant improvements in the target-to-masker ratio.
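The dictionary-based separation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes magnitude spectrograms as input, uses generic multiplicative updates with an L1 penalty on the activations as a simplified stand-in for their sparse NMF algorithm, and adds a soft-mask reconstruction step that the abstract does not mention. The function names `sparse_nmf` and `separate`, the `sparsity` weight, and all default values are hypothetical.

```python
import numpy as np

def sparse_nmf(V, n_atoms, sparsity=0.1, n_iter=200, W=None, seed=0, eps=1e-9):
    """Approximate V by W @ H with non-negative factors and an L1 penalty on H.

    If a dictionary W is supplied it is kept fixed and only H is estimated
    (n_atoms is then ignored). Simplified sketch, not the paper's algorithm.
    """
    rng = np.random.default_rng(seed)
    learn_W = W is None
    if learn_W:
        W = rng.random((V.shape[0], n_atoms)) + eps
        W /= np.linalg.norm(W, axis=0, keepdims=True)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        # The sparsity term in the denominator shrinks the activations.
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        if learn_W:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
            # Keep atoms at unit norm; rescale H so that W @ H is unchanged.
            norms = np.linalg.norm(W, axis=0, keepdims=True) + eps
            W /= norms
            H *= norms.T
    return W, H

def separate(V_mix, W1, W2, sparsity=0.1, n_iter=200, eps=1e-9):
    """Split a mixture spectrogram using two fixed per-speaker dictionaries."""
    W = np.hstack([W1, W2])
    _, H = sparse_nmf(V_mix, W.shape[1], sparsity, n_iter, W=W)
    k = W1.shape[1]
    R1 = W1 @ H[:k]   # part of the mixture explained by speaker 1 atoms
    R2 = W2 @ H[k:]   # part of the mixture explained by speaker 2 atoms
    mask = R1 / (R1 + R2 + eps)  # soft mask (an assumption of this sketch)
    return mask * V_mix, (1.0 - mask) * V_mix

# Usage with synthetic non-negative data standing in for spectrograms.
rng = np.random.default_rng(1)
V1, V2 = rng.random((16, 40)), rng.random((16, 40))
W1, _ = sparse_nmf(V1, n_atoms=4, n_iter=50)
W2, _ = sparse_nmf(V2, n_atoms=4, n_iter=50)
S1, S2 = separate(V1 + V2, W1, W2, n_iter=50)
```

In this sketch each speaker's dictionary is learned once from that speaker's training data and then held fixed, so separating a new mixture only requires estimating the activations.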
Pages: 2614-2617
Number of pages: 4