Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

Cited by: 0
Authors
Grais, Emad M. [1]
Erdogan, Hakan [1]
Affiliations
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
Keywords
Single channel source separation; source separation; semi-blind source separation; speech music separation; speech processing; nonnegative matrix factorization; Wiener filter
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with sliding windows and spectral masks is proposed in this work. We train a set of basis vectors for each source signal using NMF in the magnitude spectral domain. Rather than forming each column of the matrices to be decomposed by NMF from a single spectral frame, we build each column by stacking multiple spectral frames. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a weighted linear combination of the trained basis vectors for both sources. An initial spectrogram estimate for each source is found, and a spectral mask is built using these initial estimates. This mask is used to weight the mixed signal spectrogram to find the contribution of each source signal in the mixed signal. The method is shown to perform better than the conventional NMF approach.
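The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names (`stack_frames`, `nmf`, `separate`), the Euclidean multiplicative updates, the column-major stacking order, and the mask exponent `p` are all assumptions made for the example. The abstract does not specify the cost function, the window length, or the exact mask form. With `p=2` the mask has the form of a Wiener-style gain, which matches the "Wiener filter" keyword only loosely.

```python
import numpy as np

def stack_frames(V, width):
    """Slide a window over the columns of V (freq x frames) and stack
    `width` consecutive frames into one tall column (assumed layout)."""
    n_freq, n_frames = V.shape
    cols = [V[:, t:t + width].reshape(-1, order="F")
            for t in range(n_frames - width + 1)]
    return np.stack(cols, axis=1)  # shape: (n_freq * width, n_frames - width + 1)

def nmf(V, rank, n_iter=200, W=None, seed=0, eps=1e-12):
    """Multiplicative-update NMF with a Euclidean cost (an assumption).
    If W is supplied it is held fixed and only the weights H are updated."""
    rng = np.random.default_rng(seed)
    fixed = W is not None
    if not fixed:
        W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        if not fixed:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def separate(V_mix, W_speech, W_music, p=2, n_iter=200, eps=1e-12):
    """Decompose the stacked mixture spectra on the trained bases, form
    initial source estimates, then apply a soft spectral mask."""
    W = np.hstack([W_speech, W_music])
    _, H = nmf(V_mix, W.shape[1], n_iter=n_iter, W=W)
    k = W_speech.shape[1]
    S0 = W_speech @ H[:k]   # initial speech estimate
    M0 = W_music @ H[k:]    # initial music estimate
    mask = S0**p / (S0**p + M0**p + eps)
    return mask * V_mix, (1.0 - mask) * V_mix
```

In this sketch, training would call `nmf(stack_frames(V_train, width), rank)` once per source to obtain `W_speech` and `W_music`, and separation would call `separate(stack_frames(V_mix, width), W_speech, W_music)`. Mapping the stacked estimates back to a single-frame spectrogram and resynthesizing audio with the mixture phase are left out.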
Pages: 1784-1787
Page count: 4
Related papers
50 in total
  • [41] Blind spectral unmixing in terahertz domain using nonnegative matrix factorization
    Li, Xian
    Huang, Ping J.
    Ma, Ye H.
    Hou, Di B.
    Zhang, Guang X.
    SELECTED PAPERS OF THE PHOTOELECTRONIC TECHNOLOGY COMMITTEE CONFERENCES, 2015, 9795
  • [43] Spectro-temporal Filtering based on The Beta-divergence for Speech Separation using Nonnegative Matrix Factorization
    Fakhry, Mahmoud
    2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,
  • [44] Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation
    Kim, Minje
    Yoo, Jiho
    Kang, Kyeongok
    Choi, Seungjin
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1192 - 1204
  • [45] Speech/Music Separation Using Non-negative Matrix Factorization with Combination of Cost Functions
    Nasersharif, Babak
    Abdali, Sara
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 107 - 111
  • [46] Blind image separation using Nonnegative Matrix Factorization with Gibbs smoothing
    Zdunek, Rafal
    Cichocki, Andrzej
    NEURAL INFORMATION PROCESSING, PART II, 2008, 4985 : 519 - +
  • [47] Single-Channel Speech Dereverberation Based on Block-wise Weighted Prediction Error and Nonnegative Matrix Factorization
    Kwak, Chan Woong
    Jeon, Kwang Myung
    Park, In Young
    Kim, Hong Kook
    Lim, Jeong Eun
    Park, Ji Hyun
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [48] Tracking the evolution of temporal patterns of usage in bicycle-Sharing systems using nonnegative matrix factorization on multiple sliding windows
    Cazabet, Remy
    Jensen, Pablo
    Borgnat, Pierre
    INTERNATIONAL JOURNAL OF URBAN SCIENCES, 2018, 22 (02) : 147 - 161
  • [49] TRANSDUCTIVE NONNEGATIVE MATRIX FACTORIZATION FOR SEMI-SUPERVISED HIGH-PERFORMANCE SPEECH SEPARATION
    Guan, Naiyang
    Lan, Long
    Tao, Dacheng
    Luo, Zhigang
    Yang, Xuejun
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [50] Robust Music Signal Separation Based on Supervised Nonnegative Matrix Factorization with Prevention of Basis Sharing
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Yagi, Kosuke
    Shikano, Kiyohiro
    Takahashi, Yu
    Kondo, Kazunobu
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 392 - 397