Hybrid Multichannel Signal Separation Using Supervised Nonnegative Matrix Factorization with Spectrogram Restoration

被引:0
|
作者
Kitamura, Daichi [1 ]
Saruwatari, Hiroshi [2 ]
Nakamura, Satoshi [3 ]
Takahashi, Yu [4 ]
Kondo, Kazunobu [4 ]
Kameoka, Hirokazu [2 ]
机构
[1] Grad Univ Adv Studies, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[2] Univ Tokyo, Bunkyo Ku, Tokyo 1138656, Japan
[3] Nara Inst Sci & Technol, Ikoma, Nara 6300192, Japan
[4] Yamaha Corp, Iwata, Shizuoka 4380192, Japan
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a new hybrid method that concatenates directional clustering and advanced nonnegative matrix factorization (NMF) for the purpose of the specific sound extraction from the multichannel music signal. Multichannel music signal separation technology is aimed to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In the previous studies, various methods using NMF have been proposed, but they remain many problems, e.g., poor convergence in update rules in NMF and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and its hybrid method that concatenates the proposed SNMF after directional clustering. Via extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we theoretically reveal the trade-off between separation and extrapolation abilities and propose a new scheme for multi-divergence, where optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] DIVERGENCE OPTIMIZATION IN NONNEGATIVE MATRIX FACTORIZATION WITH SPECTROGRAM RESTORATION FOR MULTICHANNEL SIGNAL SEPARATION
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Nakamura, Satoshi
    Takahashi, Yu
    Kondo, Kazunobu
    Kameoka, Hirokazu
    [J]. 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 92 - 96
  • [2] Multichannel Signal Separation Combining Directional Clustering and Nonnegative Matrix Factorization with Spectrogram Restoration
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Kameoka, Hirokazu
    Takahashi, Yu
    Kondo, Kazunobu
    Nakamura, Satoshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 654 - 669
  • [3] Music Signal Separation by Supervised Nonnegative Matrix Factorization with Basis Deformation
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    Kondo, Kazunobu
    Takahashi, Yu
    [J]. 2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
  • [4] Superresolution-Based Stereo Signal Separation via Supervised Nonnegative Matrix Factorization
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Iwao, Yusuke
    Shikano, Kiyohiro
    Kondo, Kazunobu
    Takahashi, Yu
    [J]. 2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
  • [5] Robust Music Signal Separation Based on Supervised Nonnegative Matrix Factorization with Prevention of Basis Sharing
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Yagi, Kosuke
    Shikano, Kiyohiro
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 392 - 397
  • [6] Novel method of informative frequency band selection for vibration signal using Nonnegative Matrix Factorization of spectrogram matrix
    Wodecki, Jacek
    Kruczek, Piotr
    Bartkowiak, Anna
    Zimroz, Radoslaw
    Wylomanska, Agnieszka
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 130 : 585 - 596
  • [7] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [8] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [9] Sequential Initialization of Multichannel Nonnegative Matrix Factorization for Sound Source Separation
    Uramoto, Takanobu
    Tachioka, Yuuki
    Narita, Tomohiro
    Miura, Iori
    Uenohara, Shingo
    Furuya, Ken'ichi
    [J]. 2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [10] Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization
    Oh, Son-hook
    Kim, Jung-Han
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 120 - 130