HARMONIC-PERCUSSIVE SOURCE SEPARATION WITH DEEP NEURAL NETWORKS AND PHASE RECOVERY

Cited by: 0
Authors
Drossos, Konstantinos [1]
Magron, Paul [1]
Mimilakis, Stylianos Ioannis [2]
Virtanen, Tuomas [1]
Affiliations
[1] Tampere Univ Technol, Lab Signal Proc, Tampere, Finland
[2] Fraunhofer IDMT, Ilmenau, Germany
Funding
EU Horizon 2020; Academy of Finland; European Research Council
Keywords
harmonic/percussive source separation; deep neural networks; MaD TwinNet; phase recovery; sinusoidal model;
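DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract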
Harmonic/percussive source separation (HPSS) consists of separating the pitched instruments from the percussive parts of a music mixture. In this paper, we apply the recently introduced Masker-Denoiser with twin networks (MaD TwinNet) system to this task. MaD TwinNet is a deep learning architecture that has reached state-of-the-art results in monaural singing voice separation. Here, we use it to estimate the magnitude spectrogram of the percussive source. We then retrieve the complex-valued short-time Fourier transform of the sources by means of a phase recovery algorithm that minimizes the reconstruction error and constrains the phase of the harmonic part to follow a sinusoidal phase model. Experiments conducted on realistic music mixtures show that this novel separation system outperforms the previous state-of-the-art kernel additive model approach.
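The sinusoidal phase model mentioned in the abstract can be illustrated with a small sketch. It is not the authors' phase recovery algorithm, which also minimizes a reconstruction error. It only shows the commonly used sinusoidal recursion, in which the phase of a stationary partial advances from one frame to the next by 2*pi*hop*nu, where nu is a normalized frequency in cycles per sample. The function names, the hop size, and the toy frequency value are assumptions made for this illustration.

```python
import numpy as np


def propagate_sinusoidal_phase(initial_phase, norm_freq, hop):
    """Unwrap phase over time with a sinusoidal model (illustrative only).

    initial_phase: shape (n_bins,), phase of the first frame in radians.
    norm_freq: shape (n_bins, n_frames), assumed normalized frequency
        (cycles per sample) of the partial at each time-frequency bin.
    hop: hop size of the STFT in samples.
    """
    initial_phase = np.asarray(initial_phase, dtype=float)
    norm_freq = np.asarray(norm_freq, dtype=float)
    n_bins, n_frames = norm_freq.shape
    phase = np.empty((n_bins, n_frames))
    phase[:, 0] = initial_phase
    for t in range(1, n_frames):
        # Each frame adds the phase advance of a sinusoid over one hop.
        phase[:, t] = phase[:, t - 1] + 2.0 * np.pi * hop * norm_freq[:, t]
    return phase


def to_complex_stft(magnitude, phase):
    """Combine a magnitude estimate with a phase estimate."""
    return magnitude * np.exp(1j * phase)


# Toy example: one frequency bin, four frames, constant frequency (hypothetical values).
hop = 256
norm_freq = np.full((1, 4), 0.01)
phase = propagate_sinusoidal_phase(np.zeros(1), norm_freq, hop)
stft = to_complex_stft(np.ones((1, 4)), phase)
```

In a full system, the magnitude passed to `to_complex_stft` would come from the network estimate, and the frequencies would have to be estimated from the signal; both steps are outside the scope of this sketch.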
Pages: 421-425
Page count: 5