VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET

被引:0
|
作者
Chan, Tak-Shing [1 ]
Yeh, Tzu-Chun [2 ]
Fan, Zhe-Cheng [2 ]
Chen, Hung-Wei [3 ]
Sui, Li [1 ]
Yang, Yi-Hsuan [1 ]
Jang, Roger [2 ]
机构
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[3] iKala Interact Media Inc, Taipei, Taiwan
关键词
Low-rank and sparse decomposition; singing voice separation; informed source separation; RECORDINGS; MUSIC; SOUND;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new algorithm is proposed for robust principal component analysis with predefined sparsity patterns. The algorithm is then applied to separate the singing voice from the instrumental accompaniment using vocal activity information. To evaluate its performance, we construct a new publicly available iKala dataset that features longer durations and higher quality than the existing MIR-IK dataset for singing voice separation. Part of it will be used in the MIREX Singing Voice Separation task. Experimental results on both the MIR-IK dataset and the new iKala dataset confirmed that the more informed the algorithm is, the better the separation results are.
引用
收藏
页码:718 / 722
页数:5
相关论文
共 50 条
  • [1] Informed Group-Sparse Representation for Singing Voice Separation
    Chan, Tak-Shing T.
    Yang, Yi-Hsuan
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (02) : 156 - 160
  • [2] VocEmb4SVS: Improving Singing Voice Separation with Vocal Embeddings
    Li, Chenyi
    Li, Yi
    Du, Xuhao
    Ju, Yaolong
    Hu, Shichao
    Wu, Zhiyong
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 234 - 239
  • [3] ANALYSIS OF VOCAL REGISTERS IN SINGING VOICE
    ARAUZ, JC
    JACKSON, CA
    NAIDICH, S
    FOLIA PHONIATRICA, 1986, 38 (5-6): : 281 - 281
  • [4] Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
    Schulze-Forster, Kilian
    Doire, Clement S. J.
    Richard, Gael
    Badeau, Roland
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 2382 - 2395
  • [5] Vocal tract resonances in singing: The soprano voice
    Joliveau, E
    Smith, J
    Wolfe, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04): : 2434 - 2439
  • [6] Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF
    Munoz-Montoro, Antonio J.
    Politis, Archontis
    Drossos, Konstantinos
    Carabias-Orti, Julio J.
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [7] Vocal tract resonances in singing: The soprano voice
    Joliveau, Elodie
    Smith, John
    Wolfe, Joe
    1600, Acoustical Society of America (116):
  • [8] Annotated-VocalSet: A Singing Voice Dataset
    Faghih, Behnam
    Timoney, Joseph
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [9] On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
    Hsu, Chao-Ling
    Jang, Jyh-Shing Roger
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02): : 310 - 319
  • [10] Hybrid binaural singing voice separation
    Kasak, Peter
    Jarina, Roman
    Jakubec, Maros
    Ticha, Dasa
    2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA, 2023,