VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET

Cited by: 0
Authors
Chan, Tak-Shing [1]
Yeh, Tzu-Chun [2]
Fan, Zhe-Cheng [2]
Chen, Hung-Wei [3]
Sui, Li [1]
Yang, Yi-Hsuan [1]
Jang, Roger [2]
Affiliations
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[3] iKala Interact Media Inc, Taipei, Taiwan
Keywords
Low-rank and sparse decomposition; singing voice separation; informed source separation; RECORDINGS; MUSIC; SOUND
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
A new algorithm is proposed for robust principal component analysis with predefined sparsity patterns. The algorithm is then applied to separate the singing voice from the instrumental accompaniment using vocal activity information. To evaluate its performance, we construct a new publicly available iKala dataset that features longer durations and higher quality than the existing MIR-1K dataset for singing voice separation. Part of it will be used in the MIREX Singing Voice Separation task. Experimental results on both the MIR-1K dataset and the new iKala dataset confirmed that the more informed the algorithm is, the better the separation results are.
Pages: 718 - 722
Page count: 5
Related papers
50 in total
  • [21] Mechanical construction of a human vocal system for singing voice production
    Sawada, H
    Hashimoto, S
    ADVANCED ROBOTICS, 2000, 13 (07) : 647 - 661
  • [22] SINGING VOICE SEPARATION: A STUDY ON TRAINING DATA
    Pretet, Laure
    Hennequin, Romain
    Royo-Letelier, Jimena
    Vaglio, Andrea
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 506 - 510
  • [23] The Singing Voice Before and After Vocal Warm-up by Students of Chinese National Singing
    Chen, Yu
    Kong, Weifeng
    Chi, Yujie
    Chen, Yanting
    Wei, Jianguo
    Dang, Jianwu
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016
  • [24] Integration Strategies of American Voice Singing in Singing and Vocal Teaching Based on Multiscale Feature Fusion
    Liu Y.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [25] Vocal92: Audio Dataset With a Cappella Solo Singing and Speech
    Deng, Zhuo
    Zhou, Ruohua
    IEEE ACCESS, 2023, 11 : 140958 - 140966
  • [26] Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
    Li, Feng
    Hu, Yujun
    Wang, Lingling
    SENSORS, 2023, 23 (06)
  • [27] Research on Singing Voice Detection Based on a Long-Term Recurrent Convolutional Network with Vocal Separation and Temporal Smoothing
    Zhang, Xulong
    Yu, Yi
    Gao, Yongwei
    Chen, Xi
    Li, Wei
    ELECTRONICS, 2020, 9 (09) : 1 - 23
  • [28] Validation of the Spanish version of the voice handicap index for vocal singing (SVHI)
    Garcia-Lopez, Isabel
    Nunez-Batalla, Faustino
    Gavilan Bouzas, Javier
    Gorriz-Gil, Carmen
    ACTA OTORRINOLARINGOLOGICA ESPANOLA, 2010, 61 (06) : 469 - 469
  • [29] Validation of the Spanish version of the voice handicap index for vocal singing (SVHI)
    Garcia-Lopez, Isabel
    Nunez-Batalla, Faustino
    Gavilan Bouzas, Javier
    Gorriz-Gil, Carmen
    ACTA OTORRINOLARINGOLOGICA ESPANOLA, 2010, 61 (04) : 247 - 254
  • [30] Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation
    Ikemiya, Yukara
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2084 - 2095