VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET

被引：0

作者：

Chan, Tak-Shing ^{[1
]}

Yeh, Tzu-Chun ^{[2
]}

Fan, Zhe-Cheng ^{[2
]}

Chen, Hung-Wei ^{[3
]}

Sui, Li ^{[1
]}

Yang, Yi-Hsuan ^{[1
]}

Jang, Roger ^{[2
]}

机构：

[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan

[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan

[3] iKala Interact Media Inc, Taipei, Taiwan

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年

关键词：

Low-rank and sparse decomposition; singing voice separation; informed source separation; RECORDINGS; MUSIC; SOUND;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A new algorithm is proposed for robust principal component analysis with predefined sparsity patterns. The algorithm is then applied to separate the singing voice from the instrumental accompaniment using vocal activity information. To evaluate its performance, we construct a new publicly available iKala dataset that features longer durations and higher quality than the existing MIR-IK dataset for singing voice separation. Part of it will be used in the MIREX Singing Voice Separation task. Experimental results on both the MIR-IK dataset and the new iKala dataset confirmed that the more informed the algorithm is, the better the separation results are.

引用

页码：718 / 722

页数：5

共 50 条

[21] Mechanical construction of a human vocal system for singing voice production
Sawada, H
Hashimoto, S
ADVANCED ROBOTICS, 2000, 13 (07) : 647 - 661
[22] SINGING VOICE SEPARATION: A STUDY ON TRAINING DATA
Pretet, Laure
Hennequin, Romain
Royo-Letelier, Jimena
Vaglio, Andrea
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 506 - 510
[23] The Singing Voice Before and After Vocal Warm-up by Students of Chinese National Singing
Chen, Yu
Kong, Weifeng
Chi, Yujie
Chen, Yanting
Wei, Jianguo
Dang, Jianwu
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[24] Integration Strategies of American Voice Singing in Singing and Vocal Teaching Based on Multiscale Feature Fusion
Liu Y.
Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
[25] Vocal92: Audio Dataset With a Cappella Solo Singing and Speech
Deng, Zhuo
Zhou, Ruohua
IEEE ACCESS, 2023, 11 : 140958 - 140966
[26] Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
Li, Feng
Hu, Yujun
Wang, Lingling
SENSORS, 2023, 23 (06)
[27] Research on Singing Voice Detection Based on a Long-Term Recurrent Convolutional Network with Vocal Separation and Temporal Smoothing
Zhang, Xulong
Yu, Yi
Gao, Yongwei
Chen, Xi
Li, Wei
ELECTRONICS, 2020, 9 (09) : 1 - 23
[28] Validation of the Spanish version of the voice handicap index for vocal singing(SVHI)
Garcia-Lopez, Isabel
Nunez-Batalla, Faustino
Gavilan Bouzas, Javier
Gorriz-Gil, Carmen
ACTA OTORRINOLARINGOLOGICA ESPANOLA, 2010, 61 (06): : 469 - 469
[29] Validation of the Spanish version of the voice handicap index for vocal singing (SVHI)
Garcia-Lopez, Isabel
Nunez-Batalla, Faustino
Gavilan Bouzas, Javier
Gorriz-Gil, Carmen
ACTA OTORRINOLARINGOLOGICA ESPANOLA, 2010, 61 (04): : 247 - 254
[30] Singing Voice Separation and Vocal FO Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation
Ikemiya, Yukara
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2084 - 2095

← 1 2 3 4 5 →