VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET

Citations: 0

Authors:

Chan, Tak-Shing [1]

Yeh, Tzu-Chun [2]

Fan, Zhe-Cheng [2]

Chen, Hung-Wei [3]

Sui, Li [1]

Yang, Yi-Hsuan [1]

Jang, Roger [2]

Affiliations:

[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan

[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan

[3] iKala Interact Media Inc, Taipei, Taiwan

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015

Keywords:

Low-rank and sparse decomposition; singing voice separation; informed source separation; RECORDINGS; MUSIC; SOUND;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

A new algorithm is proposed for robust principal component analysis with predefined sparsity patterns. The algorithm is then applied to separate the singing voice from the instrumental accompaniment using vocal activity information. To evaluate its performance, we construct a new publicly available iKala dataset that features longer durations and higher quality than the existing MIR-1K dataset for singing voice separation. Part of it will be used in the MIREX Singing Voice Separation task. Experimental results on both the MIR-1K dataset and the new iKala dataset confirmed that the more informed the algorithm is, the better the separation results are.
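
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea and should not be read as the authors' method. It is a textbook inexact augmented Lagrangian solver for robust principal component analysis, modified so that the sparse component is forced to zero wherever a binary mask is zero. The function name `masked_rpca`, its parameters, and the default values are illustrative assumptions.

```python
import numpy as np


def masked_rpca(M, mask, lam=None, rho=1.5, n_iter=200, tol=1e-7):
    """Split M into low-rank L plus sparse S, with S zero wherever mask == 0.

    Hypothetical illustration only; not the algorithm from the paper.
    """
    M = np.asarray(M, dtype=float)
    mask = np.asarray(mask, dtype=float)
    m, n = M.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))  # common default weight for the l1 term
    spec = max(np.linalg.norm(M, 2), 1e-12)  # largest singular value of M
    norm_M = max(np.linalg.norm(M), 1e-12)   # Frobenius norm of M
    mu, mu_max = 1.25 / spec, 1.25e7 / spec  # penalty parameter and its cap
    S = np.zeros_like(M)
    Y = np.zeros_like(M)  # Lagrange multipliers for the constraint M = L + S
    for _ in range(n_iter):
        # Low-rank update: singular value thresholding at level 1/mu.
        U, s, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = (U * np.maximum(s - 1.0 / mu, 0.0)) @ Vt
        # Sparse update: soft-thresholding, then zero entries outside the mask.
        R = M - L + Y / mu
        S = mask * np.sign(R) * np.maximum(np.abs(R) - lam / mu, 0.0)
        # Multiplier and penalty updates.
        Z = M - L - S
        Y = Y + mu * Z
        mu = min(mu * rho, mu_max)
        if np.linalg.norm(Z) / norm_M < tol:
            break
    return L, S
```

In a vocal-activity-informed setting, one might take `M` to be a magnitude spectrogram (frequency bins by frames) and build the mask from frame-level vocal labels, for example `mask = np.tile(active, (n_bins, 1))` for a boolean vector `active`. Under that assumption `S` would serve as the vocal estimate and `L` as the accompaniment estimate.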

Citation

Pages: 718-722

Page count: 5

50 items in total

[1] Informed Group-Sparse Representation for Singing Voice Separation
Chan, Tak-Shing T.
Yang, Yi-Hsuan
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (02) : 156 - 160
[2] VocEmb4SVS: Improving Singing Voice Separation with Vocal Embeddings
Li, Chenyi
Li, Yi
Du, Xuhao
Ju, Yaolong
Hu, Shichao
Wu, Zhiyong
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 234 - 239
[3] ANALYSIS OF VOCAL REGISTERS IN SINGING VOICE
ARAUZ, JC
JACKSON, CA
NAIDICH, S
FOLIA PHONIATRICA, 1986, 38 (5-6) : 281 - 281
[4] Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
Schulze-Forster, Kilian
Doire, Clement S. J.
Richard, Gael
Badeau, Roland
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 2382 - 2395
[5] Vocal tract resonances in singing: The soprano voice
Joliveau, E
Smith, J
Wolfe, J
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (04) : 2434 - 2439
[6] Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF
Munoz-Montoro, Antonio J.
Politis, Archontis
Drossos, Konstantinos
Carabias-Orti, Julio J.
2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
[7] Vocal tract resonances in singing: The soprano voice
Joliveau, Elodie
Smith, John
Wolfe, Joe
1600, Acoustical Society of America (116):
[8] Annotated-VocalSet: A Singing Voice Dataset
Faghih, Behnam
Timoney, Joseph
APPLIED SCIENCES-BASEL, 2022, 12 (18):
[9] On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
Hsu, Chao-Ling
Jang, Jyh-Shing Roger
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02) : 310 - 319
[10] Hybrid binaural singing voice separation
Kasak, Peter
Jarina, Roman
Jakubec, Maros
Ticha, Dasa
2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA, 2023,
