Singing voice detection for karaoke application

Cited by: 2

Authors:

Shenoy, A [1]

Wu, YS [1]

Wang, Y [1]

Affiliation:

[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore

Source:

Visual Communications and Image Processing 2005, Pts 1-4 | 2005 / Vol. 5960

Keywords:

karaoke; singing voice; vocal segmentation; tonic; key; inverse comb filtering; rhythm; lyrics

DOI:

10.1117/12.631645

Chinese Library Classification (CLC) number:

TB8 [Photographic technology]

Discipline classification code:

0804

Abstract:

We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages a combination of low-level audio features and higher-level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank to attenuate the presence of the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This is used to obtain a dynamic threshold for vocal/non-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted and various improvements are discussed.
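
The keywords name inverse comb filtering as the tool for attenuating pitched instruments, but the abstract does not give the filter design. The following is only a minimal sketch of a generic textbook FIR inverse comb filter, y[n] = x[n] - g * x[n - D], which places notches at multiples of sample_rate / D. It is not the authors' implementation. The function names (`inverse_comb`, `tone`, `rms`), the 8000 Hz sample rate, and the 400 Hz test note are assumptions chosen for illustration.

```python
import math

def inverse_comb(signal, delay, gain=1.0):
    """Generic FIR inverse comb filter: y[n] = x[n] - gain * x[n - delay].

    With gain = 1.0 the notches fall at integer multiples of
    sample_rate / delay, i.e. on a note and all of its harmonics.
    """
    out = []
    for n, x in enumerate(signal):
        delayed = signal[n - delay] if n >= delay else 0.0
        out.append(x - gain * delayed)
    return out

def tone(freq_hz, sample_rate, num_samples):
    """Unit-amplitude sine wave (illustrative test signal)."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate)
            for n in range(num_samples)]

def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

# Assumed values for the demonstration, not taken from the paper.
SAMPLE_RATE = 8000
DELAY = 20  # notches at multiples of 8000 / 20 = 400 Hz

on_notch = tone(400.0, SAMPLE_RATE, 2000)   # stands in for an instrument note
off_notch = tone(600.0, SAMPLE_RATE, 2000)  # lies halfway between two notches

# Skip the first DELAY samples, where the filter has no history yet.
on_notch_out = inverse_comb(on_notch, DELAY)[DELAY:]
off_notch_out = inverse_comb(off_notch, DELAY)[DELAY:]

print("RMS after filtering, 400 Hz tone:", rms(on_notch_out))
print("RMS after filtering, 600 Hz tone:", rms(off_notch_out))
```

In this sketch the 400 Hz tone is cancelled almost completely, while the 600 Hz tone passes through. A key-specific filterbank in the sense of the abstract would presumably combine several such filters, one per pitch class of the detected key, but that arrangement is an assumption here.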

Pages: 752-762

Number of pages: 11

