Singing voice detection for karaoke application

Cited by: 2
Authors
Shenoy, A [1]
Wu, YS [1]
Wang, Y [1]
Affiliation
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
Keywords
karaoke; singing voice; vocal segmentation; tonic; key; inverse comb filtering; rhythm; lyrics
DOI
10.1117/12.631645
Chinese Library Classification
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
We present a framework for detecting regions of singing voice in musical audio signals. The work is aimed at developing a robust lyrics transcriber for karaoke applications. The technique combines low-level audio features with higher-level musical knowledge of rhythm and tonality. Knowledge of the song's key is used to build a song-specific filterbank that attenuates pitched musical instruments. The audio is then processed in subbands to detect the musical octaves in which the vocals are present. Text processing of freely available lyrics approximates the duration of the sung passages, which in turn yields a dynamic threshold for vocal/non-vocal segmentation. Pairing audio and text processing in this way helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluations of the system are conducted, and various improvements are discussed.
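Two of the ideas named in the abstract and keywords, inverse comb filtering and a lyrics-derived dynamic threshold, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the single-delay FIR form of the filter, the per-frame "vocal score" input, and the quantile-style threshold rule are all assumptions made for illustration. The sketch also assumes the lyrics-based duration estimate has already been converted to a fraction of the song that is sung.

```python
import math


def inverse_comb(signal, sample_rate, f0, gain=1.0):
    """FIR inverse comb filter (illustrative): y[n] = x[n] - gain * x[n - D].

    D = round(sample_rate / f0). With gain = 1.0 the filter has nulls at
    integer multiples of sample_rate / D, i.e. near f0 and its harmonics.
    """
    delay = round(sample_rate / f0)
    if delay < 1:
        raise ValueError("f0 must not exceed the sample rate")
    return [
        x - gain * (signal[n - delay] if n >= delay else 0.0)
        for n, x in enumerate(signal)
    ]


def dynamic_threshold(frame_scores, sung_fraction):
    """Choose a threshold so that about sung_fraction of frames score above it.

    sung_fraction is a hypothetical input: estimated sung duration divided
    by total song duration, e.g. derived from the lyrics text.
    """
    if not frame_scores:
        raise ValueError("frame_scores must not be empty")
    if not 0.0 < sung_fraction <= 1.0:
        raise ValueError("sung_fraction must be in (0, 1]")
    ranked = sorted(frame_scores, reverse=True)
    k = max(1, min(len(ranked), round(sung_fraction * len(ranked))))
    return ranked[k - 1]


def segment(frame_scores, sung_fraction):
    """Label each frame True (vocal) or False (non-vocal)."""
    threshold = dynamic_threshold(frame_scores, sung_fraction)
    return [score >= threshold for score in frame_scores]


def sine(freq, sample_rate, num_samples):
    """Helper: unit-amplitude sine wave as a list of floats."""
    return [math.sin(2.0 * math.pi * freq * n / sample_rate)
            for n in range(num_samples)]
```

In this sketch, a tone whose period is a whole number of samples equal to the delay is cancelled once the delay line has filled, while a tone between the nulls passes through. A key-specific filterbank in the spirit of the abstract would presumably apply such a filter for each note of the key, but that construction is not detailed in the abstract and is left out here.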
Pages: 752-762
Page count: 11