Segmentation of singing voice within music signals

被引:0
|
作者
Setubal, PJ [1 ]
Noceti, S [1 ]
Seara, R [1 ]
机构
[1] Univ Fed Santa Catarina, Dept Elect Engn, LINSE, Circuits & Signal Proc Lab, BR-88040900 Florianopolis, SC, Brazil
来源
关键词
dynamic harmonic content; segmentation of singing voice; spectral peak track; ripple-shaped tracks; small pitch variation;
D O I
10.1117/12.571280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel approach to accomplish the automatic segmentation of singing voice within music signals, based on the difference between the dynamic harmonic content of singing voice and that of musical instrument signals. The obtained results are compared with those of another approach proposed in the literature, considering the same music database. For both techniques, an accuracy rate around 80% is obtained, even using a more rigorous performance measure for our approach only. As an advantage, the new procedure presents lower computational complexity. In addition, we discuss other results obtained by extending the tests over the whole database (upholding the same performance level) and by discriminating the error types (boundaries shifted in time, insertion and deletion of singing segments). The analysis of these errors suggests some alternative ways of reducing them, as for example, to adopt a confidence level based on a minimum harmonic content for the input signals. In this way, considering only signals with confidence level equal to one, the obtained performance is improved to almost 87%.
引用
收藏
页码:69 / 78
页数:10
相关论文
共 50 条
  • [32] Music Player Based on Emotion Recognition of voice signals
    Lukose, Sneha
    Upadhya, Savitha S.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 1751 - 1754
  • [33] The effectiveness of individual/small-group singing activities on kindergartners' use of singing voice and developmental music aptitude
    Rutkowski, J
    JOURNAL OF RESEARCH IN MUSIC EDUCATION, 1996, 44 (04) : 353 - 368
  • [34] Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
    Gong, Rong
    Serra, Xavier
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 716 - 720
  • [35] Validation and Adaptation of the Singing Voice Handicap Index for Egyptian Singing Voice
    Abou-Elsaad, Tamer
    Baz, Hemmat
    Afsah, Omayma
    Abo-Elsoud, Hend
    JOURNAL OF VOICE, 2017, 31 (01) : 130.e1 - 130.e6
  • [36] Singing just to make music: Betty Smith in her own voice (Interview)
    Bryant, Annie
    Jamison, Phil
    Jenkins, Carl
    Pepper, Susan
    Weinstein, Leila
    Beaver, Patricia D.
    Smith, Betty
    APPALACHIAN JOURNAL, 2007, 34 (02) : 160 - 184
  • [37] Transfer Learning for Improving Singing-voice Detection in Polyphonic Instrumental Music
    Hou, Yuanbo
    Soong, Frank K.
    Luang, Jian
    Li, Shengchen
    INTERSPEECH 2020, 2020, : 1236 - 1240
  • [38] Defining the Therapeutic Singing Voice: An Analysis of Four Music Therapists' Clinical Work
    Bingham, Martina C.
    MUSIC THERAPY PERSPECTIVES, 2019, 37 (02) : 176 - 186
  • [39] Realtime feedback of singing voice information for assisting students learning music therapy
    Kawahara, Hideki
    Haneishi, Eri
    Hagiwara, Kaori
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017, : 99 - 102
  • [40] Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
    Shi, Jiatong
    Guo, Shuai
    Qian, Tao
    Huo, Nan
    Hayashi, Tomoki
    Wu, Yuning
    Xu, Frank
    Chang, Xuankai
    Li, Huazhe
    Wu, Peter
    Watanabe, Shinji
    Jin, Qin
    INTERSPEECH 2022, 2022, : 4277 - 4281