Music detection from broadcast contents using convolutional neural networks with a Mel-scale kernel

被引:20
|
作者
Jang, Byeong-Yong [1 ]
Heo, Woon-Haeng [1 ]
Kim, Jung-Hyun [2 ]
Kwon, Oh-Wook [1 ]
机构
[1] Chungbuk Natl Univ, Sch Elect Engn, Cheongju, South Korea
[2] ETRI, Daejeon, South Korea
关键词
Music detection; Music segmentation; Convolutional neural networks; Mel-scale filter bank;
D O I
10.1186/s13636-019-0155-y
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new method for music detection from broadcasting contents using the convolutional neural networks with a Mel-scale kernel. In this detection task, music segments should be annotated from the broadcast data, where music, speech, and noise are mixed. The convolutional neural network is composed of a convolutional layer with kernel that is trained to extract robust features. The Mel-scale changes the kernel size, and the backpropagation algorithm trains the kernel shape. We used 52h of mixed broadcast data (25h of music) to train the convolutional network and 24h of collected broadcast data (ratio of music of 50-76%) for testing. The test data consisted of various genres (drama, documentary, news, kids, reality, and so on) that are broadcast in British English, Spanish, and Korean languages. The proposed method consistently showed better performance in all the three languages than the baseline system, and the F-score ranged from 86.5% for British data to 95.9% for Korean drama data. Our music detection system takes about 28s to process a 1-min signal using only one CPU with 4 cores.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Music detection from broadcast contents using convolutional neural networks with a Mel-scale kernel
    Byeong-Yong Jang
    Woon-Haeng Heo
    Jung-Hyun Kim
    Oh-Wook Kwon
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [2] Boundary detection in music structure analysis using convolutional neural networks
    Ullrich, Karen
    Schlüter, Jan
    Grill, Thomas
    Proceedings of the 15th International Society for Music Information Retrieval Conference, ISMIR 2014, 2014, : 417 - 422
  • [3] Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks
    Kopru, Berkay
    Erzin, Engin
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 105 - 109
  • [4] Driver Distraction Detection using MEL Cepstrum Representation of Galvanic Skin Responses and Convolutional Neural Networks
    Dehzangi, Omid
    Taherisadr, Mojtaba
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1481 - 1486
  • [5] Secondary Learning and Kernel Initialization on Auto-tagging of Music Events Using Convolutional Neural Networks
    Wu, Chi-Sheng
    Pan, Lei
    Soo, Von-Wun
    PROCEEDINGS OF THE 2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND ENGINEERING (IEEE-ICICE 2017), 2017, : 412 - 415
  • [6] MUSIC GENRE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS
    Subhani, G. M.
    Shravya, Perala
    Kumar, Gorighe Akhil
    Hrithika, Chitumalla
    Shrinivas, Chimalpade Ajay
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 1519 - 1526
  • [7] Detection of abnormal phonocardiograms through the Mel-frequency ceptrum and convolutional neural networks
    Duggento, Andrea
    Conti, Allegra
    Guerrisi, Maria
    Toschi, Nicola
    2020 11TH CONFERENCE OF THE EUROPEAN STUDY GROUP ON CARDIOVASCULAR OSCILLATIONS (ESGCO): COMPUTATION AND MODELLING IN PHYSIOLOGY NEW CHALLENGES AND OPPORTUNITIES, 2020,
  • [8] Region Prediction from Hungarian Folk Music Using Convolutional Neural Networks
    Kiss, Anna
    Sulyok, Csaba
    Bodo, Zalan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 581 - 594
  • [9] Detection of Arrhythmia Using Convolutional Neural Networks
    Greeshma, Burla
    Sireesha, Moturi
    Rao, S. N. Thirumala
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 21 - 30
  • [10] Supernovae Detection by Using Convolutional Neural Networks
    Cabrera-Vives, Guillermo
    Reyes, Ignacio
    Forster, Francisco
    Estevez, Pablo A.
    Maureira, Juan-Carlos
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 251 - 258