An evaluation of deep neural network models for music classification using spectrograms

Cited by: 18
Authors
Li, Jingxian [1, 2]
Han, Lixin [1]
Li, Xiaoshuang [1]
Zhu, Jun [1]
Yuan, Baohua [3]
Gou, Zhinan [4]
Affiliations
[1] Hohai Univ, Sch Comp & Informat, Nanjing, Peoples R China
[2] Jinling Inst Technol, Sch Software Engn, Nanjing, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[4] Hebei Univ Econ & Business, Coll Informat Technol, Shijiazhuang, Hebei, Peoples R China
Keywords
DNN models; Deep learning; Transfer learning; Music classification; Spectrograms; GENRE CLASSIFICATION; ARCHITECTURES
DOI
10.1007/s11042-020-10465-9
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Deep neural network (DNN) models have recently received considerable attention because their network structure can extract deep features that improve classification accuracy, and they have achieved excellent results in the image domain. However, because music and images differ in content form, transferring deep learning to music classification remains an open problem. To address this issue, we transfer state-of-the-art DNN models to music classification and evaluate their performance using spectrograms. First, we convert music audio files into spectrograms by modal transformation and then classify the music through deep learning. To alleviate overfitting during training, we propose a balanced trusted loss function and build the balanced trusted model ResNet50_trust. Finally, we compare the performance of different DNN models in music classification. In addition, this work adds music sentiment analysis based on a newly constructed music emotion dataset. Extensive experimental evaluations on three music datasets show that the proposed model, ResNet50_trust, consistently outperforms the other DNN models.
Pages: 4621-4647
Page count: 27
Related papers
50 in total
  • [41] Robust Emotion Classification using Neural Network Models
    Salari, Soorena
    Ansarian, Amin
    Atrianfar, Hajar
    [J]. 2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 190 - 194
  • [42] Music Track Recommendation Using Deep-CNN and Mel Spectrograms
    Yin, Tingrong
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (06): : 2130 - 2137
  • [43] A Deep Neural Network for Modeling Music
    Zhang, Pengjing
    Zheng, Xiaoqing
    Zhang, Wenqiang
    Li, Siyan
    Qian, Sheng
    He, Wenqi
    Zhang, Shangtong
    Wang, Ziyuan
    [J]. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 379 - 386
  • [44] Evaluation of transfer learning in deep convolutional neural network models for cardiac short axis slice classification
    Ho, Namgyu
    Kim, Yoon-Chul
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [45] Evaluation of transfer learning in deep convolutional neural network models for cardiac short axis slice classification
    Namgyu Ho
    Yoon-Chul Kim
    [J]. Scientific Reports, 11
  • [46] Handwritten Music Symbol Classification Using Deep Convolutional Neural Networks
    Lee, Sangkuk
    Son, Sung Joon
    Oh, Jiyong
    Kwak, Nojun
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SECURITY (ICISS), 2014, : 99 - 103
  • [47] Deep Neural Network Models for Paraphrased Text Classification in the Arabic Language
    Mahmoud, Adnen
    Zrigui, Mounir
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 3 - 16
  • [48] Music Emotion Classification with Deep Neural Nets
    Pandeya, Yagya Raj
    Bhattarai, Bhuwan
    Lee, Joonwhoan
    [J]. PROCEEDINGS OF 2021 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES (ICMLT 2021), 2021, : 132 - 137
  • [49] Metrics Evaluation of Bell Pepper Disease Classification Using Deep Convolutional Neural Network (DCNN)
    [J]. Thenmozhi, M. (thenmozm@srmist.edu.in), 1600, Springer Science and Business Media Deutschland GmbH (1095):
  • [50] Music Emotion Classification Method Using Improved Deep Belief Network
    Tong, Guiying
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022