An evaluation of deep neural network models for music classification using spectrograms

Cited by: 18
Authors
Li, Jingxian [1 ,2 ]
Han, Lixin [1 ]
Li, Xiaoshuang [1 ]
Zhu, Jun [1 ]
Yuan, Baohua [3 ]
Gou, Zhinan [4 ]
Affiliations
[1] Hohai Univ, Sch Comp & Informat, Nanjing, Peoples R China
[2] Jinling Inst Technol, Sch Software Engn, Nanjing, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[4] Hebei Univ Econ & Business, Coll Informat Technol, Shijiazhuang, Hebei, Peoples R China
Keywords
DNN models; Deep learning; Transfer learning; Music classification; Spectrograms; Genre classification; Architectures
DOI
10.1007/s11042-020-10465-9
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep Neural Network (DNN) models have recently received considerable attention because their network structures can extract deep features that improve classification accuracy, and they have achieved excellent results in the image domain. However, because music and images differ in content form, transferring deep learning to music classification remains a problem. To address this issue, in this paper we transfer state-of-the-art DNN models to music classification and evaluate their performance using spectrograms. First, we convert music audio files into spectrograms by modal transformation and then classify the music through deep learning. To alleviate overfitting during training, we propose a balanced trusted loss function and build the balanced trusted model ResNet50_trust. Finally, we compare the performance of different DNN models in music classification. Furthermore, this work adds music sentiment analysis based on a newly constructed music emotion dataset. Extensive experimental evaluations on three music datasets show that our proposed model ResNet50_trust consistently outperforms the other DNN models.
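The audio-to-spectrogram step mentioned in the abstract is not specified in this record. As a rough illustration only, the sketch below computes a log-magnitude spectrogram with a short-time Fourier transform in plain Python. The function names, frame length, hop size, Hann window, and synthetic test tone are all assumptions made for the example and are not the authors' settings; a practical pipeline would more likely rely on an FFT library.

```python
import cmath
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def log_spectrogram(signal, frame_len=64, hop=32, eps=1e-10):
    """Return a list of frames; each frame lists log-magnitudes (dB)
    for frequency bins 0..frame_len//2, using a naive DFT per frame."""
    window = hann(frame_len)
    n_bins = frame_len // 2 + 1
    frames = []
    # Slide a window over the signal, keeping only complete frames.
    for start in range(0, len(signal) - frame_len + 1, hop):
        chunk = [signal[start + i] * window[i] for i in range(frame_len)]
        row = []
        for k in range(n_bins):
            acc = sum(
                chunk[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            # eps avoids log10(0) for silent bins.
            row.append(20 * math.log10(abs(acc) + eps))
        frames.append(row)
    return frames


# Hypothetical test signal: a 1 kHz sine sampled at 8 kHz for 0.1 s.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(800)]
spec = log_spectrogram(tone)

# Strongest bin of the first frame, converted to Hz (bin width = 8000 / 64).
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 64
```

Each row of `spec` is one time frame and each column one frequency bin, so the result can be rendered as an image and passed to an image classifier, which is the general idea behind spectrogram-based music classification.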
Pages: 4621 - 4647
Number of pages: 27
Related papers
50 in total
  • [1] An evaluation of deep neural network models for music classification using spectrograms
    Jingxian Li
    Lixin Han
    Xiaoshuang Li
    Jun Zhu
    Baohua Yuan
    Zhinan Gou
    [J]. Multimedia Tools and Applications, 2022, 81 : 4621 - 4647
  • [2] An evaluation of Convolutional Neural Networks for music classification using spectrograms
    Costa, Yandre M. G.
    Oliveira, Luiz S.
    Silla, Carlos N., Jr.
    [J]. APPLIED SOFT COMPUTING, 2017, 52 : 28 - 38
  • [3] Seismic Event Classification Using Spectrograms and Deep Neural Nets
    Salazar, Aaron
    Arroyo, Rodrigo
    Perez, Noel
    Benitez, Diego S.
    [J]. APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, COLCACI 2020, 2021, 1346 : 16 - 30
  • [4] Singer Gender Classification using Feature-based and Spectrograms with Deep Convolutional Neural Network
    Jitendra, Mukkamala S. N., V
    Radhika, Y.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 135 - 144
  • [5] Frequency line detection in spectrograms using a deep neural network with attention
    Jiang, DingLin
    Luo, Xinwei
    Shen, Qifan
    [J]. Journal of the Acoustical Society of America, 2024, 156 (05) : 3204 - 3216
  • [6] "Multilingual" Deep Neural Network For Music Genre Classification
    Dai, Jia
    Liu, Wenju
    Ni, Chongjia
    Dong, Like
    Yang, Hong
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015 : 2907 - 2911
  • [7] Performance Evaluation of Multi-class Sentiment Classification Using Deep Neural Network Models Optimised for Binary Classification
    Merwick, Fiachra
    Bi, Yaxin
    Nicholl, Peter
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 624 - 635
  • [8] Understanding human emotions through speech spectrograms using deep neural network
    Gupta, Vedika
    Juyal, Stuti
    Hu, Yu-Chen
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (05) : 6944 - 6973
  • [9] Novel mathematical model for the classification of music and rhythmic genre using deep neural network
    Patil, Swati A.
    Pradeepini, G.
    Komati, Thirupathi Rao
    [J]. JOURNAL OF BIG DATA, 2023, 10 (01)
  • [10] Novel mathematical model for the classification of music and rhythmic genre using deep neural network
    Swati A. Patil
    G. Pradeepini
    Thirupathi Rao Komati
    [J]. Journal of Big Data, 10