Music emotion recognition using deep convolutional neural networks

被引:0
|
作者
Li, Ting [1 ]
机构
[1] Zibo Vocat Inst, Coll Innovat & Entrepreneurship, Zibo, Shandong, Peoples R China
关键词
Deep convolutional neural network; music emotion recognition; audio feature extraction; long short term memory; self-attention network;
D O I
10.3233/JCM-247551
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Traditional music emotion recognition (MER) faces problems such as lack of contextual information, inaccurate recognition of music emotions, and difficulty in handling nonlinear relationships. This article first used long short-term memory (LSTM) networks to capture global information and contextual relationships of music. Subsequently, the DCNN was chosen to process sequence data and capture global dependencies to improve the accuracy of MER. Finally, a MER model was constructed based on DCNN to recognize and classify music emotions. This article obtained the impact of different parameter values on model training iterations by adjusting hyperparameters related to training. The optimal values for learning rate mu, momentum coefficient alpha, weight attenuation coefficient gamma, and Dropout coefficient were 0.01, 0.7, 0.0003, and 0.5, respectively. The DCNN used in this article was iteratively trained with recurrent neural networks, convolutional recurrent neural networks, and transform domain neural networks for audio spectrograms, and the results were compared. The experimental findings indicated that the spectral recognition accuracy of DCNN was stable at 95.68%, far higher than the other three different networks. The results showed that the DCNN method used in this article could more accurately distinguish different negative emotions and positive emotions.
引用
收藏
页码:3063 / 3078
页数:16
相关论文
共 50 条
  • [1] Music emotion recognition using convolutional long short term memory deep neural networks
    Hizlisoy, Serhat
    Yildirim, Serdar
    Tufekci, Zekeriya
    [J]. ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2021, 24 (03): : 760 - 767
  • [2] Music instrument recognition using deep convolutional neural networks
    Solanki A.
    Pandey S.
    [J]. International Journal of Information Technology, 2022, 14 (3) : 1659 - 1668
  • [3] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59
  • [4] Recognition of emotion in music based on deep convolutional neural network
    Sarkar, Rajib
    Choudhury, Sombuddha
    Dutta, Saikat
    Roy, Aneek
    Saha, Sanjoy Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (1-2) : 765 - 783
  • [5] Recognition of emotion in music based on deep convolutional neural network
    Rajib Sarkar
    Sombuddha Choudhury
    Saikat Dutta
    Aneek Roy
    Sanjoy Kumar Saha
    [J]. Multimedia Tools and Applications, 2020, 79 : 765 - 783
  • [6] Facial emotion recognition based music system using convolutional neural networks
    Sana, S. K.
    Sruthi, G.
    Suresh, D.
    Rajesh, G.
    Reddy, G. V. Subba
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 62 : 4699 - 4706
  • [7] Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Mansor, Hasmah
    Kartiwi, Mira
    Ismail, Nanang
    [J]. PROCEEDING OF 2020 6TH INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2020,
  • [8] FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
    Dossou, Bonaventure F. P.
    Gbenou, Yeno K. S.
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3526 - 3531
  • [9] Facial Emotion Recognition using Deep Convolutional Networks
    Mohammadpour, Mostafa
    Khaliliardali, Hossein
    Hashemi, Seyyed Mohammad R.
    AlyanNezhadi, Mohammad M.
    [J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2017, : 17 - 21
  • [10] Facial Emotion Recognition using Convolutional Neural Networks
    Rzayeva, Zeynab
    Alasgarov, Emin
    [J]. 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2019), 2019, : 91 - 95