Multi-label emotion recognition from Indian classical music using gradient descent SNN model

被引:6
|
作者
Tiple, Bhavana [1 ]
Patwardhan, Manasi [2 ]
机构
[1] Dr Vishwanath Karad MIT World Peace Univ, Sch SCET, Pune, Maharashtra, India
[2] TCS Innovat Labs, Pune, Maharashtra, India
关键词
Convolutional neural network; Spiking neural network; Gradient descent; Temporal; Spectral; Short Term Fourier Transform; SPIKING NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1007/s11042-022-11975-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Music enthusiasts are growing exponentially and based on this, many songs are being introduced to the market and stored in signal music libraries. Due to this development emotion recognition model from music contents has received increasing attention in today's world. Of these technologies, a novel Music Emotion Recognition (MER) system is introduced to meet the ever-increasing demand for easy and efficient access to music information. Even though this system was well-developed it lacks in maintaining accuracy of the system and finds difficulty in predicting multi-label emotion type. To address these shortcomings, in this research article, a novel MER system is developed by inter-linking the pre-processing, feature extraction and classification steps. Initially, pre-processing step is employed to convert larger audio files into smaller audio frames. Afterwards, music related temporal, spectral and energy features are extracted for those pre-processed frames which are subjected to the proposed gradient descent based Spiking Neural Network (SNN) classifier. While learning SNN, it is important to determine the optimal weight values for reducing the training error so that gradient descent optimization approach is adopted. To prove the effectiveness of proposed research, proposed model is compared with conventional classification algorithms. The proposed methodology was experimentally tested using various evaluation metrics and it achieves 94.55% accuracy. Hence the proposed methodology attains a good accuracy measure and outperforms well than other algorithms.
引用
收藏
页码:8853 / 8870
页数:18
相关论文
共 50 条
  • [21] Multi-modal, Multi-task and Multi-label for Music Genre Classification and Emotion Regression
    Pandeya, Yagya Raj
    You, Jie
    Bhattarai, Bhuwan
    Lee, Joonwhoan
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1042 - 1045
  • [22] Multi-label, multi-task CNN approach for context-based emotion recognition
    Bendjoudi, Ilyes
    Vanderhaegen, Frederic
    Hamad, Denis
    Dornaika, Fadi
    INFORMATION FUSION, 2021, 76 : 422 - 428
  • [23] CARAT: Contrastive Feature Reconstruction and Aggregation for Multi-Modal Multi-Label Emotion Recognition
    Peng, Cheng
    Chen, Ke
    Shou, Lidan
    Chen, Gang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14581 - 14589
  • [24] Multi-Label Emotion Tagging for Online News by Supervised Topic Model
    Zhang, Ying
    Su, Lili
    Yang, Zhifan
    Zhao, Xue
    Yuan, Xiaojie
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 67 - 79
  • [25] A multi-genre model for music emotion recognition using linear regressors
    Griffiths, Darryl
    Cunningham, Stuart
    Weinel, Jonathan
    Picking, Richard
    JOURNAL OF NEW MUSIC RESEARCH, 2021, 50 (04) : 355 - 372
  • [26] An optimized multi-label TSK fuzzy system for emotion recognition of multimodal physiological signals
    Li, Yixuan
    Fu, Zhongzheng
    He, Xinrun
    Huang, Jian
    2022 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS, CBS, 2022, : 362 - 367
  • [27] Method of Multi-Label Visual Emotion Recognition Fusing Fore-Background Features
    Feng, Yuehua
    Wei, Ruoyan
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [28] Robust partial face recognition using multi-label attributes
    Sang, Gaoli
    Zeng, Dan
    Yan, Chao
    Veldhuis, Raymond
    Spreeuwers, Luuk
    INTELLIGENT DATA ANALYSIS, 2024, 28 (01) : 377 - 392
  • [29] Robots with Language: Multi-Label Visual Recognition Using NLP
    Yang, Yezhou
    Teo, Ching L.
    Fermueller, Cornelia
    Aloimonos, Yiannis
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4256 - 4262
  • [30] Emotion Recognition From Singing Voices Using Contemporary Commercial Music and Classical Styles
    Hakanpaa, Tua
    Waaramaa, Teija
    Laukkanen, Anne-Maria
    JOURNAL OF VOICE, 2019, 33 (04) : 501 - 509