A hybrid deep learning approach for classification of music genres using wavelet and spectrogram analysis

被引:0
|
作者
Kalyan Kumar Jena
Sourav Kumar Bhoi
Sonalisha Mohapatra
Sambit Bakshi
机构
[1] Parala Maharaja Engineering College (Government),Department of Computer Science and Engineering
[2] National Institute of Technology (NIT),Department of Computer Science and Engineering
关键词
Music genre classification; Deep learning; Transfer learning; Multimodal;
D O I
暂无
中图分类号
学科分类号
摘要
Manual classification of millions of songs of the same or different genres is a challenging task for human beings. Therefore, there should be a machine intelligent model that can classify the genres of the songs very accurately. In this paper, a deep learning-based hybrid model is proposed for the analysis and classification of different music genre files. The proposed hybrid model mainly uses a combination of multimodal and transfer learning-based models for classification. This model is analyzed using GTZAN and Ballroom datasets. The GTZAN dataset contains 1000 music files classified with 10 different kinds of music genres such as Metal, Classical, Rock, Reggae, Pop, Disco, Blues, Country, Hip-Hop and Jazz, and the duration of each music file is 30 s. The Ballroom dataset contains 698 music files classified into 8 different kinds of music genres such as Tango, ChaChaCha, Rumba, Viennese waltz, Jlive, Waltz, Quickstep and Samba, and the duration of each music file is 30 s. The performance of the model is evaluated using the Python tool. The macro-average and weighted average are taken for computing the percentage of accuracy of each model. From the results, it is found that the proposed hybrid model is able to perform better as compared to other deep learning models such as the convolution neural network model, transfer learning-based model, multimodal model, machine learning models and other existing models in terms of training accuracy, validation accuracy, training loss, validation loss, precision, recall, F1-score and support.
引用
收藏
页码:11223 / 11248
页数:25
相关论文
共 50 条
  • [1] A hybrid deep learning approach for classification of music genres using wavelet and spectrogram analysis
    Jena, Kalyan Kumar
    Bhoi, Sourav Kumar
    Mohapatra, Sonalisha
    Bakshi, Sambit
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (15): : 11223 - 11248
  • [2] A Deep Learning Approach for Mapping Music Genres
    Panwar, Sharaj
    Das, Arun
    Roopaei, Mehdi
    Rad, Paul
    [J]. 2017 12TH SYSTEM OF SYSTEMS ENGINEERING CONFERENCE (SOSE), 2017,
  • [3] The Classification of Music and Art Genres under the Visual Threshold of Deep Learning
    Zheng, Zhiqiang
    [J]. Computational Intelligence and Neuroscience, 2022, 2022
  • [4] Document Classification by Using Hybrid Deep Learning Approach
    Bui Thanh Hung
    [J]. CONTEXT-AWARE SYSTEMS AND APPLICATIONS, AND NATURE OF COMPUTATION AND COMMUNICATION, 2019, 298 : 167 - 177
  • [5] The Classification of Music and Art Genres under the Visual Threshold of Deep Learning
    Zheng, Zhiqiang
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [6] Speech and Music Classification Using Hybrid Form of Spectrogram and Fourier Transformation
    Neammalai, Piyawat
    Phimoltares, Suphakant
    Lursinsap, Chidchanok
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [7] Automated Classification of Electrocardiograms Using Wavelet Analysis and Deep Learning
    Demonbreun, Andrew
    Mirsky, Grace M.
    [J]. 2020 COMPUTING IN CARDIOLOGY, 2020,
  • [8] A Deep Learning Method Approach for Sleep Stage Classification with EEG Spectrogram
    Li, Chengfan
    Qi, Yueyu
    Ding, Xuehai
    Zhao, Junjuan
    Sang, Tian
    Lee, Matthew
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (10)
  • [9] Classification of Brain Tumor using Hybrid Deep Learning Approach
    Singh, Manu
    Shrimali, Vibhakar
    [J]. BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2022, 13 (02): : 308 - 327
  • [10] Diabetic Retinopathy Classification Using Hybrid Deep Learning Approach
    Menaouer B.
    Dermane Z.
    El Houda Kebir N.
    Matta N.
    [J]. SN Computer Science, 3 (5)