A hybrid deep learning approach for classification of music genres using wavelet and spectrogram analysis

Cited by: 0
Authors
Kalyan Kumar Jena
Sourav Kumar Bhoi
Sonalisha Mohapatra
Sambit Bakshi
Affiliations
[1] Parala Maharaja Engineering College (Government),Department of Computer Science and Engineering
[2] National Institute of Technology (NIT),Department of Computer Science and Engineering
Keywords
Music genre classification; Deep learning; Transfer learning; Multimodal
DOI
Not available
Abstract
Manually classifying millions of songs by genre is a challenging task for humans, so a machine learning model that classifies song genres accurately is needed. This paper proposes a deep learning-based hybrid model for the analysis and classification of music files by genre. The proposed hybrid model combines multimodal and transfer learning-based models for classification. The model is analyzed using the GTZAN and Ballroom datasets. The GTZAN dataset contains 1000 music files classified into 10 genres (Metal, Classical, Rock, Reggae, Pop, Disco, Blues, Country, Hip-Hop and Jazz), and each music file is 30 s long. The Ballroom dataset contains 698 music files classified into 8 genres (Tango, ChaChaCha, Rumba, Viennese waltz, Jive, Waltz, Quickstep and Samba), and each music file is 30 s long. The performance of the model is evaluated using Python. Macro-averaged and weighted-averaged scores are used to compute the accuracy percentage of each model. The results show that the proposed hybrid model performs better than other deep learning models, such as the convolutional neural network model, the transfer learning-based model and the multimodal model, as well as machine learning models and other existing models, in terms of training accuracy, validation accuracy, training loss, validation loss, precision, recall, F1-score and support.
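The abstract reports macro-averaged and weighted-averaged scores alongside precision, recall, F1-score and support. The following is a minimal sketch of how those two averages differ, not the authors' evaluation code: the helper names (`per_class_scores`, `macro_average`, `weighted_average`) are hypothetical, and the genre labels are invented toy data rather than results from the paper.

```python
from collections import Counter


def per_class_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1, support)} for each true label."""
    support = Counter(y_true)
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in sorted(support):
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1, support[label])
    return scores


def macro_average(scores, index):
    """Unweighted mean over classes: every genre counts equally."""
    return sum(s[index] for s in scores.values()) / len(scores)


def weighted_average(scores, index):
    """Mean over classes weighted by support (number of true samples)."""
    total = sum(s[3] for s in scores.values())
    return sum(s[index] * s[3] for s in scores.values()) / total


# Toy labels (assumed for illustration only): 4 Jazz clips, 2 Rock clips.
y_true = ["Jazz", "Jazz", "Jazz", "Jazz", "Rock", "Rock"]
y_pred = ["Jazz", "Jazz", "Jazz", "Rock", "Rock", "Rock"]

scores = per_class_scores(y_true, y_pred)
macro_f1 = macro_average(scores, 2)        # index 2 selects F1
weighted_f1 = weighted_average(scores, 2)
print(f"macro F1 = {macro_f1:.4f}, weighted F1 = {weighted_f1:.4f}")
```

On a balanced dataset the two averages coincide; they diverge only when class supports differ, as in the toy labels here.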
Pages: 11223-11248
Number of pages: 25
Related papers
50 in total
  • [21] A Hybrid Deep Learning Approach for Automatic Fish Classification
    Chhabra, Harshit Singh
    Srivastava, Akshay Kumar
    Nijhawan, Rahul
    [J]. PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 427 - 436
  • [22] Gas turbine failure classification using acoustic emissions with wavelet analysis and deep learning
    Nashed, M. S.
    Renno, J.
    Mohamed, M. S.
    Reuben, R. L.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [23] Heart Sound Classification Using Wavelet Analysis Approaches and Ensemble of Deep Learning Models
    Lee, Jin-A
    Kwak, Keun-Chang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [24] Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning
    Carvalho, Silvestre
    Gomes, Elsa Ferreira
    [J]. VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) : 39 - 54
  • [25] Using Hybrid Deep Learning Models of Sentiment Analysis and Item Genres in Recommender Systems for Streaming Services
    Dang, Cach N.
    Moreno-Garcia, Maria N.
    De la Prieta, Fernando
    [J]. ELECTRONICS, 2021, 10 (20)
  • [26] A Deep Learning Approach for Tree Root Detection using GPR Spectrogram Imagery
    Lantini, Livia
    Massimi, Federica
    Tosti, Fabio
    Alani, Amir M.
    Benedetto, Francesco
    [J]. 2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 391 - 394
  • [27] Music genres classification using text categorization method
    Chen, Kai
    Gao, Sheng
    Zhu, Yongwei
    Sun, Qibin
    [J]. 2006 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2006, : 221 - +
  • [28] An intelligent music genre analysis using feature extraction and classification using deep learning techniques
    Wang Hongdan
    SalmiJamali, Siti
    Chen Zhengping
    Shan Qiaojuan
    Ren Le
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [29] Fingerprint classification using deep learning approach
    Rim, Beanbonyka
    Kim, Junseob
    Hong, Min
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 35809 - 35825