Generative Adversarial Networks Based Framework for Music Genre Classification

Cited by: 0
Authors
Pulkit Dwivedi [1]
Benazir Islam [2]
Affiliations
[1] IILM University, School of Computer Science and Engineering
[2] New Jersey Institute of Technology
Keywords
Music genre classification; Generative adversarial networks (GANs); Deep learning; Feature extraction; Classification models
DOI
10.1007/s42979-024-03531-8
Abstract
Music genre classification plays a crucial role in organizing and exploring large music collections, enabling personalized music recommendations, and enhancing music-related services. This paper presents a novel approach to music genre classification using Generative Adversarial Networks (GANs), Fourier Transform, and Wavelet Transform. The main objective is to leverage the power of GANs to extract discriminative features from audio data and accurately classify music into different genres. The proposed methodology involves two key components: the generator and the discriminator. The generator generates synthetic audio samples that resemble real music, while the discriminator learns to distinguish between real and synthetic audio samples. By training the GAN on a diverse dataset of music samples from various genres, the discriminator becomes proficient in recognizing genre-specific features. To enhance classification accuracy, Fourier Transform and Wavelet Transform are applied to extract both frequency and time-domain features from the audio data. Additionally, classifiers such as support vector machines and neural networks are employed to effectively distinguish between different music genres. The experimental results demonstrate the effectiveness of the proposed approach across multiple datasets. The method achieves 98.97% accuracy on the GTZAN dataset, 92.47% accuracy on the FMA-Small dataset, and 92.98% accuracy on the ISMIR Genre dataset, significantly outperforming traditional classification methods. These results highlight the power of GANs, Fourier Transform, and Wavelet Transform in enhancing the accuracy and robustness of music genre classification.
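The abstract does not say which Fourier or wavelet features are used, so the following is only a minimal sketch of the feature-extraction idea, not the authors' pipeline. It assumes two spectral descriptors (centroid and bandwidth) from the Fourier transform and per-level coefficient energies from a hand-written Haar wavelet decomposition; the function names, the choice of the Haar wavelet, the number of levels, and the synthetic 440 Hz test tone are all illustrative assumptions.

```python
import numpy as np


def fourier_features(signal, sample_rate):
    """Spectral centroid and bandwidth (Hz) from the magnitude spectrum."""
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    total = spectrum.sum() + 1e-12  # guard against an all-zero clip
    centroid = (freqs * spectrum).sum() / total
    bandwidth = np.sqrt((((freqs - centroid) ** 2) * spectrum).sum() / total)
    return np.array([centroid, bandwidth])


def haar_dwt(signal):
    """One level of the orthonormal Haar DWT: (approximation, detail)."""
    x = np.asarray(signal, dtype=float)
    if len(x) % 2:
        x = x[:-1]  # drop the last sample so that samples pair up
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)


def wavelet_features(signal, levels=3):
    """Detail-coefficient energy per level, then final approximation energy."""
    energies = []
    approx = np.asarray(signal, dtype=float)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        energies.append(np.sum(detail ** 2))
    energies.append(np.sum(approx ** 2))
    return np.array(energies)


def extract_features(signal, sample_rate, levels=3):
    """Concatenate the Fourier and wavelet descriptors into one vector."""
    return np.concatenate(
        [fourier_features(signal, sample_rate), wavelet_features(signal, levels)]
    )


# Synthetic stand-in for an audio clip (assumption: 1 s of a 440 Hz tone).
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
clip = np.sin(2 * np.pi * 440.0 * t)
features = extract_features(clip, sample_rate)
```

A vector like `features` could then be passed to a classifier such as a support vector machine, as the abstract describes; how the paper combines these features with the GAN discriminator is not stated in this record.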
Related papers
50 in total
  • [41] Spec2Spec: Towards the general framework of music processing using generative adversarial networks
    Choi, Hyeong-Seok
    Lee, Juheon
    Lee, Kyogu
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2020, 41 (01) : 160 - 165
  • [42] Music Genre Classification using Deep Neural Networks
    Yimer, Mekonen Hiwot
    Yu, Yongbin
    Adu, Kwabena
    Favour, Ekong
    Liyih, Sinishaw Melikamu
    Patamia, Rutherford Agbeshi
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2384 - 2391
  • [43] Hierarchical mining with complex networks for music genre classification
    Salazar, Andres Eduardo Coca
    DIGITAL SIGNAL PROCESSING, 2022, 127
  • [44] Deep Belief Networks for Automatic Music Genre Classification
    Yang, Xiaohong
    Chen, Qingcai
    Zhou, Shusen
    Wang, Xiaolong
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2444 - 2447
  • [45] Convolutional Neural Networks Approach for Music Genre Classification
    Cheng, Yu-Huei
    Chang, Pang-Ching
    Kuo, Che-Nan
    2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 399 - 403
  • [46] Wide Ensembles of Neural Networks in Music Genre Classification
    Kostrzewa, Daniel
    Mazur, Wojciech
    Brzeski, Robert
    COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 64 - 71
  • [47] Improved Music Genre Classification with Convolutional Neural Networks
    Zhang, Weibin
    Lei, Wenkang
    Xu, Xiangmin
    Xing, Xiaofeng
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3304 - 3308
  • [48] MUSIC GENRE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS
    Subhani, G. M.
    Shravya, Perala
    Kumar, Gorighe Akhil
    Hrithika, Chitumalla
    Shrinivas, Chimalpade Ajay
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 1519 - 1526
  • [49] Music Feature Maps with Convolutional Neural Networks for Music Genre Classification
    Senac, Christine
    Pellegrini, Thomas
    Mouret, Florian
    Pinquier, Julien
    PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [50] Music Creation Technology Based on Generative Adversarial Network
    Liu, Feng
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 626 - 632