Music Feature Maps with Convolutional Neural Networks for Music Genre Classification

Cited by: 30
Authors
Senac, Christine [1]
Pellegrini, Thomas [1]
Mouret, Florian [1]
Pinquier, Julien [1]
Affiliations
[1] Univ Toulouse, IRIT, 118 Route Narbonne, F-31062 Toulouse, France
Keywords
convolutional neural networks; music features; music classification
DOI
10.1145/3095713.3095733
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Deep learning is increasingly used for music genre classification, in particular Convolutional Neural Networks (CNNs) that take as input a spectrogram, treated as an image in which different types of structure are sought. However, neural networks are criticized because the relationships they learn from a spectrogram are difficult to understand. We therefore propose to feed a CNN with a small set of eight music features chosen along three main musical dimensions: dynamics, timbre, and tonality. With CNNs trained so that filter dimensions are interpretable in time and frequency, results show that these eight music features are more efficient than the 513 frequency bins of a spectrogram, and that late score fusion between systems based on both feature types reaches 91% accuracy on the GTZAN database.
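The late score fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state how the scores are combined, so the weighted average below is an assumption, and the function names (`late_fusion`, `predict`), the `weight_a` parameter, and the example scores are all hypothetical. Only the ten GTZAN genre labels are taken as given.

```python
from typing import List, Sequence

# The ten genre classes of the GTZAN database.
GENRES = ["blues", "classical", "country", "disco", "hiphop",
          "jazz", "metal", "pop", "reggae", "rock"]


def late_fusion(scores_a: Sequence[float],
                scores_b: Sequence[float],
                weight_a: float = 0.5) -> List[float]:
    """Combine per-class scores from two systems by weighted averaging.

    Assumption: a simple convex combination; the paper's actual fusion
    rule is not given in the abstract.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both systems must score the same classes")
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")
    return [weight_a * a + (1.0 - weight_a) * b
            for a, b in zip(scores_a, scores_b)]


def predict(scores: Sequence[float], labels: Sequence[str]) -> str:
    """Return the label with the highest fused score."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best]


# Hypothetical per-class scores for one audio clip (not data from the paper).
feature_cnn = [0.05, 0.02, 0.03, 0.05, 0.05, 0.40, 0.05, 0.05, 0.05, 0.25]
spectro_cnn = [0.30, 0.02, 0.03, 0.05, 0.05, 0.35, 0.05, 0.05, 0.05, 0.05]

fused = late_fusion(feature_cnn, spectro_cnn)
print(predict(fused, GENRES))
```

Fusing at the score level keeps the two systems independent: each CNN is trained on its own input (music features or spectrogram), and only their output scores are combined at decision time.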
Pages: 5