Parallel attention of representation global time–frequency correlation for music genre classification

被引:0
|
作者
Zhifang Wen
Aibin Chen
Guoxiong Zhou
Jizheng Yi
Weixiong Peng
机构
[1] Central South University of Forestry and Technology,Institute of Artificial Intelligence Application
[2] Hunan Zixing Artificial Intelligence Technology Group Co,undefined
来源
关键词
Music genre classification; Attention mechanism; Convolutional neural network; Global time–frequency correlation; Mel-spectrogram;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre classification (MGC) is an indispensable branch of music information retrieval. With the prevalence of end-to-end learning, the research on MGC has made some breakthroughs. However, the limited receptive field of convolutional neural network (CNN) cannot capture a correlation between temporal frames of sounding at any moment and sound frequencies of all vibrations in the song. Meanwhile, time–frequency information of channels is not equally important. In order to deal with the above problems, we apply dual parallel attention (DPA) in CNN-5 to focus on global dependencies. First, we propose parallel channel attention (PCA) to build global time–frequency dependencies in the song and study the influence of different weighting methods for PCA. Next, we design dual parallel attention, which focuses on global time–frequency dependencies in the song and adaptively calibrates contribution of different channels to feature map. Then, we analyzed the effect of applying different numbers and positions of DPA in CNN-5 for performance and compared DPA with multiple attention mechanisms. The results on GTZAN dataset demonstrated that the proposed method achieves a classification accuracy of 91.4%, and DPA has the highest performance.
引用
收藏
页码:10211 / 10231
页数:20
相关论文
共 50 条
  • [1] Parallel attention of representation global time-frequency correlation for music genre classification
    Wen, Zhifang
    Chen, Aibin
    Zhou, Guoxiong
    Yi, Jizheng
    Peng, Weixiong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10211 - 10231
  • [2] Deep attention based music genre classification
    Yu, Yang
    Luo, Sen
    Liu, Shenglan
    Qiao, Hong
    Liu, Yang
    Feng, Lin
    NEUROCOMPUTING, 2020, 372 : 84 - 91
  • [3] Music Genre Classification using Residual Attention Network
    Nguyen, Quang H.
    Do, Trang T. T.
    Chu, Thanh B.
    Trinh, Loan, V
    Nguyen, Dung H.
    Phan, Cuong, V
    Phan, Tuan A.
    Doan, Dung, V
    Pham, Hung N.
    Nguyen, Binh P.
    Chua, Matthew C. H.
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2019, : 115 - 119
  • [4] Music Genre Classification Algorithm Based on Multihead Attention Mechanism
    Cheng, Peng
    ADVANCES IN MULTIMEDIA, 2022, 2022
  • [5] Genre Classification Empowered by Knowledge-Embedded Music Representation
    Ding, Han
    Zhai, Linwei
    Zhao, Cui
    Wang, Fei
    Wang, Ge
    Xi, Wei
    Wang, Zhi
    Zhao, Jizhong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2764 - 2776
  • [6] Music Genre Classification Using Frequency Domain Features
    Sarkar, Rajib
    Biswas, Nimagna
    Chakraborty, Saurajit
    PROCEEDINGS OF 2018 FIFTH INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2018,
  • [7] Time-Frequency Analysis for Music Genre Classification by using Wavelet Package Decompositions
    Chou, Chih-Hsun
    Shi, Jun-Han
    PROCEEDINGS OF THE 2018 1ST IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE INNOVATION AND INVENTION (ICKII 2018), 2018, : 134 - 137
  • [8] A robust music genre classification approach for global and regional music datasets evaluation
    de Sousa, Jefferson Martins
    Pereira, Eanes Torres
    Veloso, Luciana Ribeiro
    2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 109 - 113
  • [9] Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals
    Mehdi Banitalebi-Dehkordi
    Amin Banitalebi-Dehkordi
    Journal of Signal Processing Systems, 2014, 74 : 273 - 280
  • [10] Representation Learning vs. Handcrafted Features for Music Genre Classification
    Pereira, Rodolfo M.
    Costa, Yandre M. G.
    Aguiar, Rafael L.
    Britto, Alceu S., Jr.
    Oliveira, Luiz E. S.
    Silla, Carlos N., Jr.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,