Genre Classification Empowered by Knowledge-Embedded Music Representation

Cited by: 1
Authors
Ding, Han [1 ]
Zhai, Linwei [1 ]
Zhao, Cui [1 ]
Wang, Fei [1 ]
Wang, Ge [1 ]
Xi, Wei [1 ]
Wang, Zhi [1 ]
Zhao, Jizhong [1 ]
Affiliation
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Xian 710049, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Instruments; Music; Knowledge graphs; Feature extraction; Correlation; Semantics; Task analysis; Music genre classification; knowledge graph embedding; multi-modality fusion; RETRIEVAL;
DOI
10.1109/TASLP.2024.3402115
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper introduces a pioneering framework for music representation learning, which harnesses knowledge graph embeddings to enrich genre classification. Leveraging metadata from publicly available datasets like FMA and OpenMIC-2018, the constructed knowledge graph delineates intricate relationships among genres, artists, and instruments, offering valuable insights for genre representation. Within this framework, we propose two models tailored for distinct genre classification scenarios: fixed-set genre classification and open-set genre classification. These models exploit the knowledge graph to unveil correlations among different genres and integrate this knowledge into the audio representation. Notably, our approach is the first to merge audio data with high-level knowledge for music genre classification. Experimental results demonstrate that our proposed methods outperform state-of-the-art approaches, achieving an average genre classification accuracy of 68.07% on the FMA-medium dataset and 42.4% for open-set classification on the FMA-large dataset.
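The idea of a knowledge graph linking genres, artists, and instruments can be illustrated with a small sketch. This is not the paper's method: the track records, field names, relation names (`performs_genre`, `uses_instrument`), and the Jaccard overlap used as a stand-in for learned graph embeddings are all assumptions made for illustration. It only shows how metadata might be turned into triples and how shared neighbors hint at correlations between genres.

```python
from collections import defaultdict

# Hypothetical track metadata; field names and values are illustrative only
# and do not reproduce the FMA or OpenMIC-2018 schemas.
TRACKS = [
    {"artist": "artist_a", "genre": "Rock", "instruments": ["guitar", "drums"]},
    {"artist": "artist_a", "genre": "Punk", "instruments": ["guitar", "drums"]},
    {"artist": "artist_b", "genre": "Jazz", "instruments": ["saxophone", "drums"]},
]


def build_triples(tracks):
    """Turn metadata rows into (head, relation, tail) triples."""
    triples = set()
    for t in tracks:
        triples.add((t["artist"], "performs_genre", t["genre"]))
        for inst in t["instruments"]:
            triples.add((t["genre"], "uses_instrument", inst))
    return triples


def genre_neighbors(triples):
    """Map each genre to the set of artists and instruments linked to it."""
    neighbors = defaultdict(set)
    for head, rel, tail in triples:
        if rel == "performs_genre":
            neighbors[tail].add(head)
        elif rel == "uses_instrument":
            neighbors[head].add(tail)
    return neighbors


def jaccard(a, b):
    """Overlap of two neighbor sets; a crude proxy for embedding similarity."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


triples = build_triples(TRACKS)
nb = genre_neighbors(triples)
rock_punk = jaccard(nb["Rock"], nb["Punk"])  # same artist and instruments
rock_jazz = jaccard(nb["Rock"], nb["Jazz"])  # only "drums" in common
```

In this toy graph, Rock and Punk share every neighbor while Rock and Jazz share one, so the overlap score ranks Punk as closer to Rock. A learned knowledge graph embedding would replace this hand-computed overlap with vector representations that can be combined with audio features.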
Pages: 2764-2776
Number of pages: 13