Music Artist Classification with Convolutional Recurrent Neural Networks

被引:0
|
作者
Nasrullah, Zain [1 ]
Zhao, Yue [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
关键词
artist classification; music; information retrieval; deep learning; convolutional recurrent neural network; GENRE CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous attempts at music artist classification use frame level audio features which summarize frequency content within short intervals of time. Comparatively, more recent music information retrieval tasks take advantage of temporal structure in audio spectrograms using deep convolutional and recurrent models. This paper revisits artist classification with this new framework and empirically explores the impacts of incorporating temporal structure in the feature representation. To this end, an established classification architecture, a Convolutional Recurrent Neural Network (CRNN), is applied to the artist20 music artist identification dataset under a comprehensive set of conditions. These include audio clip length, which is a novel contribution in this work, and previously identified considerations such as dataset split and feature level. Our results improve upon baseline works, verify the influence of the producer effect on classification performance and demonstrate the trade-offs between audio length and training set size. The best performing model achieves an average F1 score of 0.937 across three independent trials which is a substantial improvement over the corresponding baseline under similar conditions. Additionally, to showcase the effectiveness of the CRNN's feature extraction capabilities, we visualize audio samples at the model's bottleneck layer demonstrating that learned representations segment into clusters belonging to their respective artists.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR MUSIC CLASSIFICATION
    Choi, Keunwoo
    Fazekas, Gyorgy
    Sandler, Mark
    Cho, Kyunghyun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2392 - 2396
  • [2] Convolutional Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 65 - 82
  • [3] Recurrent Convolutional Neural Networks for Text Classification
    Lai, Siwei
    Xu, Liheng
    Liu, Kang
    Zhao, Jun
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2267 - 2273
  • [4] Convolutional Recurrent Neural Networks for Text Classification
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    Wang, Lei
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [5] Convolutional Recurrent Neural Networks for Electrocardiogram Classification
    Zihlmann, Martin
    Perekrestenko, Dmytro
    Tschannen, Michael
    [J]. 2017 COMPUTING IN CARDIOLOGY (CINC), 2017, 44
  • [6] Recurrent Neural Networks for Music Genre Classification
    Kakarla, Chaitanya
    Eshwarappa, Vidyashree
    Saheer, Lakshmi Babu
    Oghaz, Mahdi Maktabdar
    [J]. ARTIFICIAL INTELLIGENCE XXXIX, AI 2022, 2022, 13652 : 267 - 279
  • [7] Convolutional Neural Networks Approach for Music Genre Classification
    Cheng, Yu-Huei
    Chang, Pang-Ching
    Kuo, Che-Nan
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 399 - 403
  • [8] MUSIC GENRE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS
    Subhani, G. M.
    Shravya, Perala
    Kumar, Gorighe Akhil
    Hrithika, Chitumalla
    Shrinivas, Chimalpade Ajay
    [J]. INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 1519 - 1526
  • [9] Improved Music Genre Classification with Convolutional Neural Networks
    Zhang, Weibin
    Lei, Wenkang
    Xu, Xiangmin
    Xing, Xiaofeng
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3304 - 3308
  • [10] Sentiment Classification Via Recurrent Convolutional Neural Networks
    Du, Changshun
    Huang, Lei
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE AND INTERNET TECHNOLOGY, CII 2017, 2017, : 308 - 316