Music Artist Classification with Convolutional Recurrent Neural Networks

Cited by: 0
Authors
Nasrullah, Zain [1]
Zhao, Yue [1]
Affiliations
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
Keywords
artist classification; music; information retrieval; deep learning; convolutional recurrent neural network; genre classification
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Previous attempts at music artist classification use frame-level audio features, which summarize frequency content within short intervals of time. In contrast, more recent music information retrieval tasks take advantage of temporal structure in audio spectrograms using deep convolutional and recurrent models. This paper revisits artist classification with this new framework and empirically explores the impact of incorporating temporal structure in the feature representation. To this end, an established classification architecture, a Convolutional Recurrent Neural Network (CRNN), is applied to the artist20 music artist identification dataset under a comprehensive set of conditions. These include audio clip length, which is a novel contribution of this work, and previously identified considerations such as dataset split and feature level. Our results improve upon baseline works, verify the influence of the producer effect on classification performance, and demonstrate the trade-offs between audio length and training set size. The best-performing model achieves an average F1 score of 0.937 across three independent trials, a substantial improvement over the corresponding baseline under similar conditions. Additionally, to showcase the effectiveness of the CRNN's feature extraction capabilities, we visualize audio samples at the model's bottleneck layer, demonstrating that learned representations segment into clusters belonging to their respective artists.
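The trade-off between audio clip length and training set size mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each song's spectrogram is cut into non-overlapping clips of a fixed number of frames; the function names `count_clips` and `slice_clips` and the frame counts are hypothetical and are not taken from the paper.

```python
def count_clips(total_frames: int, clip_frames: int) -> int:
    """Number of non-overlapping clips of `clip_frames` frames that fit in a song."""
    if clip_frames <= 0:
        raise ValueError("clip_frames must be positive")
    return total_frames // clip_frames


def slice_clips(frames, clip_frames):
    """Split a sequence of spectrogram frames into non-overlapping clips.

    Frames left over at the end that do not fill a whole clip are dropped.
    """
    n = count_clips(len(frames), clip_frames)
    return [frames[i * clip_frames:(i + 1) * clip_frames] for i in range(n)]


# Hypothetical song of 9,000 spectrogram frames (illustrative value only).
song = list(range(9000))
for clip_frames in (30, 100, 300, 900):
    clips = slice_clips(song, clip_frames)
    # Longer clips carry more temporal context but yield fewer training samples.
    print(clip_frames, len(clips))
```

Under this assumption, each increase in clip length reduces the number of training samples obtained from the same audio, which is the tension the abstract describes.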
Pages: 8
Related papers
50 in total
  • [41] CLASSIFICATION OF SEVERELY OCCLUDED IMAGE SEQUENCES VIA CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Zheng, Jian
    Wang, Yifan
    Zhang, Xiaonan
    Li, Xiaohua
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 395 - 399
  • [42] Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks
    Tang, Zhenyu
    Kanu, John D.
    Hogan, Kevin
    Manocha, Dinesh
    INTERSPEECH 2019, 2019, : 654 - 658
  • [43] Multi-channel lung sound classification with convolutional recurrent neural networks
    Messner, Elmar
    Fediuk, Melanie
    Swatek, Paul
    Scheidl, Stefan
    Smolle-Juettner, Freyja-Maria
    Olschewski, Horst
    Pernkopf, Franz
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 122
  • [44] Comparing recurrent convolutional neural networks for large scale bird species classification
    Gaurav Gupta
    Meghana Kshirsagar
    Ming Zhong
    Shahrzad Gholami
    Juan Lavista Ferres
    Scientific Reports, 11
  • [45] ECG Classification With a Convolutional Recurrent Neural Network
    Sigurthorsdottir, Halla
    Van Zaen, Jerome
    Delgado-Gonzalo, Ricard
    Lemay, Mathieu
    2020 COMPUTING IN CARDIOLOGY, 2020,
  • [46] RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR VIDEO CLASSIFICATION
    Xu, Zhenqi
    Hu, Jiani
    Deng, Weihong
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [47] Music Feature Classification Based on Recurrent Neural Networks with Channel Attention Mechanism
    Gan, Jie
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [48] Recurrent neural networks for music computation
    Franklin, Judy A.
    INFORMS JOURNAL ON COMPUTING, 2006, 18 (03) : 321 - 338
  • [49] Convolutional Neural Networks for event classification
    Rubio Jimenez, Adrian
    Garcia Navarro, Jose Enrique
    Moreno Llacer, Maria
    NINTH ANNUAL CONFERENCE ON LARGE HADRON COLLIDER PHYSICS, LHCP2021, 2021,
  • [50] Convolutional Neural Networks for image classification
    Jmour, Nadia
    Zayen, Sehla
    Abdelkrim, Afef
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 397 - 402