LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK

Authors
Yu, Zhesong [1 ]
Xu, Xiaoshuo [1 ]
Chen, Xiaoou [1 ]
Yang, Deshun [1 ]
Affiliation
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
Keywords
Music Information Retrieval; Cover Song Identification; Similarity
DOI
10.1109/icassp40776.2020.9053839
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Cover song identification is a challenging task in Music Information Retrieval (MIR) because of the complex musical variations between query tracks and cover versions. Previous work typically relies on hand-crafted features and alignment algorithms. More recently, neural network approaches have achieved further breakthroughs. In this paper, we propose a novel Convolutional Neural Network (CNN) for cover song identification. We train the network with a classification criterion; once trained, it is used to extract a music representation for cover song identification. A training scheme is designed to make the models robust to tempo changes. Experimental results show that our approach outperforms state-of-the-art methods on several public datasets with low time complexity.
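The abstract names two ideas that a small sketch may make more concrete: simulating tempo changes on an input feature sequence, and using a learned representation to retrieve covers. The code below is only an illustration under stated assumptions, not the authors' implementation. The abstract does not say how tempo changes are produced or which similarity measure is used; linear interpolation along the time axis and cosine similarity are assumptions made here, and the names `stretch_time`, `cosine_similarity`, and `rank_by_cosine` are hypothetical. The embeddings are plain lists standing in for whatever a trained network would output.

```python
import math


def stretch_time(frames, factor):
    """Resample a feature sequence along time by linear interpolation.

    frames: list of equal-length feature vectors, one per time step.
    factor > 1 lengthens the sequence (slower tempo); factor < 1 shortens it.
    Assumption: this stands in for the tempo variation mentioned in the abstract.
    """
    n = len(frames)
    if n == 0:
        return []
    m = max(1, round(n * factor))
    out = []
    for i in range(m):
        # Map output index i to a fractional position in the input sequence.
        pos = i * (n - 1) / (m - 1) if m > 1 else 0.0
        lo = int(math.floor(pos))
        hi = min(lo + 1, n - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out


def cosine_similarity(u, v):
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_by_cosine(query, references):
    """Rank reference tracks by similarity to the query embedding.

    references: dict mapping a track name to its embedding (hypothetical data).
    Returns (name, score) pairs, most similar first.
    """
    scored = [(name, cosine_similarity(query, emb)) for name, emb in references.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy one-dimensional feature sequence stretched to 1.5x its length.
    print(stretch_time([[0.0], [2.0]], 1.5))
    # Toy embeddings; in practice these would come from the trained network.
    print(rank_by_cosine([1.0, 0.0], {"a": [0.9, 0.1], "b": [0.0, 1.0]}))
```

Because each track is reduced to one fixed-length vector, retrieval under this sketch needs only one similarity computation per reference track, which is consistent with the low time complexity the abstract reports, in contrast to pairwise sequence alignment.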
Pages: 541-545
Page count: 5