Ensemble of Transformer and Convolutional Recurrent Neural Network for Improving Discrimination Accuracy in Automatic Chord Recognition

被引:0
|
作者
Yamaga, Hikaru [1 ]
Momma, Toshifumi [1 ]
Kojima, Kazunori [1 ]
Itoh, Yoshiaki [1 ]
机构
[1] Iwate Prefectural Univ, Takizawa, Japan
关键词
D O I
10.1109/APSIPAASC58517.2023.10317349
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic chord recognition is a task of recognizing and transcribing chords from music data such as popular music. Manual chord transcription requires highly technical knowledge and great effort. A chord is a typical musical feature. Realization of automatic chord recognition can enable their use for many purposes such as musical notation and structural analysis. For this reason, automatic chord recognition has become a major research task in the field of music information retrieval. In recent years, automatic chord recognition has widely used deep learning models. Convolutional Recurrent Neural Network (CRNN) and Transformer have achieved high accuracy. For this study, we focus on the differences in feature extraction approaches used by CRNN and Transformer, and propose an ensemble learning method using the two models. Additionally, we adopt an original overlap inference method to improve their accuracy by complementing the lack of temporal information. Results show that we achieved average accuracy of 78.92% under the traditional evaluation metrics, which are, respectively, 1.64% and 2.43% higher than those of CRNN and Transformer.
引用
收藏
页码:2299 / 2305
页数:7
相关论文
共 50 条
  • [31] Automatic Segmentation Algorithm of Dermoscopy Image Based on Transformer and Convolutional Neural Network
    Wei C.
    Xu Y.
    Jiang X.
    Wei Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (12): : 1877 - 1886
  • [32] An amalgamation of vision transformer with convolutional neural network for automatic lung tumor segmentation
    Tyagi, Shweta
    Kushnure, Devidas T.
    Talbar, Sanjay N.
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 108
  • [33] Improving ECG Classification Accuracy Using an Ensemble of Neural Network Modules
    Javadi, Mehrdad
    Ebrahimpour, Reza
    Sajedin, Atena
    Faridi, Soheil
    Zakernejad, Shokoufeh
    PLOS ONE, 2011, 6 (10):
  • [34] Improving the Accuracy of Stock Price Prediction Using Ensemble Neural Network
    San, Phang Wai
    Im, Tan Li
    Anthony, Patricia
    On, Chin Kim
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1524 - 1527
  • [35] A multitask cascading convolutional neural network for high-accuracy pointer meter automatic recognition in outdoor environments
    Liu, Fang
    Pan, Lei
    Gao, Rui
    Zhang, Liyang
    Pang, Yi
    Ning, Xucheng
    Zhang, Hao
    Liu, Kunlei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (05)
  • [36] Improving CBIR Accuracy using Convolutional Neural Network for Feature Extraction
    Shah, Amjad
    Naseem, Rashid
    Sadia
    Iqbal, Shahid
    Shah, Muhammad Arif
    2017 13TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET 2017), 2017,
  • [37] Fundus Image Classification Research Based on Ensemble Convolutional Neural Network and Vision Transformer
    Yuan Yuan
    Chen Minghui
    Ke Shuting
    Wang Teng
    He Longxi
    Lu Linjie
    Sun Hao
    Liu Jiannan
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2022, 49 (20):
  • [38] Scene text recognition using residual convolutional recurrent neural network
    Lei, Zhengchao
    Zhao, Sanyuan
    Song, Hongmei
    Shen, Jianbing
    MACHINE VISION AND APPLICATIONS, 2018, 29 (05) : 861 - 871
  • [39] Attend It Again: Recurrent Attention Convolutional Neural Network for Action Recognition
    Yang, Haodong
    Zhang, Jun
    Li, Shuohao
    Lei, Jun
    Chen, Shiqi
    APPLIED SCIENCES-BASEL, 2018, 8 (03):
  • [40] Improved Very Deep Recurrent Convolutional Neural Network for Object Recognition
    Brahimi, Sourour
    Ben Aoun, Najib
    Ben Amar, Chokri
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2497 - 2502