Harmonic Representation for CNN-LSTM Automatic Chord Recognition

被引:0
|
作者
Ito, Tsuyoshi [1 ]
Arai, Shuichi [1 ]
机构
[1] Tokyo City Univ, Grad Sch Engn, Setagaya Ku, 1-28-1 Tamazutsumi, Tokyo 1588557, Japan
关键词
Deep Learning; Signal Processing; Pattern Recognition; Music Information Retrieval; Automatic Chord Recognition;
D O I
10.1109/ICORIS52787.2021.9649565
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Paper Since Chord progression is the element that determines the harmony of a piece of music, Automatic Chord Recognition (ACR) from audio signals is a crucial task in the field of Music Information Retrieval(MIR). Recently, various models using deep learning have been proposed, but there are few studies on their input features. Notes parts of the chord are the fundamental note, and its overtone ringed simultaneously. In order to model these audio signals efficiently, feature transforms such as "Constat-Q-Transform(CQT)" is used. However, due to the super-position of fundamental notes and overtones of various instruments in polyphonic music, it is considered difficult to model chords even by deep learning. Therefore, we focused on the structure, including fundamental notes are on the logarithm and its overtones are on the linear. In this paper, we propose a feature representation that can represent overtone structure for each fundamental note. Based on these feature representations, data-driven approach to learn the chord by CNN-LSTM model. We evaluated performance using 383 songs with publicly available annotations, and achieved the same performance with approximately one-tenth of the number of parameters than the existing methods.
引用
收藏
页码:196 / 200
页数:5
相关论文
共 50 条
  • [1] Automatic recognition of radar signal types based on CNN-LSTM
    Ruan, G.
    Wang, Ya.
    Wang, Sh.L.
    Zheng, Yu.
    Guo, Q.
    Shulga, S.
    [J]. Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika), 2020, 79 (04): : 305 - 321
  • [2] CNN-LSTM for automatic emotion recognition using contactless photoplythesmographic signals
    Mellouk, Wafa
    Handouzi, Wahida
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [3] Multiview Attention CNN-LSTM Network for SAR Automatic Target Recognition
    Wang, Chenwei
    Liu, Xiaoyu
    Pei, Jifang
    Huang, Yulin
    Zhang, Yin
    Yang, Jianyu
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 12504 - 12513
  • [4] Learning Temporal Representation of Transaction Amount for Fraudulent Transaction Recognition using CNN, Stacked LSTM, and CNN-LSTM
    Heryadi, Yaya
    Warnars, Harco Leslie Hendric Spits
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND COMPUTATIONAL INTELLIGENCE (CYBERNETICSCOM), 2017, : 84 - 89
  • [5] Facial Expression Recognition Based on CNN-LSTM
    Liu, Anping
    Yue, Hongjie
    [J]. PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 486 - 491
  • [6] ?-OTDR pattern recognition based on CNN-LSTM
    Wang, Ming
    Feng, Hao
    Qi, Dunzhe
    Du, Lipu
    Sha, Zhou
    [J]. OPTIK, 2023, 272
  • [7] MULTI-VIEW CNN-LSTM NEURAL NETWORK FOR SAR AUTOMATIC TARGET RECOGNITION
    Wang, Chenwei
    Pei, Jifang
    Wang, Zhiyong
    Huang, Yuling
    Yang, Jianyu
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1755 - 1758
  • [8] Multi-View CNN-LSTM Neural Network for SAR Automatic Target Recognition
    Wang, Chenwei
    Pei, Jifang
    Wang, Zhiyong
    Huang, Yuling
    Yang, Jianyu
    [J]. International Geoscience and Remote Sensing Symposium (IGARSS), 2020, : 1755 - 1758
  • [9] Automatic Modulation Classification using amalgam CNN-LSTM
    Chakravarty, Nidhi
    Dua, Mohit
    Dua, Shelza
    [J]. 2023 IEEE RADIO AND ANTENNA DAYS OF THE INDIAN OCEAN, RADIO, 2023,
  • [10] Multidimensional CNN-LSTM Network for Automatic Modulation Classification
    Wang, Na
    Liu, Yunxia
    Ma, Liang
    Yang, Yang
    Wang, Hongjun
    [J]. ELECTRONICS, 2021, 10 (14)