Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification

被引:0
|
作者
Dai, Jia [1 ,2 ]
Liang, Shan [1 ,2 ]
Xue, Wei [1 ,2 ]
Ni, Chongjia [3 ]
Liu, Wenju [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Shandong Univ Finance & Econ, Jinan, Shandong, Peoples R China
关键词
Long short-term memory; recurrent neural network; music genre classification; scattering transform;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the conventional frame feature based music genre classification methods, the audio data is represented by independent frames and the sequential nature of audio is totally ignored. If the sequential knowledge is well modeled and combined, the classification performance can be significantly improved. The long short-term memory(LSTM) recurrent neural network (RNN) which uses a set of special memory cells to model for long-range feature sequence, has been successfully used for many sequence labeling and sequence prediction tasks. In this paper, we propose the LSTM RNN based segment features for music genre classification. The LSTM RNN is used to learn the representation of LSTM frame feature. The segment features are the statistics of frame features in each segment. Furthermore, the LSTM segment feature is combined with the segment representation of initial frame feature to obtain the fusional segment feature. The evaluation on ISMIR database show that the LSTM segment feature performs better than the frame feature. Overall, the fusional segment feature achieves 89.71% classification accuracy, about 4.19% improvement over the baseline model using deep neural network (DNN). This significant improvement show the effectiveness of the proposed segment feature.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Chemical Substance Classification Using Long Short-Term Memory Recurrent Neural Network
    Zhang, Jinlei
    Liu, Junxiu
    Luo, Yuling
    Fu, Qiang
    Bi, Jinjie
    Qiu, Senhui
    Cao, Yi
    Ding, Xuemei
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1994 - 1997
  • [2] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 153 - 158
  • [3] Multiclass Classification of Driver Perceived Workload Using Long Short-Term Memory based Recurrent Neural Network
    Manawadu, Udara E.
    Kawano, Takahiro
    Murata, Shingo
    Kamezaki, Mitsuhiro
    Muramatsu, Junya
    Sugano, Shigeki
    [J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2009 - 2014
  • [4] Music Emotion Recognition Based on Long Short-Term Memory and Forward Neural Network
    Liu, Aizhen
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (04):
  • [5] Spam SMS Detection Based on Long Short-Term Memory and Recurrent Neural Network
    Alseid, Marya
    Nassif, Ali Bou
    AlShabi, Mohammad
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [6] Application of Long Short-term Memory Based Neural Network for Classification of Customer Behavior
    Zhao, Licheng
    Zuo, Yi
    Yada, Katsutoshi
    Liu, Meijun
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 994 - 999
  • [8] Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network
    Sherstinsky, Alex
    [J]. arXiv, 2018,
  • [9] Wind Power Prediction based on Recurrent Neural Network with Long Short-Term Memory Units
    Dong, Danting
    Sheng, Zhihao
    Yang, Tiancheng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY AND POWER ENGINEERING (REPE 2018), 2018, : 34 - 38
  • [10] Terahertz Spectral Recognition Based on Bidirectional Long Short-Term Memory Recurrent Neural Network
    Yu Hao-yue
    Shen Tao
    Zhu Yan
    Liu Ying-li
    Yu Zheng-tao
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39 (12) : 3737 - 3742