Improved Music Genre Classification with Convolutional Neural Networks

被引:42
|
作者
Zhang, Weibin [1 ]
Lei, Wenkang [1 ]
Xu, Xiangmin [1 ]
Xing, Xiaofeng [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
关键词
music genre classification; convolutional neural network; residual learning;
D O I
10.21437/Interspeech.2016-1236
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent years, deep neural networks have been shown to be effective in many classification tasks, including music genre classification. In this paper, we proposed two ways to improve music genre classification with convolutional neural networks: 1) combining max- and average pooling to provide more statistical information to higher level neural networks; 2) using shortcut connections to skip one or more layers, a method inspired by residual learning method. The input of the CNN is simply the short time Fourier transforms of the audio signal. The output of the CNN is fed into another deep neural network to do classification. By comparing two different network topologies, our preliminary experimental results on the GTZAN data set show that the above two methods can effectively improve the classification accuracy, especially the second one.
引用
收藏
页码:3304 / 3308
页数:5
相关论文
共 50 条
  • [41] Bangla Music Genre Classification Using Neural Network
    Al Mamunl, Md Afif
    Kadir, Imamul
    Rabby, A. k M. Shahariar Azad
    Al Azmi, Abdullah
    PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), 2019, : 397 - 403
  • [42] Hierarchical mining with complex networks for music genre classification
    Salazar, Andres Eduardo Coca
    DIGITAL SIGNAL PROCESSING, 2022, 127
  • [43] Deep Belief Networks for Automatic Music Genre Classification
    Yang, Xiaohong
    Chen, Qingcai
    Zhou, Shusen
    Wang, Xiaolong
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2444 - 2447
  • [44] Convolutional Neural Networks for event classification
    Rubio Jimenez, Adrian
    Garcia Navarro, Jose Enrique
    Moreno Llacer, Maria
    NINTH ANNUAL CONFERENCE ON LARGE HADRON COLLIDER PHYSICS, LHCP2021, 2021,
  • [45] Convolutional Neural Networks for image classification
    Jmour, Nadia
    Zayen, Sehla
    Abdelkrim, Afef
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 397 - 402
  • [46] Convolutional Neural Networks for Electrocardiogram Classification
    Mohamad M. Al Rahhal
    Yakoub Bazi
    Mansour Al Zuair
    Esam Othman
    Bilel BenJdira
    Journal of Medical and Biological Engineering, 2018, 38 : 1014 - 1025
  • [47] Flower Classification with Convolutional Neural Networks
    Mitrovic, Katarina
    Milosevic, Danijela
    2019 23RD INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2019, : 845 - 850
  • [48] Music Genre Recognition Using Residual Neural Networks
    Bisharad, Dipjyoti
    Laskar, Rabul Hussain
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 2063 - 2068
  • [49] Glomerulus Classification with Convolutional Neural Networks
    Pedraza, Anibal
    Gallego, Jaime
    Lopez, Samuel
    Gonzalez, Lucia
    Laurinavicius, Arvydas
    Bueno, Gloria
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 839 - 849
  • [50] Convolutional Neural Networks for Electrocardiogram Classification
    Al Rahhal, Mohamad M.
    Bazi, Yakoub
    Al Zuair, Mansour
    Othman, Esam
    BenJdira, Bilel
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2018, 38 (06) : 1014 - 1025