Music genre classification based on auditory image, spectral and acoustic features

被引:0
|
作者
Xin Cai
Hongjuan Zhang
机构
[1] Shanghai University,Department of Mathematics
来源
Multimedia Systems | 2022年 / 28卷
关键词
Music genre classification; Auditory image feature; Spectral feature; Acoustic feature; Feature fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre is one of the conventional ways to describe music content, and also is one of the important labels of music information retrieval. Therefore, the effective and precise music genre classification method becomes an urgent need for realizing automatic organization of large music archives. Inspired by the fact that humans have a better automatic recognizing music genre ability, which may attribute to our auditory system, even for the participants with little musical literacy. In this paper, a novel classification framework incorporating the auditory image feature with traditional acoustic features and spectral feature is proposed to improve the classification accuracy. In detail, auditory image feature is extracted based on the auditory image model which simulates the auditory system of the human ear and has also been successfully used in other fields apart from music genre classification to our best knowledge. Moreover, the logarithmic frequency spectrogram rather than linear is adopted to extract the spectral feature to capture the information about the low-frequency part adequately. These above two features and the traditional acoustic feature are evaluated, compared, respectively, and fused finally based on the GTZAN, GTZAN-NEW, ISMIR2004 and Homburg datasets. Experimental results show that the proposed method owns the higher classification accuracy and the better stability than many state-of-the-art classification methods.
引用
收藏
页码:779 / 791
页数:12
相关论文
共 50 条
  • [1] Music genre classification based on auditory image, spectral and acoustic features
    Cai, Xin
    Zhang, Hongjuan
    MULTIMEDIA SYSTEMS, 2022, 28 (03) : 779 - 791
  • [2] Extraction of acoustic features based on auditory spike code and its application to music genre classification
    Shin, Seong-Hyeon
    Yun, Ho-Won
    Jang, Woo-Jin
    Park, Hochong
    IET SIGNAL PROCESSING, 2019, 13 (02) : 230 - 234
  • [3] Combining visual and acoustic features for music genre classification
    Nanni, Loris
    Costa, Yandre M. G.
    Lumini, Alessandra
    Kim, Moo Young
    Baek, Seung Ryul
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 45 : 108 - 117
  • [4] Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features
    Lee, Chang-Hsing
    Shih, Jau-Ling
    Yu, Kun-Ming
    Lin, Hwai-San
    IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (04) : 670 - 682
  • [5] The Analysis and Comparison of Vital Acoustic Features in Content -Based Classification of Music Genre
    Wang, Zhe
    Xia, Jingbo
    Luo, Bin
    2013 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA), 2013, : 404 - 408
  • [6] Combining Acoustic and Multilevel Visual Features for Music Genre Classification
    Wu, Ming-Ju
    Jang, Jyh-Shing R.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2015, 12 (01) : 1 - 17
  • [7] Ensemble of deep learning, visual and acoustic features for music genre classification
    Nanni, Loris
    Costa, Yandre M. G.
    Aguiar, Rafael L.
    Silla, Carlos N., Jr.
    Brahnam, Sheryl
    JOURNAL OF NEW MUSIC RESEARCH, 2018, 47 (04) : 383 - 397
  • [8] Music Features based on Hu Moments for Genre Classification
    Lopes, Renia
    Chapaneri, Santosh
    Jayaswal, Deepak
    2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, COMPUTING AND IT APPLICATIONS (CSCITA), 2017, : 22 - 27
  • [9] Fusion of Static and Transitional Information of Cepstral and Spectral Features for Music Genre Classification
    Lee, Chang-Hsing
    Shih, Jau-Ling
    Yu, Kun-Ming
    Lin, Hwai-San
    Wei, Ming-Hui
    2008 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, VOLS 1-3, PROCEEDINGS, 2008, : 751 - 756
  • [10] Evaluation of Music Features for PUK Kernel based Genre Classification
    Chapaneri, Santhosh
    Lopes, Renia
    Jayaswal, Deepak
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING TECHNOLOGIES AND APPLICATIONS (ICACTA), 2015, 45 : 186 - 196