Music genre classification based on auditory image, spectral and acoustic features

被引:0
|
作者
Xin Cai
Hongjuan Zhang
机构
[1] Shanghai University,Department of Mathematics
来源
Multimedia Systems | 2022年 / 28卷
关键词
Music genre classification; Auditory image feature; Spectral feature; Acoustic feature; Feature fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre is one of the conventional ways to describe music content, and also is one of the important labels of music information retrieval. Therefore, the effective and precise music genre classification method becomes an urgent need for realizing automatic organization of large music archives. Inspired by the fact that humans have a better automatic recognizing music genre ability, which may attribute to our auditory system, even for the participants with little musical literacy. In this paper, a novel classification framework incorporating the auditory image feature with traditional acoustic features and spectral feature is proposed to improve the classification accuracy. In detail, auditory image feature is extracted based on the auditory image model which simulates the auditory system of the human ear and has also been successfully used in other fields apart from music genre classification to our best knowledge. Moreover, the logarithmic frequency spectrogram rather than linear is adopted to extract the spectral feature to capture the information about the low-frequency part adequately. These above two features and the traditional acoustic feature are evaluated, compared, respectively, and fused finally based on the GTZAN, GTZAN-NEW, ISMIR2004 and Homburg datasets. Experimental results show that the proposed method owns the higher classification accuracy and the better stability than many state-of-the-art classification methods.
引用
收藏
页码:779 / 791
页数:12
相关论文
共 50 条
  • [31] Music Genre Classification Based on Deep Learning
    Zhang, Wenlong
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [32] Deep attention based music genre classification
    Yu, Yang
    Luo, Sen
    Liu, Shenglan
    Qiao, Hong
    Liu, Yang
    Feng, Lin
    NEUROCOMPUTING, 2020, 372 : 84 - 91
  • [33] Automatic Music Genre Classification Based on CRNN
    Cheng, Yu-Huei
    Chang, Pang-Ching
    Nguyen, Duc-Man
    Kuo, Che-Nan
    ENGINEERING LETTERS, 2021, 29 (01) : 312 - 316
  • [34] Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals
    Mehdi Banitalebi-Dehkordi
    Amin Banitalebi-Dehkordi
    Journal of Signal Processing Systems, 2014, 74 : 273 - 280
  • [35] Automatic music genre classification using modulation spectral contrast feature
    Lee, Chang-Hsing
    Shih, Jau-Ling
    Yu, Kun-Ming
    Su, Jung-Mau
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 204 - 207
  • [36] Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals
    Banitalebi-Dehkordi, Mehdi
    Banitalebi-Dehkordi, Amin
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (02): : 273 - 280
  • [37] A Novel Approach of Automatic Music Genre Classification based on Timbral Texture and Rhythmic Content Features
    Baniya, Babu Kaji
    Ghimire, Deepak
    Lee, Joonwhoan
    2014 16TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2014, : 96 - 102
  • [38] Music-Genre Classification System based on Spectro-Temporal Features Feature Selection
    Lim, Shin-Cheol
    Lee, Jong-Seol
    Jang, Sei-Jin
    Lee, Soek-Pil
    Kim, Moo Young
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1262 - 1268
  • [39] On the Use of Sequential Patterns Mining as Temporal Features for Music Genre Classification
    Ren, Jia-Min
    Chen, Zhi-Sheng
    Jang, Jyh-Shing Roger
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2294 - 2297
  • [40] Gabor-LBP Features and Combined Classifiers for Music Genre Classification
    Wu, Haiqian
    Zhang, Ming
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 419 - 422