Automatic Genre Classification Using Fractional Fourier Transform Based Mel Frequency Cepstral Coefficient and Timbral Features

被引:7
|
作者
Bhalke, Daulappa Guranna [1 ]
Rajesh, Betsy [1 ]
Bormane, Dattatraya Shankar [1 ]
机构
[1] SPPU, JSPMs Rajarshi Shahu Coll Engn, Deptartment Elect & Telecommun, Pune, Maharashtra, India
关键词
feature extraction; Timbral features; MFCC; Fractional Fourier Transform (FrFT); Fractional MFCC; Tamil Carnatic music; MUSIC; SIGNALS;
D O I
10.1515/aoa-2017-0024
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents the Automatic Genre Classification of Indian Tamil Music and Western Music using Timbral and Fractional Fourier Transform (FrFT) based Mel Frequency Cepstral Coefficient (MFCC) features. The classifier model for the proposed system has been built using K-NN (K-Nearest Neighbours) and Support Vector Machine (SVM). In this work, the performance of various features extracted from music excerpts has been analysed, to identify the appropriate feature descriptors for the two major genres of Indian Tamil music, namely Classical music (Carnatic based devotional hymn compositions) & Folk music and for western genres of Rock and Classical music from the GTZAN dataset. The results for Tamil music have shown that the feature combination of Spectral Roll off, Spectral Flux, Spectral Skewness and Spectral Kurtosis, combined with Fractional MFCC features, outperforms all other feature combinations, to yield a higher classification accuracy of 96.05%, as compared to the accuracy of 84.21% with conventional MFCC. It has also been observed that the FrFT based MFCC effieciently classifies the two western genres of Rock and Classical music from the GTZAN dataset with a higher classification accuracy of 96.25% as compared to the classification accuracy of 80% with MFCC.
引用
收藏
页码:213 / 222
页数:10
相关论文
共 50 条
  • [1] PARABOLIC FILTER MEL FREQUENCY CEPSTRAL COEFFICIENT AND FUSION OF FEATURES FOR SPEAKER AGE CLASSIFICATION
    Osman, Mohammed Muntaz
    Buyuk, Osman
    [J]. SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2020, 38 (04): : 2177 - 2191
  • [2] Automatic Music Genre Classification Using Timbral Texture and Rhythmic Content Features
    Baniya, Babu Kaji
    Ghimire, Deepak
    Lee, Joonwhoan
    [J]. 2015 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2015,
  • [3] Target Classification Using Features Based on Fractional Fourier Transform
    Seok, Jongwon
    Bae, Keunsung
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09): : 2518 - 2521
  • [4] Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features
    Zhu, Qiang
    Wang, Zhong
    Dou, Yunfeng
    Zhou, Jian
    [J]. ALGORITHMS, 2022, 15 (02)
  • [5] Analysis of Asthma By Using Mel Frequency Cepstral Coefficient
    Dighore, V. D.
    Thool, V. R.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 976 - 980
  • [6] Hybridisation of Mel Frequency Cepstral Coefficient and Higher Order Spectral Features for Musical Instruments Classification
    Bhalke, Daulappa Guranna
    Rama Rao, C. B.
    Bormane, Dattatraya
    [J]. ARCHIVES OF ACOUSTICS, 2016, 41 (03) : 427 - 436
  • [7] Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
    Eskidere, Omer
    Gurhanli, Ahmet
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [8] Classification of heart sounds using fractional fourier transform based mel-frequency spectral coefficients and traditional classifiers
    Abduh, Zaid
    Nehary, Ebrahim Ameen
    Wahed, Manal Abdel
    Kadah, Yasser M.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 57
  • [9] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [10] Fusion of Linear and Mel Frequency Cepstral Coefficients for Automatic Classification of Reptiles
    Noda, Juan J.
    Travieso, Carlos M.
    Sanchez-Rodriguez, David
    [J]. APPLIED SCIENCES-BASEL, 2017, 7 (02):