Minimum Classification Error Based Spectro-Temporal Feature Extraction for Robust Audio Classification

被引:0
|
作者
Liao, Yuan-Fu [1 ]
Lin, Chia-Hsing [1 ]
Fang, We-Der [1 ]
机构
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei, Taiwan
关键词
spectro-temporal feature extraction; robust audio classification; minimum classification error; SPEECH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mel-frequency cepstral coefficients (MFCCs) are the most popular features for automatic audio classification (AAC). However, MFCCs are often not robust in adverse environment. In this paper, a minimum classification error (MCE)-based method is proposed to extract new and robust spectro-temporal features as alternatives to MFCCs. The robustness of the proposed new features is evaluated on noisy non-speech sound of RWCP Sound Scene Database in Real Acoustic Environment database with Aurora 2 multi-condition training task-like settings. Experimental results show the proposed new features achieved the lowest average recognition error rate of 3.17% which is much better than state-of-the-art MFCCs plus mean subtraction, variance normalization and ARMA filtering (MFCC+MVA, 4.31%), Gabor filters with principle component analysis (Gabor+PCA, 4.43%) and linear discriminant analysis (LDA, 4.20%) features. We thus confirm the robustness of the proposed spectro-temporal feature extraction approach.
引用
收藏
页码:248 / 251
页数:4
相关论文
共 50 条
  • [1] A Novel Spectro-Temporal Feature Extraction Method for Phoneme Classification
    Fartash, Mehdi
    Setayeshi, Saeed
    Razzazi, Farbod
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 569 - +
  • [2] New approaches for spectro-temporal feature extraction with applications to respiratory sound classification
    Jin, F.
    Sattar, F.
    Goh, D. Y. T.
    [J]. NEUROCOMPUTING, 2014, 123 : 362 - 371
  • [3] Bioinspired sparse spectro-temporal representation of speech for robust classification
    Martinez, C.
    Goddard, J.
    Milone, D.
    Rufiner, H.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (05): : 336 - 348
  • [4] Music-Genre Classification System based on Spectro-Temporal Features Feature Selection
    Lim, Shin-Cheol
    Lee, Jong-Seol
    Jang, Sei-Jin
    Lee, Soek-Pil
    Kim, Moo Young
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1262 - 1268
  • [5] A hierarchical framework for spectro-temporal feature extraction
    Heckmann, Martin
    Domont, Xavier
    Joublin, Frank
    Goerick, Christian
    [J]. SPEECH COMMUNICATION, 2011, 53 (05) : 736 - 752
  • [6] Spectro-temporal features for environmental sound classification
    Thwe, Khine Zar
    Thaw, Mie Mie
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (02) : 179 - 189
  • [7] Nonnegative features of spectro-temporal sounds for classification
    Cho, YC
    Choi, SJ
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (09) : 1327 - 1336
  • [8] Robust Audio Identification Using Spectro-Temporal Subband Centroids
    Seo, Jin Soo
    Lee, Seungjae
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (05): : 239 - 243
  • [9] Spectro-Temporal Feature Based Multi-Channel Convolutional Neural Network for ECG Beat Classification
    Hao, Chen
    Wibowo, Sandi
    Majmudar, Maulik
    Rajput, Kuldeep Singh
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 5642 - 5645
  • [10] Using Spectro-Temporal Features to Improve AFE Feature Extraction for ASR
    Ravuri, Suman V.
    Morgan, Nelson
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1181 - 1184