Evaluation of a feature selection scheme on ICA-based filter-bank for speech recognition

Cited by: 0
Authors
Faraji, Neda [1]
Ahadi, S. M. [1]
Affiliations
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
Keywords
feature extraction; feature selection; filter bank; independent component analysis
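DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract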
In this paper, we propose a new feature selection scheme for an ICA-based feature extraction block in speech recognition. The initial set of speech basis functions obtained in the Independent Component Analysis (ICA) training phase contains some redundancy, so finding a minimal-size optimal subset of these basis functions is important. In contrast to previous works, which applied reordering methods across all frequency bands, we introduce an algorithm that finds optimal basis functions within each discriminative frequency band. This yields appropriate coverage of the various frequency components and allows easy extension to other data. Our experiments show that the proposed method is particularly useful in larger-vocabulary tasks, where basis functions selected after training on a limited dataset may become localized in certain frequency bands and fail to generalize to the remaining data. The proposed algorithm overcomes this problem with a local reordering method in which the contribution of a basis function is determined by three factors: class separability power, energy, and central frequency. Experiments on a Persian continuous speech corpus indicate that the proposed method yields a 17% improvement in recognition rate under noisy conditions compared with a conventional MFCC-based system.
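The per-band selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not say how the three factors are combined, so the score used here (separability multiplied by energy), the use of the spectral peak as the central frequency, and the names `central_frequency`, `select_per_band`, `band_edges`, and `per_band` are all assumptions made for illustration. The class separability values are taken as a precomputed input.

```python
import numpy as np

def central_frequency(basis, sample_rate):
    """Peak frequency (Hz) of each basis function's magnitude spectrum.

    Assumption: the spectral peak stands in for the "central frequency".
    `basis` has shape (n_functions, n_samples).
    """
    spectrum = np.abs(np.fft.rfft(basis, axis=1))
    freqs = np.fft.rfftfreq(basis.shape[1], d=1.0 / sample_rate)
    return freqs[np.argmax(spectrum, axis=1)]

def select_per_band(basis, separability, band_edges, per_band, sample_rate):
    """Keep the `per_band` highest-scoring basis functions in each band.

    Assumption: score = separability * energy. The source names the three
    factors but does not give the formula that combines them.
    """
    energy = np.sum(basis ** 2, axis=1)
    score = np.asarray(separability, dtype=float) * energy
    centers = central_frequency(basis, sample_rate)
    # Band b covers [band_edges[b], band_edges[b + 1]); values outside the
    # outer edges fall into the first or last band.
    band = np.digitize(centers, band_edges[1:-1])
    selected = []
    for b in range(len(band_edges) - 1):
        members = np.flatnonzero(band == b)
        ranked = members[np.argsort(score[members])[::-1]]
        selected.extend(ranked[:per_band].tolist())
    return sorted(selected)
```

Ranking within each band, rather than over the whole set, keeps at least `per_band` functions in every non-empty band, which is one way to avoid the selected subset clustering in a few frequency regions.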
Pages: 1277-1281
Number of pages: 5