New sub-band processing framework using non-linear predictive models for speech feature extraction

被引:0
|
作者
Chetouani, M [1 ]
Hussain, A
Gas, B
Zarader, JL
机构
[1] Univ Paris 06, Lab Instruments & Syst Ile De France, Paris, France
[2] Univ Stirling, Dept Comp Sci & Math, Stirling, Scotland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech feature extraction methods are commonly based on time and frequency processing approaches. In this paper, we propose a new framework based on sub-band processing and non-linear prediction. The key idea is to pre-process the speech signal by a filter bank. From the resulting signals, non-linear predictors are computed. The feature extraction method involves the association of different Neural Predictive Coding (NPC) models. We apply this new framework to phoneme classification and experiments carried out with the NTIMIT database show an improvement of the classification rates in comparison with the full-band approach. The new method is also shown to give better performance than the traditional Linear Predictive Coding (LPC), Mel Frequency Cepstral Coding (MFCC) and Perceptual Linear Prediction (PLP) methods.
引用
收藏
页码:284 / 290
页数:7
相关论文
共 50 条
  • [1] Non-linear sub-band processing for binaural adaptive speech-enhancement
    Hussain, A
    [J]. NINTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS (ICANN99), VOLS 1 AND 2, 1999, (470): : 121 - 125
  • [2] Non-linear predictive models for speech processing
    Chetouani, M
    Hussain, A
    Faundez-Zanuy, M
    Gas, B
    [J]. ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697 : 779 - 784
  • [3] Spoofing Attack Detection using the Non-linear Fusion of Sub-band Classifiers
    Tak, Hemlata
    Patino, Jose
    Nautsch, Andreas
    Evans, Nicholas
    Todisco, Massimiliano
    [J]. INTERSPEECH 2020, 2020, : 1106 - 1110
  • [4] Sub-band SNR estimation using auditory feature processing
    Kleinschmidt, M
    Hohmann, V
    [J]. SPEECH COMMUNICATION, 2003, 39 (1-2) : 47 - 63
  • [5] Acoustic Feature Extraction using ERB Like Wavelet Sub-band Perceptual Wiener Filtering for Noisy Speech Recognition
    Biswas, Astik
    Sahu, P. K.
    Bhowmick, Anirban
    Chandra, Mahesh
    [J]. 2014 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2014,
  • [6] Feature extraction using non-linear transformation for robust speech recognition on the AURORA database
    Sharma, S
    Ellis, D
    Kajarekar, S
    Jain, P
    Hermansky, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1117 - 1120
  • [7] Feature Extraction Using Linear and Non-linear Subspace Techniques
    Teixeira, Ana R.
    Tome, Ana Maria
    Lang, E. W.
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, 2009, 5769 : 115 - +
  • [8] Speech enhancement using sub-band adaptive Griffiths-Jim signal processing
    Campbell, DR
    Shields, PW
    [J]. SPEECH COMMUNICATION, 2003, 39 (1-2) : 97 - 110
  • [10] Kernel based non-linear feature extraction methods for speech recognition
    Huang, Hao
    Zhu, Jie
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 749 - +