Non-linear predictors based on the functionally expanded neural networks for speech feature extraction

被引:5
|
作者
Chetouani, Mohamed [1 ]
Hussain, Amir
Gas, Bruno
Zarader, Jean-Luc
机构
[1] Univ Paris Pierre & Marie Curie, Paris, France
关键词
D O I
10.1109/ICEIS.2006.1703129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we focus on the design of the feature extractor stage of the speech recognition system which aims to compute optimal vectors for the next phoneme classification stage. We propose a new non-linear feature extraction method based on the linear-in-parameters Functionally Expanded Neural Network (FENN) model. The main idea is to design an improved and flexible feature extractor which can effectively account for some of the significant non-tinear phenomena usually observed in the speech production process. The effectiveness of the proposed method is assessed on phoneme classification tasks. Specifically, we evaluate the performances on the telephone quality NTIMIT database, focusing the investigations on highly confusable phonemes such as front vowels: /ih/, /ey/, /eh/, /ae/. The results are compared with other widely used coding methods namely, the Linear Predictive Coding (LPC) and the Mel Frequency Cepstral Coding (MFCC). The experiments show a relative improvement in the rates through the use of our proposed non-linear feature extractor technique.
引用
收藏
页码:1 / +
页数:2
相关论文
共 50 条
  • [1] Kernel based non-linear feature extraction methods for speech recognition
    Huang, Hao
    Zhu, Jie
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 749 - +
  • [2] Hand written character feature extraction using non-linear feedforward neural networks
    Jalil, A
    Qureshi, IM
    Cheema, TA
    Naveed, A
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2005, 21 (02) : 453 - 473
  • [3] A NON-LINEAR OPERATOR BASED METHOD FOR HARMONIC FEATURE EXTRACTION FROM SPEECH SIGNALS
    Kavanagh, Darren F.
    Boland, Frank
    [J]. ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 217 - 220
  • [4] Non-linear Feature Selection Based on Convolution Neural Networks with Sparse Regularization
    Wen-Bin Wu
    Si-Bao Chen
    Chris Ding
    Bin Luo
    [J]. Cognitive Computation, 2024, 16 : 654 - 670
  • [5] Non-linear Feature Selection Based on Convolution Neural Networks with Sparse Regularization
    Wu, Wen-Bin
    Chen, Si-Bao
    Ding, Chris
    Luo, Bin
    [J]. COGNITIVE COMPUTATION, 2024, 16 (02) : 654 - 670
  • [6] Non-linear speech feature extraction for phoneme classification and speaker recognition
    Chetouani, M
    Faundez-Zanuy, M
    Gas, B
    Zarader, JL
    [J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 344 - 350
  • [7] Perceptive, non-linear speech processing and spiking neural networks
    Rouat, J
    Pichevar, R
    Loiselle, S
    [J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 317 - 337
  • [8] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
    Zhu, QF
    Alwan, A
    [J]. COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402
  • [9] Non-linear feature extraction by redundancy reduction in an unsupervised stochastic neural network
    Deco, G
    Parra, L
    [J]. NEURAL NETWORKS, 1997, 10 (04) : 683 - 691
  • [10] Feature extraction using non-linear transformation for robust speech recognition on the AURORA database
    Sharma, S
    Ellis, D
    Kajarekar, S
    Jain, P
    Hermansky, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1117 - 1120