Acoustic features based on auditory model and adaptive fractional Fourier transform for speech recognition

被引:0
|
作者
YIN Hui XIE Xiang~+ KUANG Jingming (Department of Electronic Engineering
机构
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
MFCC; LFM; Acoustic features based on auditory model and adaptive fractional Fourier transform for speech recognition;
D O I
10.15949/j.cnki.0217-9776.2011.04.007
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
It is well known that auditory system of human beings has excellent performance which automatic speech recognition(ASR) systems can’t match,and fractional Fourier transform (FrFT) has unique advantages in non-stationary signal processing.In this paper,the Gammatone filterbank is applied to speech signals for front-end temporal filtering,and then acoustic features of the output subband signals are extracted based on fractional Fourier transform. Considering the critical effect of transform order for FrFT,an order adaptation method based on the instantaneous frequency is proposed,and its performance is compared with the method based on ambiguity function.ASR experiments are conducted on clean and noisy Putonghua digits,and the results show that the proposed features achieve significantly higher recognition rate than the MFCC baseline,and the order adaptation method based on instantaneous frequency has much lower complexity than that based on ambiguity function.Further more,the FrFT-based features achieve the highest recognition rate using the proposed order adaptation method.
引用
收藏
页码:453 / 463
页数:11
相关论文
共 50 条
  • [2] Adaptive-Order Fractional Fourier Transform Features for Speech Recognition
    Yin Hui
    Xie Xiang
    Kuang Jingming
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 654 - 657
  • [3] Fractional Fourier transform features for speech recognition
    Sarikaya, R
    Gao, YQ
    Saon, G
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 529 - 532
  • [4] Research on Speech Emotion Recognition Based on the Fractional Fourier Transform
    Huang, Lirong
    Shen, Xizhong
    [J]. ELECTRONICS, 2022, 11 (20)
  • [5] Emotion Recognition Based on Multiple Order Features Using Fractional Fourier Transform
    Ren, Bo
    Liu, Deyin
    Qi, Lin
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [6] Fractional Fourier Transform Based Adaptive Filtering Techniques for Acoustic Emission Signal Enhancement
    Prajna, K.
    Mukhopadhyay, C. K.
    [J]. JOURNAL OF NONDESTRUCTIVE EVALUATION, 2020, 39 (01)
  • [7] Fractional Fourier Transform Based Adaptive Filtering Techniques for Acoustic Emission Signal Enhancement
    K. Prajna
    C. K. Mukhopadhyay
    [J]. Journal of Nondestructive Evaluation, 2020, 39
  • [8] Fractional Fourier transform based features for speaker recognition using support vector machine
    Ajmera, Pawan K.
    Holambe, Raghunath S.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 550 - 557
  • [9] Auditory processing-based features for improving speech recognition in adverse acoustic conditions
    Maganti, Hari Krishna
    Matassoni, Marco
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [10] Auditory processing-based features for improving speech recognition in adverse acoustic conditions
    Hari Krishna Maganti
    Marco Matassoni
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014