Adaptive-Order Fractional Fourier Transform Features for Speech Recognition

被引:0
|
作者
Yin Hui [1 ]
Xie Xiang [1 ]
Kuang Jingming [1 ]
机构
[1] Beijing Inst Technol, Dept Elect Engn, Beijing 100081, Peoples R China
关键词
fractional Fourier transform; speech recognition; feature extraction; ambiguity function;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). Since the transform order is critical for the performance of FrFT, we use the ambiguity function to adaptively determine the optimal orders of FrFT for each frame. The performance of the proposed feature is compared with traditional MFCCs on recognizing speech of isolated and connected digits under both clean and noisy backgrounds. The recognition results and detailed confusion matrices are given and analyzed, which implies that the proposed feature is promising in certain speech processing fields.
引用
收藏
页码:654 / 657
页数:4
相关论文
共 50 条
  • [1] Fractional Fourier transform features for speech recognition
    Sarikaya, R
    Gao, YQ
    Saon, G
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 529 - 532
  • [2] Acoustic features based on auditory model and adaptive fractional Fourier transform for speech recognition
    YIN Hui XIE Xiang~+ KUANG Jingming (Department of Electronic Engineering
    [J]. Chinese Journal of Acoustics, 2011, 30 (04) : 453 - 463
  • [4] Emotion Recognition Based on Multiple Order Features Using Fractional Fourier Transform
    Ren, Bo
    Liu, Deyin
    Qi, Lin
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [5] ORDER ADAPTATION OF THE FRACTIONAL FOURIER TRANSFORM USING THE INTRAFRAME PITCH CHANGE RATE FOR SPEECH RECOGNITION
    Yin, Hui
    Nadeu, Climent
    Hohmann, Volker
    Xie, Xiang
    Kuang, Jingming
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 193 - 196
  • [6] Research on Speech Emotion Recognition Based on the Fractional Fourier Transform
    Huang, Lirong
    Shen, Xizhong
    [J]. ELECTRONICS, 2022, 11 (20)
  • [7] Speaker recognition using features derived from fractional Fourier transform
    Wang, JF
    Wang, JB
    [J]. Fourth IEEE Workshop on Automatic Identification Advanced Technologies, Proceedings, 2005, : 95 - 100
  • [8] Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition
    Yin, Hui
    Nadeu, Climent
    Hohmann, Volker
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [9] Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition
    Hui Yin
    Climent Nadeu
    Volker Hohmann
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2009
  • [10] Adaptive harmonic fractional Fourier transform
    Zhang, F
    Chen, YQ
    Bi, G
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (11) : 281 - 283