Bi-mel-scale frequency cepstrum and its application in telephone speech recognition

被引:0
|
作者
CHEN Jingdong
XU Bo
HUANG Taiyi(National Laboratory of Pattern Recognition
机构
关键词
LPCC; IEEE; BMFC;
D O I
10.15949/j.cnki.0217-9776.1998.03.007
中图分类号
TN912 [电声技术和语音信号处理];
学科分类号
081002 ;
摘要
A new kind of feature for speech recognition, called Bi-Mel-scale Frequency Cepstrum (BMFC) is proposed. To calculate the BMFC, the speech signal is first segmented into short intervals and the bispectrum of each segment is estimated; then the bispectrum is smoothed by two-dimensional mel-scale inverse filter bank; finally, the bi-mel-scale frequency cepstrum coefficients are obtained by decorrelating the outputs of the filter bank with twodimensional Discrete Cosine Transform (DCT). Preliminary experiments show that the new feature can improve the performance of a telephone speech recognizer and is more robust to white noise than the conventional LPCC (Linear Prediction Coefficients) and MFCC (Mel-scale Frequency Cepstrum Coefficents) used in speech recognition
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [1] Application of cepstrum algorithms for speech recognition
    Al-Shrouf, A
    Abu Zitar, R
    Al-Khayri, A
    Abu Arqub, M
    [J]. DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, 2003, 2718 : 768 - 778
  • [2] A Study of Speech, Speaker and Emotion Recognition using Mel Frequency Cepstrum Coefficients and Support Vector Machines
    Rajasekhar, Ashwini
    Hota, Malaya Kumar
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 114 - 118
  • [3] Automatic speech recognition using Mel-frequency cepstrum coefficient (MFCC) and vector quantization (VQ) techniques for continuous speech
    Verma, Amit
    Kumar, Amit
    Kaur, Iqbaldeep
    [J]. INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2018, 5 (04): : 73 - 78
  • [4] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Pawar, Manju D.
    Kokate, Rajendra D.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15563 - 15587
  • [5] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Manju D. Pawar
    Rajendra D. Kokate
    [J]. Multimedia Tools and Applications, 2021, 80 : 15563 - 15587
  • [6] Evaluation of MEL-LPC cepstrum in a large vocabulary continuous speech recognition
    Matsumoto, H
    Moroto, M
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 117 - 120
  • [7] On desensitizing the Mel-cepstrum to spurious spectral components for robust speech recognition
    Tyagi, V
    Wellekens, C
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 529 - 532
  • [8] RECOGNITION OF NON-SPEECH SOUNDS USING MEL-FREQUENCY CEPSTRUM COEFFICIENTS AND DYNAMIC TIME WARPING METHOD
    Disken, Gokay
    Ibrikci, Turgay
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 144 - 147
  • [9] IMPROVEMENTS ON MEL-FREQUENCY CEPSTRUM MINIMUM-MEAN-SQUARE-ERROR NOISE SUPPRESSOR FOR ROBUST SPEECH RECOGNITION
    Yu, Dong
    Deng, Li
    Wu, Jian
    Gong, Yifan
    Acero, Alex
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 69 - 72
  • [10] Hiligaynon Language 5-Word Vocabulary Speech Recognition Using Mel Frequency Cepstrum Coefficients and Genetic Algorithm
    Billones, Robert Kerwin C.
    Dadios, Elmer P.
    [J]. 2014 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2014,