Speech recognition using filter-bank features

被引:0
|
作者
Ravindran, S [1 ]
Demiroglu, C [1 ]
Anderson, DV [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mel-frequency cepstral coefficients (MFCC) have been shown to be very useful in tasks of speech recognition and are the preferred features in state of the art speech recognition systems. We present features derived from filter bank outputs whose performance is comparable to that of MFCCs for connected digit recognition using a Hidden Markov Model (HMM) based speech recognition system. The feature extraction method we present is easily implementable in floating gate analog VLSI circuitry which makes it a viable option for low power speech recognition tasks.
引用
下载
收藏
页码:1900 / 1903
页数:4
相关论文
共 50 条
  • [1] Generalized Filter-bank Features for Robust Speech Recognition Against Reverberation
    Pardede, Hilman F.
    Zilvan, Vicky
    Krisnandi, Dikdik
    Heryana, Ana
    Kusumo, R. Budiarianto S.
    2019 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2019, : 19 - 24
  • [2] Optimization of filter-bank to improve the extraction of MFCC features in speech recognition
    Hung, JW
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 675 - 678
  • [3] Filtering of filter-bank energies for robust speech recognition
    Jung, HY
    ETRI JOURNAL, 2004, 26 (03) : 273 - 276
  • [4] INSTANTANEOUS FREQUENCY FILTER-BANK FEATURES FOR LOW RESOURCE SPEECH RECOGNITION USING DEEP RECURRENT ARCHITECTURES
    Nayak, Shekhar
    Kumar, C. Shiva
    Murty, K. Sri Rama
    2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 105 - 110
  • [5] Frequency and time filtering of filter-bank energies for HMM speech recognition
    Nadeu, C
    Marino, JB
    Hernando, J
    Nogueiras, A
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 430 - 433
  • [6] An adaptive filter-bank equalizer for speech enhancement
    Vary, P
    SIGNAL PROCESSING, 2006, 86 (06) : 1206 - 1214
  • [7] A Level-dependent Auditory Filter-bank for Speech Recognition in Reverberant Environments
    Maganti, HariKrishna
    Matassoni, Marco
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 692 - 695
  • [8] Time and frequency filtering of filter-bank energies for robust HMM speech recognition
    Nadeu, C
    Macho, D
    Hernando, J
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 93 - 114
  • [9] Bilinear map of filter-bank outputs for DNN-based speech recognition
    Ogawa, Tetsuji
    Ueda, Kenshiro
    Katsurada, Kouichi
    Kobayashi, Tetsunori
    Nitta, Tsuneo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 16 - 20
  • [10] Enhancing robustness for speech recognition through bio-inspired auditory filter-bank
    Maganti, Hari Krishna
    Matassoni, Marco
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2012, 4 (05) : 271 - 277