Speech recognition using filter-bank features

被引:0
|
作者
Ravindran, S [1 ]
Demiroglu, C [1 ]
Anderson, DV [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mel-frequency cepstral coefficients (MFCC) have been shown to be very useful in tasks of speech recognition and are the preferred features in state of the art speech recognition systems. We present features derived from filter bank outputs whose performance is comparable to that of MFCCs for connected digit recognition using a Hidden Markov Model (HMM) based speech recognition system. The feature extraction method we present is easily implementable in floating gate analog VLSI circuitry which makes it a viable option for low power speech recognition tasks.
引用
下载
收藏
页码:1900 / 1903
页数:4
相关论文
共 50 条
  • [31] Accelerating the transient simulation of semiconductor devices using filter-bank transforms
    Movahhedi, M
    Abdipour, A
    Dehghan, M
    INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2006, 19 (01) : 47 - 67
  • [32] Accelerating the transient simulation of semiconductor devices using filter-bank transforms
    Movahhedi, Masoud
    Abdipour, Abdolali
    GAAS 2005: 13TH EUROPEAN GALLIUM ARSENIDE AND OTHER COMPOUND SEMICONDUCTORS APPLICATION SYMPOSIUM, CONFERENCE PROCEEDINGS, 2005, : 477 - 480
  • [33] Compressive Sensing and Filter-Bank Signal Models
    Vaidyanathan, P. P.
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 603 - 607
  • [34] Improvement of active microwave device Modeling using filter-bank transforms
    Movahhedi, Masoud
    Abdipour, Abdolah
    35th European Microwave Conference, Vols 1-3, Conference Proceedings, 2005, : 1113 - 1116
  • [35] Auditory filter-bank compression improves estimation of signal-to-noise ratio for speech in noise
    Liu, Fangqi
    Demosthenous, Andreas
    Yasin, Ifat
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (05): : 3197 - 3208
  • [36] Comparative study of filter-bank mean-energy distance for automated segmentation of speech signals
    Ananthakrishnan, G.
    Ranjani, H. G.
    Ramakrishnan, A. G.
    2007 INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING, COMMUNICATIONS AND NETWORKING, VOLS 1 AND 2, 2006, : 6 - +
  • [37] Parallel MRI reconstruction: A filter-bank approach
    Ying, Leslie
    Abdelsalam, Emad
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 1374 - 1377
  • [38] On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech
    Garreton, Claudio
    Becerra Yoma, Nestor
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1461 - 1464
  • [39] FILTER-BANK OPTIMIZATION FOR FREQUENCY DOMAIN LINEAR PREDICTION
    Peddinti, Vijayaditya
    Hermansky, Hynek
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7102 - 7106
  • [40] Separable spectro-temporal Gabor filter bank features: Reducing the complexity of robust features for automatic speech recognition
    Schaedler, Marc Rene
    Kollmeier, Birger
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (04): : 2047 - 2059