Speech recognition using filter-bank features

被引：0

作者：

Ravindran, S ^{[1
]}

Demiroglu, C ^{[1
]}

Anderson, DV ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2 | 2003年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Mel-frequency cepstral coefficients (MFCC) have been shown to be very useful in tasks of speech recognition and are the preferred features in state of the art speech recognition systems. We present features derived from filter bank outputs whose performance is comparable to that of MFCCs for connected digit recognition using a Hidden Markov Model (HMM) based speech recognition system. The feature extraction method we present is easily implementable in floating gate analog VLSI circuitry which makes it a viable option for low power speech recognition tasks.

引用

下载

页码：1900 / 1903

页数：4

共 50 条

[31] Accelerating the transient simulation of semiconductor devices using filter-bank transforms
Movahhedi, M
Abdipour, A
Dehghan, M
INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2006, 19 (01) : 47 - 67
[32] Accelerating the transient simulation of semiconductor devices using filter-bank transforms
Movahhedi, Masoud
Abdipour, Abdolali
GAAS 2005: 13TH EUROPEAN GALLIUM ARSENIDE AND OTHER COMPOUND SEMICONDUCTORS APPLICATION SYMPOSIUM, CONFERENCE PROCEEDINGS, 2005, : 477 - 480
[33] Compressive Sensing and Filter-Bank Signal Models
Vaidyanathan, P. P.
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 603 - 607
[34] Improvement of active microwave device Modeling using filter-bank transforms
Movahhedi, Masoud
Abdipour, Abdolah
35th European Microwave Conference, Vols 1-3, Conference Proceedings, 2005, : 1113 - 1116
[35] Auditory filter-bank compression improves estimation of signal-to-noise ratio for speech in noise
Liu, Fangqi
Demosthenous, Andreas
Yasin, Ifat
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (05): : 3197 - 3208
[36] Comparative study of filter-bank mean-energy distance for automated segmentation of speech signals
Ananthakrishnan, G.
Ranjani, H. G.
Ramakrishnan, A. G.
2007 INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING, COMMUNICATIONS AND NETWORKING, VOLS 1 AND 2, 2006, : 6 - +
[37] Parallel MRI reconstruction: A filter-bank approach
Ying, Leslie
Abdelsalam, Emad
2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 1374 - 1377
[38] On enhancing feature sequence filtering with filter-bank energy transformation in speaker verification with telephone speech
Garreton, Claudio
Becerra Yoma, Nestor
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1461 - 1464
[39] FILTER-BANK OPTIMIZATION FOR FREQUENCY DOMAIN LINEAR PREDICTION
Peddinti, Vijayaditya
Hermansky, Hynek
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7102 - 7106
[40] Separable spectro-temporal Gabor filter bank features: Reducing the complexity of robust features for automatic speech recognition
Schaedler, Marc Rene
Kollmeier, Birger
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (04): : 2047 - 2059

← 1 2 3 4 5 →