Scale-transform based features for application in speech recognition

Cited by: 2

Authors:

Umesh, S [1]

Cohen, L [1]

Nelson, D [1]

Affiliation:

[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India

Source:

WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING VII | 1999 / Vol. 3813

Keywords:

DOI:

10.1117/12.366828

Chinese Library Classification number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

We report recognition results using scale-transform based cepstral features in a telephone-based digit recognition task. The method uses scale-transform based features for speaker-independent applications; these features are insensitive to linear frequency scaling and therefore reduce inter-speaker variability due to differences in vocal-tract length. We have implemented a digit recognition task using the proposed scale-transform based features and have compared the resulting recognition accuracy with that obtained using mel-cepstrum based front-end features.
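
The insensitivity to linear frequency scaling mentioned in the abstract can be illustrated numerically. The sketch below is not the paper's front end. It assumes the standard scale-transform definition D(c) = (1/sqrt(2*pi)) * integral of f(t) * exp((-jc - 1/2) ln t) dt, evaluated on a log-spaced grid, and a hypothetical toy spectral envelope (`spectrum`) standing in for a real speech spectrum. The function names, grid limits, and the scaling factor `a` are all illustrative assumptions.

```python
import numpy as np

def scale_transform(f, c, u_min=-12.0, u_max=12.0, n=4096):
    """Numerically evaluate the scale transform of f at scale values c.

    With the substitution t = exp(u), the transform becomes an ordinary
    Fourier integral of f(exp(u)) * exp(u / 2) over u.
    """
    u = np.linspace(u_min, u_max, n)
    du = u[1] - u[0]
    warped = f(np.exp(u)) * np.exp(u / 2.0)       # log-warped, energy-normalised
    kernel = np.exp(-1j * np.outer(c, u))         # exp(-j c u) for every (c, u)
    return (kernel @ warped) * du / np.sqrt(2.0 * np.pi)

def spectrum(freq):
    # Hypothetical toy envelope: a log-Gaussian bump centred at freq = 1.
    return np.exp(-0.5 * (np.log(freq) / 0.4) ** 2)

a = 1.2  # assumed frequency-scaling factor (e.g. a vocal-tract length ratio)

def scaled_spectrum(freq):
    # Same envelope with the frequency axis linearly scaled by a.
    return np.sqrt(a) * spectrum(a * freq)

c = np.linspace(0.0, 8.0, 33)
D_ref = scale_transform(spectrum, c)
D_scaled = scale_transform(scaled_spectrum, c)

# Scaling only multiplies D(c) by a phase factor, so magnitudes agree.
max_mag_diff = np.max(np.abs(np.abs(D_ref) - np.abs(D_scaled)))
max_complex_diff = np.max(np.abs(D_ref - D_scaled))
print(max_mag_diff, max_complex_diff)
```

Under these assumptions the two magnitude curves coincide up to numerical error, while the complex values differ by a phase term, which is why magnitude-based features are unaffected by the scaling.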

Pages: 727-731

Number of pages: 3
