Scale-transform based features for application in speech recognition

被引:2
|
作者
Umesh, S [1 ]
Cohen, L [1 ]
Nelson, D [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
关键词
D O I
10.1117/12.366828
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We report recognition results using scale-transform based cepstral features in a telephone based digit recognition task The method is based on the use of scale-transform based features for speaker-independent applications, which are insensitive to linear-frequency scaling effects and therefore reduce inter-speaker variability due to differences in vocal-tract lengths. We have implemented a digit recognition task using the proposed scale-transform based features and have compared the recognition accuracy obtained when compared to using mel-cepstrum based front-end features.
引用
收藏
页码:727 / 731
页数:3
相关论文
共 50 条
  • [1] Improvements in scale-transform based features for speech analysis
    Umesh, S
    Cohen, L
    Nelson, D
    [J]. WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING V, 1997, 3169 : 481 - 494
  • [2] Fractional Fourier transform features for speech recognition
    Sarikaya, R
    Gao, YQ
    Saon, G
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 529 - 532
  • [3] Visual speech recognition using wavelet transform and moment based features
    Yau, Wai C.
    Kumar, Dinesh K.
    Arjunan, Sridhar P.
    Kumar, Sanjay
    [J]. ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345
  • [4] Speech Recognition using Hilbert-Huang Transform Based Features
    Hanna, Samer S.
    Korany, Noha
    Abd-el-Malek, Mina B.
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 338 - 341
  • [5] Robust speech features based on wavelet transform with application to speaker identification
    Hsieh, CT
    Lai, E
    Wang, YC
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (02): : 108 - 114
  • [6] Region Dependent Transform on MLP Features for Speech Recognition
    Ng, Tim
    Zhang, Bing
    Matsoukas, Spyros
    Long Nguyen
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 228 - 231
  • [7] Wavelet Transform Based Features Vector Extraction in Isolated Words Speech Recognition System
    Al-Qaraawi, Salih M.
    Mahmood, Sarah Shukur
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2014, : 847 - 850
  • [8] Acoustic features based on auditory model and adaptive fractional Fourier transform for speech recognition
    YIN Hui XIE Xiang~+ KUANG Jingming (Department of Electronic Engineering
    [J]. Chinese Journal of Acoustics, 2011, 30 (04) : 453 - 463
  • [9] AFFINE INVARIANT FEATURES AND THEIR APPLICATION TO SPEECH RECOGNITION
    Qiao, Yu
    Suzuki, Masayuki
    Minematsu, Nobuaki
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4629 - 4632
  • [10] Hilbert Huang Transform based Speech Recognition
    Vani, H. Y.
    Anusuya, M. A.
    [J]. 2016 SECOND INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2016,