Multi-band based recognition of spoken Arabic numerals using wavelet transform

被引:2
|
作者
Alkhaldi, W [1 ]
Fakhr, W [1 ]
Hamdy, N [1 ]
机构
[1] Arab Acad Sci & Technol, Alexandria, Egypt
关键词
D O I
10.1109/NRSC.2002.1022626
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic speech recognition (ASR) using multi-band decoruposition provides high recognition rates especially in noisy environments. The Discrete Wavelet Transform (DWT) is known to be an efficient tool for decomposing signals into frequency sub-bands;g In this paper, the concept of Feature Recombination (FC) as applied to the recognition of spoken Arabic numerals is suggested. Utterances are decomposed using DWT before Cepstral coefficients of the resulting sub-bands are calculated. The obtained coefficients are concatenated to form a single feature vector that is used as an input to the speech classifier, e.g. a Hidden Markov Model (HMM), to compute the likelihood. Simulation results have demonstrated that the achieved correct recognition rates using the suggested method are comparable with the full-band ASR (conventional) system.
引用
收藏
页码:224 / 229
页数:6
相关论文
共 50 条
  • [1] Image Fusion Method Based on Multi-band Wavelet Transform
    Wang, YuanGan
    [J]. ADVANCES IN COMPUTER SCIENCE, INTELLIGENT SYSTEM AND ENVIRONMENT, VOL 3, 2011, 106 : 151 - 157
  • [2] Spoken Arabic Digits Recognition Using Discrete Wavelet
    Elrgaby, Mohammed
    Amoura, Abdwahab
    Ganoun, Ali
    [J]. 2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 275 - 279
  • [3] Single Image Ear Recognition Using Wavelet-Based Multi-Band PCA
    Zarachoff, Matthew
    Sheikh-Akbari, Akbar
    Monekosso, Dorothy
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [4] Feature extraction with wavelet transform for recognition of isolated Handwritten Farsi/Arabic characters and numerals
    Mowlaei, A
    Faez, K
    Haghighat, AT
    [J]. DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 923 - 926
  • [5] Spoken Arabic Digits Recognition Based on Wavelet Neural Networks
    Hu, Xiaohui
    Zhan, Lvjun
    Xue, Yun
    Zhou, Weixing
    Zhang, Liangjun
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 1481 - 1485
  • [6] Denoising of MST Radar Signal Using Multi-Band Wavelet Transform with Improved Thresholding
    Chandraiah, G.
    Reddy, T. Sreenivasulu
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 1026 - 1030
  • [7] Non-Decimated Wavelet Based Multi-Band Ear Recognition Using Principal Component Analysis
    Zarachoff, Matthew Martin
    Sheikh-Akbari, Akbar
    Monekosso, Dorothy
    [J]. IEEE ACCESS, 2022, 10 : 3949 - 3961
  • [8] Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron
    Ben Nasr, Mohammed
    Talbi, Mourad
    Cherif, Adnane
    [J]. 2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 803 - 808
  • [9] Persian/arabic handwritten word recognition using M-band packet wavelet transform
    Broumandnia, A.
    Shanbehzadeh, J.
    Varnoosfaderani, M. Rezakhah
    [J]. IMAGE AND VISION COMPUTING, 2008, 26 (06) : 829 - 842
  • [10] Ear Recognition using Chainlet based Multi-Band SVM
    Zarachoff, Matthew Martin
    Sheikh-Akbari, Akbar
    Monekosso, Dorothy
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST), 2021,