DWT features performance analysis for automatic speech recognition of Urdu

Cited by: 15
Authors
Ali, Hazrat [1, 2]
Ahmad, Nasir [3]
Zhou, Xianwei [2]
Iqbal, Khalid [2]
Ali, Sahibzada Muhammad [4]
Affiliations
[1] City Univ London, Dept Comp, Machine Learning Grp, London EC1V 0HB, England
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[3] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[4] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
Source
SPRINGERPLUS | 2014 / Vol. 3
Keywords
Automatic speech recognition; Discrete wavelet transforms; Linear discriminant analysis; Mel-frequency cepstral coefficients; Urdu isolated words recognition; WAVELET;
DOI
10.1186/2193-1801-3-204
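Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code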
07 ; 0710 ; 09 ;
Abstract
This paper presents work on automatic speech recognition of the Urdu language through a comparative analysis of Discrete Wavelet Transform (DWT) based features and Mel-Frequency Cepstral Coefficients (MFCC). The features were extracted for one hundred isolated Urdu words, each uttered by ten different speakers. The words were selected from the most frequently used words of Urdu, and a balanced corpus approach was used to cover a variety of ages and dialects. After feature extraction, classification was performed using Linear Discriminant Analysis. The confusion matrix obtained for the DWT features was then compared with the one obtained for MFCC-based speech recognition. The framework was trained and tested on speech data recorded under controlled environments. The experimental results are useful for determining the optimum features for the speech recognition task.
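As a rough illustration of the kind of pipeline the abstract describes, the sketch below computes wavelet sub-band energies from a signal and tallies a confusion matrix from predicted labels. It is not the authors' implementation: the Haar wavelet, the three decomposition levels, the mean-energy statistic, and the function names `haar_dwt`, `dwt_energy_features`, and `confusion_matrix` are all assumptions made for this example, since the abstract does not state them. The Linear Discriminant Analysis classifier itself is omitted.

```python
import math


def haar_dwt(signal):
    """One level of the orthonormal Haar DWT (assumed wavelet, for illustration).

    Returns (approximation, detail) coefficient lists.
    """
    signal = list(signal)
    if len(signal) % 2:
        signal.append(signal[-1])  # pad odd-length input by repeating the last sample
    scale = 1 / math.sqrt(2)
    approx = [(signal[i] + signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * scale for i in range(0, len(signal), 2)]
    return approx, detail


def dwt_energy_features(signal, levels=3):
    """Mean energy of each detail band, followed by the final approximation band.

    The choice of statistic and number of levels is hypothetical.
    """
    features = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        features.append(sum(c * c for c in detail) / len(detail))
    features.append(sum(c * c for c in approx) / len(approx))
    return features


def confusion_matrix(true_labels, predicted_labels, num_classes):
    """Rows index the true class, columns index the predicted class."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(true_labels, predicted_labels):
        matrix[t][p] += 1
    return matrix
```

In a comparison like the one in the abstract, one such confusion matrix would be built per feature type (DWT and MFCC) from the same test utterances, and the two would be compared word by word.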
Pages: 1 / 10
Number of pages: 10
Related papers
50 in total
  • [1] Automatic speech recognition of Urdu words using linear discriminant analysis
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (05) : 2369 - 2375
  • [2] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    [J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
  • [3] Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Ali, Muhammad
    Manjotho, Ali Asghar
    [J]. COMMUNICATION TECHNOLOGIES, INFORMATION SECURITY AND SUSTAINABLE DEVELOPMENT, 2014, 414 : 24 - 34
  • [4] Performance Analysis and Optimization of Automatic Speech Recognition
    Tabani, Hamid
    Arnau, Jose-Maria
    Tubella, Jordi
    Gonzalez, Antonio
    [J]. IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2018, 4 (04) : 847 - 860
  • [5] SNR Features for Automatic Speech Recognition
    Garner, Philip N.
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 182 - 187
  • [6] Urdu Speech Emotion Recognition using Speech Spectral Features and Deep Learning Techniques
    Taj, Soonh
    Shaikh, Ghulam Mujtaba
    Hassan, Saif
    Nimra
    [J]. 2023 4th International Conference on Computing, Mathematics and Engineering Technologies: Sustainable Technologies for Socio-Economic Development, iCoMET 2023, 2023,
  • [7] Topological invariants as speech features for automatic speech recognition
    Kacur, Juraj
    Chudy, Vladimir
    [J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2014, 7 (04) : 235 - 244
  • [8] ADAPTIVE BOOSTING FEATURES FOR AUTOMATIC SPEECH RECOGNITION
    Kham Nguyen
    Ng, Tim
    Long Nguyen
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4733 - 4736
  • [9] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622