Automatic speech recognition of Urdu words using linear discriminant analysis

被引:6
|
作者
Ali, Hazrat [1 ]
Ahmad, Nasir [2 ]
Zhou, Xianwei [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 10083, Peoples R China
[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
Urdu automatic speech recognition; mel frequency cepstral coefficients; linear discriminant analysis;
D O I
10.3233/IFS-151554
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu is amongst the five largest languages of the world and possess a very important role as it shares its vocabulary with languages as Arabic, Persian, Hindi and several other languages of the Indo-Pak. The Automatic Speech Recognition task of Urdu has not been addressed significantly. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. The proposed approach is based on calculation of 52 Mel Frequency Cepstral Coefficients for each isolated word. The classification has been achieved with Linear Discriminant Analysis. The successful or incorrect matches have been presented in the Confusion Matrix. As a prototype, the framework has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages. The test set comprises of audio data of three speaker. For each isolated, percentage error has been calculated. It was found that majority of the words are recognized with percentage error less than 33%. Some words suffer 100% error and were referred to be the bad words. This work may provide a baseline for further research on Urdu Automatic Speech Recognition.
引用
收藏
页码:2369 / 2375
页数:7
相关论文
共 50 条
  • [1] Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Ali, Muhammad
    Manjotho, Ali Asghar
    [J]. COMMUNICATION TECHNOLOGIES, INFORMATION SECURITY AND SUSTAINABLE DEVELOPMENT, 2014, 414 : 24 - 34
  • [2] Robustness of linear discriminant analysis in automatic speech recognition
    Katz, M
    Meier, HG
    Dolfing, H
    Klakow, D
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 371 - 374
  • [3] Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition
    Jakovljevic, N.
    Miskovic, D.
    Janev, M.
    Secujski, M.
    Delic, V.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (07) : 76 - 79
  • [4] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    [J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
  • [5] DWT features performance analysis for automatic speech recognition of Urdu
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Iqbal, Khalid
    Ali, Sahibzada Muhammad
    [J]. SPRINGERPLUS, 2014, 3 : 1 - 10
  • [6] Modified Linear Discriminant Analysis for speech recognition
    Li, Xiao-Bing
    O'Shaughnessy, Douglas
    [J]. 2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1598 - 1601
  • [8] Automatic recognition of animal vocalizations using averaged MFCC and linear discriminant analysis
    Lee, CH
    Chou, CH
    Han, CC
    Huang, RZ
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (02) : 93 - 101
  • [9] Probabilistic Linear Discriminant Analysis with Bottleneck Features for Speech Recognition
    Lu, Liang
    Renals, Steve
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 910 - 914
  • [10] ROBUST AUDIOVISUAL SPEECH RECOGNITION USING NOISE-ADAPTIVE LINEAR DISCRIMINANT ANALYSIS
    Zeiler, Steffen
    Nickel, Robert
    Ma, Ning
    Brown, Guy J.
    Kolossa, Dorothea
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2797 - 2801