Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words

被引:3
|
作者
Ali, Hazrat [1 ,5 ]
Ahmad, Nasir [2 ]
Zhou, Xianwei [1 ]
Ali, Muhammad [3 ]
Manjotho, Ali Asghar [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Commun Engn, Beijing 10083, Peoples R China
[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[3] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
[4] Mehran Univ Engn & Technol, Dept Comp Syst Engn, Jamshoro, Pakistan
[5] City Univ London, Sch Informat, Machine Learning Grp, London EC1V 0HB, England
关键词
Urdu automatic speech recognition; Mel frequency cepstral coefficients; Linear Discriminant Analysis; Isolated words recognition;
D O I
10.1007/978-3-319-10987-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. For each isolated word, 52 Mel Frequency Cepstral Coefficients have been extracted and based upon these coefficients; the classification has been achieved using Linear Discriminant Analysis. As a prototype, the system has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages while the testing has been done using audio samples of three speakers. It was determined that majority of words exhibit a percentage error of less than 33 %. Words with 100 % error were declared to be bad words. The work reported in this paper may serve as a strong baseline for future research work on Urdu ASR, especially for continuous speech recognition of Urdu.
引用
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [1] Automatic speech recognition of Urdu words using linear discriminant analysis
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (05) : 2369 - 2375
  • [2] Robustness of linear discriminant analysis in automatic speech recognition
    Katz, M
    Meier, HG
    Dolfing, H
    Klakow, D
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 371 - 374
  • [3] Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition
    Jakovljevic, N.
    Miskovic, D.
    Janev, M.
    Secujski, M.
    Delic, V.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (07) : 76 - 79
  • [4] Automatic Speech Recognition of Isolated Words in Hindi Language
    Wani, Priyanka
    Bormane, D. S.
    Patil, U. G.
    Shirbahadurkar, S. D.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [5] DWT features performance analysis for automatic speech recognition of Urdu
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Iqbal, Khalid
    Ali, Sahibzada Muhammad
    [J]. SPRINGERPLUS, 2014, 3 : 1 - 10
  • [6] Modified Linear Discriminant Analysis for speech recognition
    Li, Xiao-Bing
    O'Shaughnessy, Douglas
    [J]. 2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1598 - 1601
  • [7] The Development of Isolated Words Pashto Automatic Speech Recognition System
    Ahmed, Irfan
    Ahmad, Nasir
    Ali, Hazrat
    Ahmad, Gulzar
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 333 - 336
  • [8] Automatic digital modulation recognition based on robust linear discriminant analysis
    Shen, Wei-guo
    Wang, Wei
    [J]. WIRELESS COMMUNICATION AND SENSOR NETWORK, 2016, : 77 - 84
  • [9] On speech recognition of isolated words
    Teh, CC
    Jong, CC
    Siek, L
    [J]. ISIC-99: 8TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, DEVICES & SYSTEMS, PROCEEDINGS, 1999, : 431 - 434
  • [10] Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition
    Zhang, B
    Matsoukas, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 925 - 928