Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words

被引:3
|
作者
Ali, Hazrat [1 ,5 ]
Ahmad, Nasir [2 ]
Zhou, Xianwei [1 ]
Ali, Muhammad [3 ]
Manjotho, Ali Asghar [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Commun Engn, Beijing 10083, Peoples R China
[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[3] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
[4] Mehran Univ Engn & Technol, Dept Comp Syst Engn, Jamshoro, Pakistan
[5] City Univ London, Sch Informat, Machine Learning Grp, London EC1V 0HB, England
关键词
Urdu automatic speech recognition; Mel frequency cepstral coefficients; Linear Discriminant Analysis; Isolated words recognition;
D O I
10.1007/978-3-319-10987-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. For each isolated word, 52 Mel Frequency Cepstral Coefficients have been extracted and based upon these coefficients; the classification has been achieved using Linear Discriminant Analysis. As a prototype, the system has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages while the testing has been done using audio samples of three speakers. It was determined that majority of words exhibit a percentage error of less than 33 %. Words with 100 % error were declared to be bad words. The work reported in this paper may serve as a strong baseline for future research work on Urdu ASR, especially for continuous speech recognition of Urdu.
引用
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [21] Speech Emotion Recognition Based on Linear Discriminant Analysis and Support Vector Machine Decision Tree
    Mao, Jun-Wei
    He, Yong
    Liu, Zhen-Tao
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5529 - 5533
  • [22] MERGING LINEAR DISCRIMINANT ANALYSIS WITH BAG OF WORDS MODEL FOR HUMAN ACTION RECOGNITION
    Iostfidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 832 - 836
  • [23] AN APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
    PAY, BE
    EVANS, CR
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1981, 14 (01): : 13 - 27
  • [24] Face Recognition based on Fuzzy Linear Discriminant Analysis
    Zhang, Genyuan
    INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SUPPORTED EDUCATION, 2012, 2 : 873 - 879
  • [25] A speech recognition method of isolated words based on modified LPC cepstrum
    Zhang, Xueying
    Guo, Yueling
    Hou, Xuemei
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 481 - 484
  • [26] Dynamic Time Warping Based Speech Recognition for Isolated Sinhala Words
    Priyadarshani, P. G. N.
    Dias, N. G. J.
    Punchihewa, Amal
    2012 IEEE 55TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2012, : 892 - 895
  • [27] INFLUENCE OF THE SURROUNDINGS ON THE AUTOMATIC RECOGNITION OF ISOLATED WORDS
    HIRSCH, HG
    ACUSTICA, 1988, 66 (04): : 197 - 202
  • [29] Automatic recognition of animal vocalizations using averaged MFCC and linear discriminant analysis
    Lee, CH
    Chou, CH
    Han, CC
    Huang, RZ
    PATTERN RECOGNITION LETTERS, 2006, 27 (02) : 93 - 101
  • [30] An acoustic emission based approach for damage pattern recognition in composite using linear discriminant analysis
    Liu, Ran
    Qiao, Shuai
    Li, Chun-li
    Ma, Lian-hua
    Zhou, Wei
    Li, Qing
    COMPOSITES AND ADVANCED MATERIALS, 2024, 33