Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words

被引:3
|
作者
Ali, Hazrat [1 ,5 ]
Ahmad, Nasir [2 ]
Zhou, Xianwei [1 ]
Ali, Muhammad [3 ]
Manjotho, Ali Asghar [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Commun Engn, Beijing 10083, Peoples R China
[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[3] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
[4] Mehran Univ Engn & Technol, Dept Comp Syst Engn, Jamshoro, Pakistan
[5] City Univ London, Sch Informat, Machine Learning Grp, London EC1V 0HB, England
关键词
Urdu automatic speech recognition; Mel frequency cepstral coefficients; Linear Discriminant Analysis; Isolated words recognition;
D O I
10.1007/978-3-319-10987-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. For each isolated word, 52 Mel Frequency Cepstral Coefficients have been extracted and based upon these coefficients; the classification has been achieved using Linear Discriminant Analysis. As a prototype, the system has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages while the testing has been done using audio samples of three speakers. It was determined that majority of words exhibit a percentage error of less than 33 %. Words with 100 % error were declared to be bad words. The work reported in this paper may serve as a strong baseline for future research work on Urdu ASR, especially for continuous speech recognition of Urdu.
引用
下载
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [31] A kernel approach to implementation of local linear discriminant analysis for face recognition
    Shi, Zhan
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2017, 12 (01) : 62 - 70
  • [32] A novel face recognition method based on linear discriminant analysis
    Zhang, YK
    Liu, CQ
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2003, 22 (05) : 327 - 330
  • [33] Rejection measurement based on linear discriminant analysis for document recognition
    He, Chun Lei
    Lam, Louisa
    Suen, Ching Y.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2011, 14 (03) : 263 - 272
  • [34] Realization of Isolated-words Speech Recognition System
    Ren Wenxia
    Zhang Huili
    Lv Wenzhe
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 353 - 355
  • [35] Rejection measurement based on linear discriminant analysis for document recognition
    Chun Lei He
    Louisa Lam
    Ching Y. Suen
    International Journal on Document Analysis and Recognition (IJDAR), 2011, 14 : 263 - 272
  • [36] DURIAN RECOGNITION BASED ON MULTIPLE FEATURES AND LINEAR DISCRIMINANT ANALYSIS
    Mustaffa, Mas Rina
    Yi, Nyon Xin
    Abdullah, Lili Nurliyana
    Nasharuddin, Nurul Amelina
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2018, 31 (05) : 57 - 72
  • [37] Linear Discriminant Analysis for Face Recognition
    Cheflali, Fatma Zohra
    Djeradi, A.
    Djeradi, R.
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 1 - +
  • [38] Automatic Speech Recognition System Based on Wavelet Analysis
    Ziolko, Mariusz
    Galka, Jakub
    Ziolko, Bartosz
    Jadczyk, Tomasz
    Skurzok, Dawid
    Wicijowski, Jan
    2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 450 - 451
  • [39] ROBUST AUDIOVISUAL SPEECH RECOGNITION USING NOISE-ADAPTIVE LINEAR DISCRIMINANT ANALYSIS
    Zeiler, Steffen
    Nickel, Robert
    Ma, Ning
    Brown, Guy J.
    Kolossa, Dorothea
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2797 - 2801
  • [40] A STATISTICAL APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
    SMITH, JEK
    KLEM, L
    AMERICAN PSYCHOLOGIST, 1961, 16 (07) : 445 - 445