Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words

被引:3
|
作者
Ali, Hazrat [1 ,5 ]
Ahmad, Nasir [2 ]
Zhou, Xianwei [1 ]
Ali, Muhammad [3 ]
Manjotho, Ali Asghar [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Commun Engn, Beijing 10083, Peoples R China
[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[3] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
[4] Mehran Univ Engn & Technol, Dept Comp Syst Engn, Jamshoro, Pakistan
[5] City Univ London, Sch Informat, Machine Learning Grp, London EC1V 0HB, England
关键词
Urdu automatic speech recognition; Mel frequency cepstral coefficients; Linear Discriminant Analysis; Isolated words recognition;
D O I
10.1007/978-3-319-10987-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. For each isolated word, 52 Mel Frequency Cepstral Coefficients have been extracted and based upon these coefficients; the classification has been achieved using Linear Discriminant Analysis. As a prototype, the system has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages while the testing has been done using audio samples of three speakers. It was determined that majority of words exhibit a percentage error of less than 33 %. Words with 100 % error were declared to be bad words. The work reported in this paper may serve as a strong baseline for future research work on Urdu ASR, especially for continuous speech recognition of Urdu.
引用
下载
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [41] Generalization of linear discriminant analysis used in segmental unit input HMM for speech recognition
    Sakai, Makoto
    Kitaoka, Norihide
    Nakagawa, Seiichi
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 333 - +
  • [42] Automatic Pattern Recognition of Epileptiform Discharges Using Morphological Descriptors and Linear Discriminant Analysis
    Boos, Christine Fredel
    de Azevedo, Fernando Mendes
    2013 IEEE XXXIII INTERNATIONAL SCIENTIFIC CONFERENCE ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2013, : 293 - 296
  • [43] Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition
    Masumura, Ryo
    Asami, Taichi
    Oba, Takanobu
    Masataki, Hirokazu
    Sakauchi, Sumitaka
    Ito, Akinori
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (06): : 1581 - 1590
  • [44] The Hardware Accelerator of The Automatic Speech Recognition for The Continuous Korean Words
    Kim, Juyeob
    Kim, Yunjoo
    Kim, Wonjong
    Lee, Joohyun
    2015 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2015, : 213 - 214
  • [45] Automatic detection of accent nuclei at the head of words for speech recognition
    Minematsu, N
    Nakagawa, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1620 - 1623
  • [46] Automatic Recognition of Target Words in Infant-Directed Speech
    van der Klis, Anika
    Adriaans, Frans
    Han, Mengru
    Kager, Rene
    COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 522 - 522
  • [47] Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition
    Chou, W
    PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1201 - 1223
  • [48] Automatic Speech Recognition (ASR) based Approach for Speech Therapy of Aphasic Patients: A Review
    Jamal, Norezmi
    Shanta, Shahnoor
    Mahmud, Farhanahani
    Sha'abani, M. N. A. H.
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING: FROM THEORY TO APPLICATIONS, 2017, 1883
  • [49] Local improvement approach and linear discriminant analysis-based local binary pattern for face recognition
    Najafi Khanbebin, Saeed
    Mehrdad, Vahid
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (13): : 7691 - 7707
  • [50] Isolated Words Recognition System Based on Hybrid Approach DTW/GHMM
    Bourouba, E-Hocine
    Bedda, Mouldi
    Djemili, Rafik
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2006, 30 (03): : 373 - 382