Urdu Speech Corpus and Preliminary Results on Speech Recognition

被引:4
|
作者
Ali, Hazrat [1 ]
Ahmad, Nasir [2 ]
Hafeez, Abdul [2 ]
机构
[1] COMSATS Inst Informat Technol, Dept Elect Engn, Abbottabad, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
D O I
10.1007/978-3-319-44188-7_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language resources for Urdu language are not well developed. In this work, we summarize our work on the development of Urdu speech corpus for isolated words. The Corpus comprises of 250 isolated words of Urdu recorded by ten individuals. The speakers include both native and non-native, male and female individuals. The corpus can be used for both speech and speaker recognition tasks. We also report our results on automatic speech recognition task for the said corpus. The framework extracts Mel Frequency Cepstral Coefficients along with the velocity and acceleration coefficients, which are then fed to different classifiers to perform recognition task. The classifiers used are Support Vector Machines, Random Forest and Linear Discriminant Analysis. Experimental results show that the best results are provided by the Support Vector Machines with a test set accuracy of 73 %. The results reported in this work may provide a useful baseline for future research on automatic speech recognition of Urdu.
引用
下载
收藏
页码:317 / 325
页数:9
相关论文
共 50 条
  • [1] An Urdu speech corpus for emotion recognition
    Asghar, Awais
    Sohaib, Sarmad
    Iftikhar, Saman
    Sha, Muhammad
    Fatima, Kiran
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [2] DESCU: Dyadic emotional speech corpus and recognition system for Urdu language
    Qasim, Muhammad
    Habib, Tania
    Urooj, Saba
    Mumtaz, Benazir
    SPEECH COMMUNICATION, 2023, 148 : 40 - 52
  • [3] Building a Speech and Text Corpus of Turkish: Large Corpus Collection with Initial Speech Recognition Results
    Polat, Huseyin
    Oyucu, Saadin
    SYMMETRY-BASEL, 2020, 12 (02):
  • [4] Speech emotion recognition for the Urdu language
    Zaheer, Nimra
    Ahmad, Obaid Ullah
    Shabbir, Mudassir
    Raza, Agha Ali
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 915 - 944
  • [5] A Speech Recognition System for Urdu Language
    Beg, Azam
    Hasnain, S. K.
    WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, 2008, 20 : 118 - +
  • [6] Design and development of phonetically rich Urdu speech corpus
    Raza, Agha Ali
    Hussain, Sarmad
    Sarfraz, Huda
    Ullah, Inam
    Sarfraz, Zahid
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 38 - 43
  • [7] An open and free Speech Corpus for Speaker Recognition: The FSCSR Speech Corpus
    Bouziane, Ayoub
    Kadi, Houda
    Hourri, Soufiane
    Kharroubi, Jamal
    2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,
  • [8] Speech emotion recognition for the Urdu languageDataset and evaluation
    Nimra Zaheer
    Obaid Ullah Ahmad
    Mudassir Shabbir
    Agha Ali Raza
    Language Resources and Evaluation, 2023, 57 : 915 - 944
  • [9] Corpus for automatic speech recognition
    Adda-Decker, Martine
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2007, 12 (01): : 71 - 84
  • [10] A speech recognition and speech corpus system based on Matlab
    He, Q
    Zhang, YW
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 559 - 562