Urdu Speech Corpus and Preliminary Results on Speech Recognition

被引:4
|
作者
Ali, Hazrat [1 ]
Ahmad, Nasir [2 ]
Hafeez, Abdul [2 ]
机构
[1] COMSATS Inst Informat Technol, Dept Elect Engn, Abbottabad, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
D O I
10.1007/978-3-319-44188-7_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language resources for Urdu language are not well developed. In this work, we summarize our work on the development of Urdu speech corpus for isolated words. The Corpus comprises of 250 isolated words of Urdu recorded by ten individuals. The speakers include both native and non-native, male and female individuals. The corpus can be used for both speech and speaker recognition tasks. We also report our results on automatic speech recognition task for the said corpus. The framework extracts Mel Frequency Cepstral Coefficients along with the velocity and acceleration coefficients, which are then fed to different classifiers to perform recognition task. The classifiers used are Support Vector Machines, Random Forest and Linear Discriminant Analysis. Experimental results show that the best results are provided by the Support Vector Machines with a test set accuracy of 73 %. The results reported in this work may provide a useful baseline for future research on automatic speech recognition of Urdu.
引用
收藏
页码:317 / 325
页数:9
相关论文
共 50 条
  • [41] DWT features performance analysis for automatic speech recognition of Urdu
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    Iqbal, Khalid
    Ali, Sahibzada Muhammad
    SPRINGERPLUS, 2014, 3 : 1 - 10
  • [42] Sentiment analysis with word-based Urdu speech recognition
    Riyaz Shaik
    S. Venkatramaphanikumar
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2511 - 2531
  • [43] A MULTI-GENRE URDU BROADCAST SPEECH RECOGNITION SYSTEM
    Khan, Erbaz
    Rauf, Sahar
    Adeeba, Farah
    Hussain, Sarmad
    2021 24TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2021, : 25 - 30
  • [44] BembaSpeech: A Speech Recognition Corpus for the Bemba Language
    Sikasote, Claytone
    Anastasopoulos, Antonios
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 7277 - 7283
  • [45] Multilingual Speech Recognition with Corpus Relatedness Sampling
    Li, Xinjian
    Dalmia, Siddharth
    Black, Alan W.
    Metze, Florian
    INTERSPEECH 2019, 2019, : 2120 - 2124
  • [46] Arabic corpus Implementation: Application to Speech Recognition
    Helali, Wafa
    Hajaiej, Zied
    Cherif, Adnane
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 50 - 53
  • [47] A Cross-Corpus Recognition of Emotional Speech
    Xiao, Zhongzhe
    Wu, Di
    Zhang, Xiaojun
    Tao, Zhi
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 42 - 46
  • [48] Construction of a Corpus for Elderly Japanese Speech Recognition
    Fukuda, Meiko
    Nishimura, Ryota
    Kitaoka, Norihide
    Nishizaki, Hiromitsu
    Iribe, Yurie
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 687 - 688
  • [49] Challenging the Boundaries of Speech Recognition: The MALACH Corpus
    Picheny, Michael
    Tuske, Zoltan
    Kingsbury, Brian
    Audhkhasi, Kartik
    Cui, Xiaodong
    Saon, George
    INTERSPEECH 2019, 2019, : 326 - 330
  • [50] Multimodal English corpus for automatic speech recognition
    Kunka, Bartosz
    Kupryjanow, Adam
    Dalka, Piotr
    Bratoszewski, Piotr
    Szczodrak, Maciej
    Spaleniak, Pawel
    Szykulski, Marcin
    Czyzewski, Andrzej
    2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111