Urdu Speech Corpus and Preliminary Results on Speech Recognition

被引:4
|
作者
Ali, Hazrat [1 ]
Ahmad, Nasir [2 ]
Hafeez, Abdul [2 ]
机构
[1] COMSATS Inst Informat Technol, Dept Elect Engn, Abbottabad, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
关键词
D O I
10.1007/978-3-319-44188-7_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language resources for Urdu language are not well developed. In this work, we summarize our work on the development of Urdu speech corpus for isolated words. The Corpus comprises of 250 isolated words of Urdu recorded by ten individuals. The speakers include both native and non-native, male and female individuals. The corpus can be used for both speech and speaker recognition tasks. We also report our results on automatic speech recognition task for the said corpus. The framework extracts Mel Frequency Cepstral Coefficients along with the velocity and acceleration coefficients, which are then fed to different classifiers to perform recognition task. The classifiers used are Support Vector Machines, Random Forest and Linear Discriminant Analysis. Experimental results show that the best results are provided by the Support Vector Machines with a test set accuracy of 73 %. The results reported in this work may provide a useful baseline for future research on automatic speech recognition of Urdu.
引用
收藏
页码:317 / 325
页数:9
相关论文
共 50 条
  • [31] STRESS ANNOTATED URDU SPEECH CORPUS TO BUILD FEMALE VOICE FOR TTS
    Mumtaz, Benazir
    Urooj, Saba
    Hussain, Sarmad
    Habib, Wajiha
    2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 13 - 20
  • [32] Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition
    Fukuda, Meiko
    Nishizaki, Hiromitsu
    Iribe, Yurie
    Nishimura, Ryota
    Kitaoka, Norihide
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6578 - 6585
  • [33] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [34] Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
    Zevallos, Rodolfo
    Camacho, Luis
    Melgarejo, Nelsi
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5029 - 5034
  • [35] MNASR: A FREE SPEECH CORPUS FOR MONGOLIAN SPEECH RECOGNITION AND ACCOMPANIED BASELINES
    Wu, Yihao
    Wang, Yonghe
    Zhang, Hui
    Bao, Feilong
    Gao, Guanglai
    2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,
  • [36] Towards a Continuous Speech Corpus for Banking Domain Automatic Speech Recognition
    Suciu, George
    Toma, Stefan-Adrian
    Cheyeresan, Romulus
    2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,
  • [37] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Adiga, Devaraja
    Kumar, Rishabh
    Krishna, Amrith
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    Goyal, Pawan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5039 - 5050
  • [38] Speech Command Recognition: Text-to-Speech and Speech Corpus Scraping Are All You Need
    Kuzdeuov, Askat
    Nurgaliyev, Shakhizat
    Turmakhan, Diana
    Laiyk, Nurkhan
    Varol, Huseyin Atakan
    2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 286 - 291
  • [39] Sentiment analysis with word-based Urdu speech recognition
    Shaik, Riyaz
    Venkatramaphanikumar, S.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2511 - 2531
  • [40] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139