Urdu Speech Corpus and Preliminary Results on Speech Recognition

Cited by: 4

Authors:

Ali, Hazrat [1]

Ahmad, Nasir [2]

Hafeez, Abdul [2]

Affiliations:

[1] COMSATS Inst Informat Technol, Dept Elect Engn, Abbottabad, Pakistan

[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan

Source:

ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2016 | 2016 / Volume 629

DOI:

10.1007/978-3-319-44188-7_24

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Language resources for the Urdu language are not well developed. In this work, we summarize our work on the development of an Urdu speech corpus for isolated words. The corpus comprises 250 isolated Urdu words recorded by ten individuals. The speakers include both native and non-native, male and female individuals. The corpus can be used for both speech and speaker recognition tasks. We also report our results on the automatic speech recognition task for this corpus. The framework extracts Mel Frequency Cepstral Coefficients along with the velocity and acceleration coefficients, which are then fed to different classifiers to perform the recognition task. The classifiers used are Support Vector Machines, Random Forest, and Linear Discriminant Analysis. Experimental results show that the best results are provided by the Support Vector Machines, with a test set accuracy of 73%. The results reported in this work may provide a useful baseline for future research on automatic speech recognition of Urdu.
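
The feature stacking mentioned in the abstract (MFCCs plus velocity and acceleration coefficients) can be illustrated with a minimal sketch. The abstract does not say how the velocity and acceleration terms are computed, so this example assumes the commonly used regression-based delta formula with a window of 2 frames and edge frames replicated. The names `delta` and `stack_features`, the window size, and the padding choice are all hypothetical and are not taken from the paper. The input is assumed to be a list of per-frame MFCC vectors already produced by some front end.

```python
from typing import List

Frame = List[float]


def delta(frames: List[Frame], window: int = 2) -> List[Frame]:
    """Regression-based time derivative of a feature sequence.

    Assumed formula (not confirmed by the source):
        d_t = sum_{n=1..N} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..N} n^2)
    Frames beyond either end are replaced by the nearest edge frame.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    denom = 2.0 * sum(n * n for n in range(1, window + 1))
    out: List[Frame] = []
    for t in range(n_frames):
        row: Frame = []
        for k in range(n_coeffs):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = frames[min(t + n, n_frames - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_features(mfcc: List[Frame]) -> List[Frame]:
    """Concatenate static, velocity, and acceleration coefficients per frame."""
    velocity = delta(mfcc)
    acceleration = delta(velocity)  # delta of the delta
    return [c + v + a for c, v, a in zip(mfcc, velocity, acceleration)]
```

With 13 static coefficients per frame, `stack_features` would yield 39 values per frame. How the paper turns variable-length frame sequences into fixed-size classifier inputs is not stated in the abstract, so that step is left out here.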

Pages: 317-325

Page count: 9

[31] STRESS ANNOTATED URDU SPEECH CORPUS TO BUILD FEMALE VOICE FOR TTS
Mumtaz, Benazir
Urooj, Saba
Hussain, Sarmad
Habib, Wajiha
2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 13 - 20
[32] Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition
Fukuda, Meiko
Nishizaki, Hiromitsu
Iribe, Yurie
Nishimura, Ryota
Kitaoka, Norihide
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6578 - 6585
[33] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Adiga, Devaraja
Kumar, Rishabh
Krishna, Amrith
Jyothi, Preethi
Ramakrishnan, Ganesh
Goyal, Pawan
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021, : 5039 - 5050
[34] Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
Zevallos, Rodolfo
Camacho, Luis
Melgarejo, Nelsi
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5029 - 5034
[35] MNASR: A FREE SPEECH CORPUS FOR MONGOLIAN SPEECH RECOGNITION AND ACCOMPANIED BASELINES
Wu, Yihao
Wang, Yonghe
Zhang, Hui
Bao, Feilong
Gao, Guanglai
2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,
[36] Towards a Continuous Speech Corpus for Banking Domain Automatic Speech Recognition
Suciu, George
Toma, Stefan-Adrian
Cheyeresan, Romulus
2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,
[37] Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
Adiga, Devaraja
Kumar, Rishabh
Krishna, Amrith
Jyothi, Preethi
Ramakrishnan, Ganesh
Goyal, Pawan
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 5039 - 5050
[38] Speech Command Recognition: Text-to-Speech and Speech Corpus Scraping Are All You Need
Kuzdeuov, Askat
Nurgaliyev, Shakhizat
Turmakhan, Diana
Laiyk, Nurkhan
Varol, Huseyin Atakan
2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 286 - 291
[39] Sentiment analysis with word-based Urdu speech recognition
Shaik, Riyaz
Venkatramaphanikumar, S.
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2511 - 2531
[40] Automatic Urdu Speech Recognition Using Hidden Markov Model
Asadullah
Shaukat, Arslan
Ali, Hazrat
Akram, Usman
2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139