Urdu Speech Corpus and Preliminary Results on Speech Recognition

Cited by: 4

Authors:

Ali, Hazrat [1]

Ahmad, Nasir [2]

Hafeez, Abdul [2]

Affiliations:

[1] COMSATS Inst Informat Technol, Dept Elect Engn, Abbottabad, Pakistan

[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan

Source:

ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2016 | 2016, Vol. 629

DOI:

10.1007/978-3-319-44188-7_24

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Language resources for Urdu are not well developed. In this work, we summarize our development of an Urdu speech corpus for isolated words. The corpus comprises 250 isolated Urdu words recorded by ten individuals. The speakers include both native and non-native speakers, male and female. The corpus can be used for both speech and speaker recognition tasks. We also report our results on an automatic speech recognition task for this corpus. The framework extracts Mel Frequency Cepstral Coefficients along with the velocity and acceleration coefficients, which are then fed to different classifiers to perform the recognition task. The classifiers used are Support Vector Machines, Random Forest, and Linear Discriminant Analysis. Experimental results show that Support Vector Machines give the best results, with a test set accuracy of 73%. The results reported in this work may provide a useful baseline for future research on automatic speech recognition of Urdu.
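
The velocity and acceleration coefficients mentioned in the abstract are commonly computed as regression-based first and second differences of the static cepstral coefficients. The following is a minimal sketch of that common formulation, not the authors' implementation: the function names `delta` and `stack_features`, the window `width=2`, and the edge handling (repeating the first and last frame) are all assumptions made for illustration, and the static coefficients are assumed to be already extracted.

```python
def delta(frames, width=2):
    """Regression-based delta features (assumed formulation, not from the paper).

    frames: list of equal-length lists of floats, one list per frame.
    width:  frames on each side used in the regression (assumed value).
    Edge frames are handled by repeating the first/last frame.
    """
    if not frames:
        return []
    num_frames = len(frames)
    dim = len(frames[0])
    # Normalizer of the standard regression formula: 2 * sum(n^2).
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(num_frames):
        row = []
        for k in range(dim):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, num_frames - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_features(static):
    """Concatenate static, velocity, and acceleration coefficients per frame."""
    velocity = delta(static)
    acceleration = delta(velocity)
    return [s + v + a for s, v, a in zip(static, velocity, acceleration)]
```

Under these assumptions, each frame's feature vector is three times the length of its static vector. The abstract does not say how per-frame vectors are turned into a fixed-length input for the classifiers, so that step is left out of the sketch.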

Pages: 317-325

Page count: 9
