Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu Isolated Words

被引：3

作者：

Ali, Hazrat ^{[1
,5
]}

Ahmad, Nasir ^{[2
]}

Zhou, Xianwei ^{[1
]}

Ali, Muhammad ^{[3
]}

Manjotho, Ali Asghar ^{[4
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Commun Engn, Beijing 10083, Peoples R China

[2] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan

[3] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA

[4] Mehran Univ Engn & Technol, Dept Comp Syst Engn, Jamshoro, Pakistan

[5] City Univ London, Sch Informat, Machine Learning Grp, London EC1V 0HB, England

来源：

COMMUNICATION TECHNOLOGIES, INFORMATION SECURITY AND SUSTAINABLE DEVELOPMENT | 2014年 / 414卷

关键词：

Urdu automatic speech recognition; Mel frequency cepstral coefficients; Linear Discriminant Analysis; Isolated words recognition;

D O I：

10.1007/978-3-319-10987-9_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Urdu is amongst the five largest languages of the world and enjoys extreme importance by sharing its vocabulary with several other languages of the Indo-Pak. However, there has not been any significant research in the area of Automatic Speech Recognition of Urdu. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. For each isolated word, 52 Mel Frequency Cepstral Coefficients have been extracted and based upon these coefficients; the classification has been achieved using Linear Discriminant Analysis. As a prototype, the system has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages while the testing has been done using audio samples of three speakers. It was determined that majority of words exhibit a percentage error of less than 33 %. Words with 100 % error were declared to be bad words. The work reported in this paper may serve as a strong baseline for future research work on Urdu ASR, especially for continuous speech recognition of Urdu.

引用

页码：24 / 34

页数：11

共 50 条

[1] Automatic speech recognition of Urdu words using linear discriminant analysis
Ali, Hazrat
Ahmad, Nasir
Zhou, Xianwei
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (05) : 2369 - 2375
[2] Robustness of linear discriminant analysis in automatic speech recognition
Katz, M
Meier, HG
Dolfing, H
Klakow, D
[J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 371 - 374
[3] Comparison of Linear Discriminant Analysis Approaches in Automatic Speech Recognition
Jakovljevic, N.
Miskovic, D.
Janev, M.
Secujski, M.
Delic, V.
[J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (07) : 76 - 79
[4] Automatic Speech Recognition of Isolated Words in Hindi Language
Wani, Priyanka
Bormane, D. S.
Patil, U. G.
Shirbahadurkar, S. D.
[J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
[5] DWT features performance analysis for automatic speech recognition of Urdu
Ali, Hazrat
Ahmad, Nasir
Zhou, Xianwei
Iqbal, Khalid
Ali, Sahibzada Muhammad
[J]. SPRINGERPLUS, 2014, 3 : 1 - 10
[6] Modified Linear Discriminant Analysis for speech recognition
Li, Xiao-Bing
O'Shaughnessy, Douglas
[J]. 2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1598 - 1601
[7] The Development of Isolated Words Pashto Automatic Speech Recognition System
Ahmed, Irfan
Ahmad, Nasir
Ali, Hazrat
Ahmad, Gulzar
[J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 333 - 336
[8] Automatic digital modulation recognition based on robust linear discriminant analysis
Shen, Wei-guo
Wang, Wei
[J]. WIRELESS COMMUNICATION AND SENSOR NETWORK, 2016, : 77 - 84
[9] On speech recognition of isolated words
Teh, CC
Jong, CC
Siek, L
[J]. ISIC-99: 8TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, DEVICES & SYSTEMS, PROCEEDINGS, 1999, : 431 - 434
[10] Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition
Zhang, B
Matsoukas, S
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 925 - 928

← 1 2 3 4 5 →