Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition

Cited by: 0

Authors:

Romanenko, Aleksei [1,2]

Mendelev, Valentin [1,2]

Affiliations:

[1] ITMO Univ, St Petersburg, Russia

[2] Speech Technol Ctr Ltd, St Petersburg, Russia

Source:

SPEECH AND COMPUTER | 2016 / Vol. 9811

Keywords:

Arabic language; Keyword search; Low resources;

DOI:

10.1007/978-3-319-43958-7_75

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, several ways to improve a speech recognition system for the Egyptian dialect of Arabic are presented. The research is based on the CALLHOME Egyptian Arabic corpus. We demonstrate the contribution of speaker-dependent bottleneck features trained on other languages and verify that a small Modern Standard Arabic (MSA) corpus can be used to derive phonetic transcriptions. The resulting systems compare favorably with previously published results.

Pages: 620-626

Page count: 7
