Language Modeling for Mixed Language Speech Recognition using Weighted Phrase Extraction

被引：0

作者：

Li, Ying ^{[1
]}

Fung, Pascale ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Hong Kong, Peoples R China

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

mixed language; language model; code switching;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To train a code switching language model for mixed language speech recognition, we propose to assign weights to the sentence pairs in the parallel text data. The code switching language model which is composed of the code switching boundary prediction model, code switching translation model and reconstruction model is incorporated with a language for mixed language speech recognition. The code switching translation model which is trained using selected subsets of the sentence pairs in the parallel text data allows the decoder to make the decision whether a phrase is in the matrix language or in the embedded language. Moreover, we propose a weighting procedure while training the code switching translation model. We evaluate our methods on Mandarin-English code switching lecture speech and lunch conversations. Our proposed method reduces word error rate by a statistically significant 1.74% on the lecture speech, and by 1.29% on the lunch conversation over the conventional interpolated language model.

引用

页码：2598 / 2602

页数：5

共 50 条

[1] RELEVANCE LANGUAGE MODELING FOR SPEECH RECOGNITION
Chen, Kuan-Yu
Chen, Berlin
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5568 - 5571
[2] An evaluation of statistical language modeling for speech recognition using a mixed category of both words and parts-of-speech
Wakita, Y
Kawai, J
Iida, H
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 530 - 533
[3] Continuous Speech Recognition of Kannada Language using Triphone Modeling
Sajjan, Sharada C.
Vijaya, C.
[J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 451 - 455
[4] Using morphemes in language modeling and automatic speech recognition of Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
[5] Towards mixed language speech recognition systems
Imseng, David
Bourlard, Herve
Magimai-Doss, Mathew
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 278 - 281
[6] Features extraction, modeling and training strategies in continuous speech recognition for Romanian language
Dumitru, CO
Gavat, I
[J]. Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 1425 - 1428
[7] Joint acoustic and language modeling for speech recognition
Chien, Jen-Tzung
Chueh, Chuang-Hua
[J]. SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
[8] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
Chueh, Chuang-Hua
Chien, Jen-Tzung
[J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196
[9] Language Modeling for Speech Recognition of Spoken Cantonese
Yeung, Yu Ting
Cao, Houwei
Zheng, N. H.
Lee, Tan
Ching, P. C.
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
[10] Latent semantic language modeling for speech recognition
Bellegarda, JR
[J]. MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103

← 1 2 3 4 5 →