Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition

被引：0

作者：

Masumura, Ryo ^{[1
]}

Hahm, Seongjun ^{[1
]}

Ito, Akinori ^{[1
]}

机构：

[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 980, Japan

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

Spontaneous speech recognition; language model; World Wide Web; large vocabulary continuous speech recognition; Corpus of Spontaneous Japanese;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a language modeling method using large-scale spoken language data retrieved from the Web for spontaneous speech recognition. We downloaded 15 million Web pages on a comprehensive range topics. Next, spoken language-like texts were selected from the downloaded Web data using the naive Bayes classifier, and typical linguistic phenomena such as fillers and pauses were added using simulation models. A language model trained by the generated data gave as high performance as the large-scale spontaneous speech corpus (Corpus of Spontaneous Japanese, CSJ). By combining the generated data and CSJ, we improved word accuracy.

引用

页码：1476 / 1479

页数：4

共 50 条

[21] A large vocabulary continuous speech recognition system for Persian language
Hossein Sameti
Hadi Veisi
Mohammad Bahrani
Bagher Babaali
Khosro Hosseinzadeh
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
[22] A multispan language modeling framework for large vocabulary speech recognition
Bellegarda, JR
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 456 - 467
[23] LARGE-VOCABULARY SPEECH RECOGNITION - A SYSTEM FOR THE ITALIAN LANGUAGE
DORTA, P
FERRETTI, M
MARTELLI, A
MELECRINIS, S
SCARCI, S
VOLPI, G
[J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1988, 32 (02) : 217 - 226
[24] Connectionist language modeling for large vocabulary continuous speech recognition
Schwenk, H
Gauvain, JL
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 765 - 768
[25] Frame discrimination training of HMMs for large vocabulary speech recognition
Povey, D
Woodland, PC
[J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 333 - 336
[26] Probabilistic Latent Speaker Training for Large Vocabulary Speech Recognition
Su, Dan
Wu, Xihong
Chi, Huisheng
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1225 - 1228
[27] Speaker selection training for large vocabulary continuous speech recognition
Huang, C
Chen, T
Chang, E
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 609 - 612
[28] Large vocabulary Russian speech recognition using syntactico-statistical language modeling
Karpov, Alexey
Markov, Konstantin
Kipyatkova, Irina
Vazhenina, Dania
Ronzhin, Andrey
[J]. SPEECH COMMUNICATION, 2014, 56 : 213 - 228
[29] Discriminative training for large-vocabulary speech recognition using minimum classification error
McDermott, Erik
Hazen, Timothy J.
Le Roux, Jonathan
Nakamura, Atsushi
Katagiri, Shigeru
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 203 - 223
[30] Analysis and recognition of spontaneous speech using Corpus of Spontaneous Japanese
Furui, S
Nakamura, M
Ichiba, T
Iwano, K
[J]. SPEECH COMMUNICATION, 2005, 47 (1-2) : 208 - 219

← 1 2 3 4 5 →