Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition

Authors
Masumura, Ryo [1]
Hahm, Seongjun [1]
Ito, Akinori [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 980, Japan
Keywords
Spontaneous speech recognition; language model; World Wide Web; large vocabulary continuous speech recognition; Corpus of Spontaneous Japanese;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a language modeling method for spontaneous speech recognition that uses large-scale spoken language data retrieved from the Web. We downloaded 15 million Web pages covering a comprehensive range of topics. Next, spoken language-like texts were selected from the downloaded Web data using a naive Bayes classifier, and typical linguistic phenomena such as fillers and pauses were added using simulation models. A language model trained on the generated data performed as well as one trained on a large-scale spontaneous speech corpus (the Corpus of Spontaneous Japanese, CSJ). Combining the generated data with CSJ improved word accuracy.
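
The two processing steps named in the abstract (selecting spoken-like text with a naive Bayes classifier, then adding fillers) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their features, training data, or simulation models. The code assumes a multinomial naive Bayes model with add-one smoothing over word tokens and a uniform random filler-insertion rule. All names (`train_nb`, `classify`, `insert_fillers`, `TOY_DATA`) and the toy English sentences are hypothetical.

```python
import math
import random
from collections import Counter

# Hypothetical toy training data: label -> list of tokenized sentences.
TOY_DATA = {
    "spoken": [["well", "i", "think", "so"],
               ["you", "know", "it", "is", "fine"]],
    "written": [["the", "results", "are", "shown"],
                ["this", "paper", "describes", "the", "method"]],
}


def train_nb(docs_by_label):
    """Fit a multinomial naive Bayes model with add-one smoothing."""
    vocab = {tok for docs in docs_by_label.values() for doc in docs for tok in doc}
    total_docs = sum(len(docs) for docs in docs_by_label.values())
    model = {}
    for label, docs in docs_by_label.items():
        counts = Counter(tok for doc in docs for tok in doc)
        # The extra +1 reserves probability mass for tokens outside the vocabulary.
        denom = sum(counts.values()) + len(vocab) + 1
        model[label] = {
            "log_prior": math.log(len(docs) / total_docs),
            "log_like": {tok: math.log((counts[tok] + 1) / denom) for tok in vocab},
            "log_unk": math.log(1 / denom),
        }
    return model


def classify(model, tokens):
    """Return the label with the highest log posterior for the token list."""
    def score(label):
        params = model[label]
        return params["log_prior"] + sum(
            params["log_like"].get(tok, params["log_unk"]) for tok in tokens
        )
    return max(model, key=score)


def insert_fillers(tokens, fillers, rate, rng):
    """Insert a randomly chosen filler before each token with probability `rate`.

    Assumption: a uniform, position-independent rule stands in for the
    unspecified simulation models mentioned in the abstract.
    """
    out = []
    for tok in tokens:
        if rng.random() < rate:
            out.append(rng.choice(fillers))
        out.append(tok)
    return out


if __name__ == "__main__":
    model = train_nb(TOY_DATA)
    sentence = ["i", "think", "it", "is", "fine"]
    if classify(model, sentence) == "spoken":
        # Keep only spoken-like sentences, then add fillers to them.
        print(insert_fillers(sentence, ["uh", "um"], 0.3, random.Random(0)))
```

In a realistic setting the classifier would be trained on transcribed speech versus written text, and the insertion rate and positions would be estimated from a spontaneous speech corpus rather than fixed by hand.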
Pages: 1476-1479
Page count: 4