Search and Classification Based Language Model Adaptation

Cited by: 0
Authors
Shi, Qin
Chu, Stephen M.
Liu, Wen
Kuo, Hong-Kwang
Liu, Yi
Qin, Yong
Keywords
LM Rescoring; Topic Adaptation; Broadcast Transcription;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Adaptation techniques in language modeling have shown growing potential for improving speech recognition performance. For topic adaptation, a set of pre-defined topic-specific language models is typically used, and adaptation is achieved by adjusting the interpolation weights. However, mismatch between the test data and the pre-defined models inevitably exists and is left untreated in the static approach. Instead of tuning the parameters of the existing models, this paper describes a method that dynamically extracts relevant documents from training sources according to intermediate decoding hypotheses to build new targeted language models. Unlike general search-based document collection, a new and effective ranking method is used here for candidate extraction. The targeted language models are interpolated with the static topic language models and a general language model, and used for lattice rescoring. The proposed adaptation technique is implemented in a state-of-the-art Mandarin broadcast transcription system and evaluated on the GALE task. We show that static topic adaptation yields a 4.9% relative reduction in character error rate. It is further shown that the proposed dynamic adaptation technique attains an additional 10.3% reduction in error rate.
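The interpolation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the interpolation form, so linear interpolation is assumed here. The model names (`general`, `topic`, `targeted`), the weights, and the per-word probabilities are hypothetical values chosen for illustration, not figures from the paper.

```python
import math

def interpolate(prob_by_model, weights):
    """Linearly interpolate one word's probability across several models.

    Assumption: linear interpolation with weights that sum to 1.
    """
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("interpolation weights must sum to 1")
    return sum(weights[name] * prob_by_model[name] for name in weights)

def hypothesis_log_prob(word_probs, weights):
    """Sum the log of the interpolated probability over each word of a hypothesis."""
    return sum(math.log(interpolate(p, weights)) for p in word_probs)

# Hypothetical per-word probabilities assigned by three language models
# to a two-word hypothesis (illustrative numbers only).
hypothesis = [
    {"general": 0.010, "topic": 0.020, "targeted": 0.050},
    {"general": 0.200, "topic": 0.100, "targeted": 0.300},
]

# Hypothetical interpolation weights; in practice these would be tuned
# on held-out data.
weights = {"general": 0.5, "topic": 0.2, "targeted": 0.3}

score = hypothesis_log_prob(hypothesis, weights)
print(f"interpolated log-probability: {score:.4f}")
```

In a rescoring setting, a score of this kind would be computed for each competing path in the lattice and combined with the acoustic score. That combination is outside the scope of this sketch.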
Pages: 1578-1581
Page count: 4