Search and Classification Based Language Model Adaptation

被引:0
|
作者
Shi, Qin
Chu, Stephen M.
Liu, Wen
Kuo, Hong-Kwang
Liu, Yi
Qin, Yong
机构
关键词
LM Rescoring; Topic Adaptation; Broadcast Transcription;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adaptation techniques in language modeling have shown growing potentials in improving speech recognition performance. For topic adaptation, a set of pre-defined topic-specific language models are typically used, and adaptation is achieved through adjusting the interpolation weights. However, mismatch between the test data and the pre-defined models inevitably exists and is left untreated in the static approach. Instead of tuning the parameters in the existing models, this paper describes a method that dynamically extracts relevant documents from training sources according to intermediate decoding hypotheses to build new targeted language models. Different from general search-based document collection, a new and effective ranking method is used here for candidate extraction. The targeted language models are interpolated with the static topic language models and a general language model, and used for lattice rescoring. The proposed adaptation technique is implemented in a state-of-the-art Mandarin broadcast transcription system, and evaluated on the GALE task. We show that static topic adaptation reduces the relative character error rate by 4.9%. It is further shown that the proposed dynamic adaptation technique attains an additional 10.3% reduction in error rate.
引用
收藏
页码:1578 / 1581
页数:4
相关论文
共 50 条
  • [41] 3D Model Classification Based on Neural Architecture Search
    Zhou, Peng
    Yang, Jun
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (05): : 722 - 733
  • [42] In search of a unified model of language contact
    Winford, Donald
    BILINGUALISM-LANGUAGE AND COGNITION, 2013, 16 (04) : 734 - 736
  • [43] An article language model for BBS search
    Xu, JF
    Zhu, YB
    Li, X
    WEB ENGINEERING, PROCEEDINGS, 2005, 3579 : 152 - 160
  • [44] Language Model Supervision for Handwriting Recognition Model Adaptation
    Tensmeyer, Chris
    Wigington, Curtis
    Davis, Brian
    Stewart, Seth
    Martinez, Tony
    Barrett, William
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 133 - 138
  • [45] Language model adaptation and confidence measure for robust language identification
    Chen, YN
    Liu, J
    INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 270 - 273
  • [46] Lattice-based risk minimization training for unsupervised language model adaptation
    Kobayashi, Akio
    Oku, Takahiro
    Homma, Shinichi
    Imai, Toru
    Nakagawa, Seiichi
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2011, : 1453 - 1456
  • [47] Unsupervised language model adaptation based on automatic text collection from WWW
    Suzuki, Motoyuki
    Kajiura, Yasutomo
    Ito, Akinori
    Makino, Shozo
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2202 - 2205
  • [48] Unsupervised class-based language model adaptation for spontaneous speech recognition
    Yokoyama, T
    Shinozaki, T
    Iwano, K
    Furui, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 236 - 239
  • [49] PLSA-based Topic Detection in Meetings for Adaptation of Lexicon and Language Model
    Akita, Yuya
    Nemoto, Yusuke
    Kawahara, Tatsuya
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1321 - 1324
  • [50] Personalized Ranking Model Adaptation for Web Search
    Wang, Hongning
    He, Xiaodong
    Chang, Ming-Wei
    Song, Yang
    White, Ryen W.
    Chu, Wei
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 323 - 332