Search and Classification Based Language Model Adaptation

被引:0
|
作者
Shi, Qin
Chu, Stephen M.
Liu, Wen
Kuo, Hong-Kwang
Liu, Yi
Qin, Yong
机构
关键词
LM Rescoring; Topic Adaptation; Broadcast Transcription;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adaptation techniques in language modeling have shown growing potentials in improving speech recognition performance. For topic adaptation, a set of pre-defined topic-specific language models are typically used, and adaptation is achieved through adjusting the interpolation weights. However, mismatch between the test data and the pre-defined models inevitably exists and is left untreated in the static approach. Instead of tuning the parameters in the existing models, this paper describes a method that dynamically extracts relevant documents from training sources according to intermediate decoding hypotheses to build new targeted language models. Different from general search-based document collection, a new and effective ranking method is used here for candidate extraction. The targeted language models are interpolated with the static topic language models and a general language model, and used for lattice rescoring. The proposed adaptation technique is implemented in a state-of-the-art Mandarin broadcast transcription system, and evaluated on the GALE task. We show that static topic adaptation reduces the relative character error rate by 4.9%. It is further shown that the proposed dynamic adaptation technique attains an additional 10.3% reduction in error rate.
引用
收藏
页码:1578 / 1581
页数:4
相关论文
共 50 条
  • [31] Unsupervised Domain Adaptation Classification Model Based on Generative Adversarial Network
    Wang G.-G.
    Guo T.
    Yu Y.
    Su H.
    Guo, Tao (tguo@sicnu.edu.cn), 1600, Chinese Institute of Electronics (48): : 1190 - 1197
  • [32] Adaptation of motor imagery EEG classification model based on tensor decomposition
    Li, Xinyang
    Guan, Cuntai
    Zhang, Haihong
    Ang, Kai Keng
    Ong, Sim Heng
    JOURNAL OF NEURAL ENGINEERING, 2014, 11 (05)
  • [33] Visual Comparison of Language Model Adaptation
    Sevastjanova R.
    Cakmak E.
    Ravfogel S.
    Cotterell R.
    El-Assady M.
    IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 1178 - 1188
  • [34] Data augmentation and language model adaptation
    Janiszek, D
    De Mori, R
    Bechet, E
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 549 - 552
  • [35] Model adaptation for spoken language understanding
    Tur, G
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 41 - 44
  • [36] Context Dependent Language Model Adaptation
    Liu, X.
    Gales, M. J. F.
    Woodland, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 837 - 840
  • [37] Pre-trained Language Model based Ranking in Baidu Search
    Zou, Lixin
    Zhang, Shengqiang
    Cai, Hengyi
    Ma, Dehong
    Cheng, Suqi
    Wang, Shuaiqiang
    Shi, Daiting
    Cheng, Zhicong
    Yin, Dawei
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4014 - 4022
  • [38] Entity-Based Language Model Smoothing Approach for Smart Search
    Zhao, Feng
    Tian, Zeliang
    Jin, Hai
    IEEE ACCESS, 2018, 6 : 9991 - 10002
  • [39] LANGUAGE PERSON SEARCH WITH MUTUALLY CONNECTED CLASSIFICATION LOSS
    Wang, Yuyu
    Bo, Chunjuan
    Wang, Dong
    Wang, Shuang
    Qi, Yunwei
    Lu, Huchuan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2057 - 2061
  • [40] Malicious URL Classification Model Based on Improved Sparrow Search Algorithm
    Ma, Yiran
    Guan, Qihang
    Guo, Fengyuan
    Zhang, Guidong
    PROCEEDINGS OF 2021 IEEE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2021), 2021, : 21 - 25