Unsupervised language model adaptation

被引:0
|
作者
Bacchiani, M [1 ]
Roark, B [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates unsupervised language model adaptation, from ASR transcripts. N-gram counts from these transcripts can be used either to adapt an existing n-gram model or to build an n-gram model from scratch. Various experimental results are reported on a particular domain adaptation task, namely building a customer care application starting from a general voicemail transcription system.. The experiments investigate the effectiveness of various adaptation strategies, including iterative adaptation and self-adaptation on the test data. They show an error rate reduction of 3.9% over the unadapted baseline performance, from 28% to 24.1%, using 17 hours of unsupervised adaptation material. This is 51% of the 7.7% adaptation gain obtained by supervised adaptation. Self-adaptation on the test data resulted in a 1.3% improvement over the baseline.
引用
收藏
页码:224 / 227
页数:4
相关论文
共 50 条
  • [21] Lattice-Based Risk Minimization Training for Unsupervised Language Model Adaptation
    Kobayashi, Akio
    Oku, Takahiro
    Homma, Shinichi
    Imai, Toru
    Nakagawa, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1464 - +
  • [22] Lattice-based risk minimization training for unsupervised language model adaptation
    Kobayashi, Akio
    Oku, Takahiro
    Homma, Shinichi
    Imai, Toru
    Nakagawa, Seiichi
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2011, : 1453 - 1456
  • [23] HANDLING UNCERTAIN OBSERVATIONS IN UNSUPERVISED TOPIC-MIXTURE LANGUAGE MODEL ADAPTATION
    Chuangsuwanich, Ekapol
    Watanabe, Shinji
    Hori, Takaaki
    Iwata, Tomoharu
    Glass, James
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5033 - 5036
  • [24] An unsupervised language model adaptation based on keyword clustering and query availability estimation
    Ito, Akinori
    Kajiura, Yasutomo
    Makino, Shozo
    Suzuki, Motoyuki
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1412 - 1418
  • [25] UNSUPERVISED LANGUAGE MODEL ADAPTATION USING LATENT DIRICHLET ALLOCATION AND DYNAMIC MARGINALS
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1480 - 1484
  • [26] Transcription-less Call Routing using Unsupervised Language Model Adaptation
    Duta, Nicolae
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1562 - 1565
  • [27] Improving Transfer Learning in Unsupervised Language Adaptation
    Rocha, Gil
    Cardoso, Henrique Lopes
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 588 - 599
  • [28] Good-turing estimation from word lattices for unsupervised language model adaptation
    Riley, M
    Roark, B
    Sproat, R
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 453 - 458
  • [29] Unsupervised language model adaptation via topic modeling based on named entity hypotheses
    Liu, Yang
    Liu, Feifan
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4921 - 4924
  • [30] Novel Weighting Scheme for Unsupervised Language Model Adaptation Using Latent Dirichlet Allocation
    Haidar, Md Akmal
    O'Shaughnessy, Douglas
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2438 - 2441