Unsupervised language model adaptation

被引:0
|
作者
Bacchiani, M [1 ]
Roark, B [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates unsupervised language model adaptation, from ASR transcripts. N-gram counts from these transcripts can be used either to adapt an existing n-gram model or to build an n-gram model from scratch. Various experimental results are reported on a particular domain adaptation task, namely building a customer care application starting from a general voicemail transcription system.. The experiments investigate the effectiveness of various adaptation strategies, including iterative adaptation and self-adaptation on the test data. They show an error rate reduction of 3.9% over the unadapted baseline performance, from 28% to 24.1%, using 17 hours of unsupervised adaptation material. This is 51% of the 7.7% adaptation gain obtained by supervised adaptation. Self-adaptation on the test data resulted in a 1.3% improvement over the baseline.
引用
收藏
页码:224 / 227
页数:4
相关论文
共 50 条
  • [31] UNSUPERVISED CV LANGUAGE MODEL ADAPTATION BASED ON DIRECT LIKELIHOOD MAXIMIZATION SENTENCE SELECTION
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5029 - 5032
  • [32] Unsupervised Cross-Adaptation Using Language Model and Deep Learning Based Acoustic Model Adaptations
    Takagi, Akira
    Konno, Kazuki
    Kato, Masaharu
    Kosaka, Tetsuo
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [33] Unsupervised Adaptation of Recurrent Neural Network Language Models
    Gangireddy, Siva Reddy
    Swietojanski, Pawel
    Bell, Peter
    Renals, Steve
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2333 - 2337
  • [34] Unsupervised crosslingual adaptation of tokenisers for spoken language recognition
    Ng, Raymond W. M.
    Nicolao, Mauro
    Hain, Thomas
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 327 - 342
  • [35] Unsupervised Domain Adaptation of Language Models for Reading Comprehension
    Nishida, Kosuke
    Nishida, Kyosuke
    Saito, Itsumi
    Asano, Hisako
    Tomita, Junji
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5392 - 5399
  • [36] UDALM: Unsupervised Domain Adaptation through Language Modeling
    Karouzos, Constantinos
    Paraskevopoulos, Georgios
    Potamianos, Alexandros
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2579 - 2590
  • [37] Unsupervised model adaptation for speaker verification
    Preti, Alexandre
    Bonastre, Jean-Francois
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2090 - 2093
  • [38] MODEL UNCERTAINTY FOR UNSUPERVISED DOMAIN ADAPTATION
    Lee, JoonHo
    Lee, Gyemin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1841 - 1845
  • [39] Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
    Ito, Akinori
    Kajiura, Yasutomo
    Suzuki, Motoyuki
    Makino, Shozo
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [40] Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition
    Akinori Ito
    Yasutomo Kajiura
    Motoyuki Suzuki
    Shozo Makino
    EURASIP Journal on Audio, Speech, and Music Processing, 2009