LANGUAGE MODEL ADAPTATION USING WWW DOCUMENTS OBTAINED BY UTTERANCE-BASED QUERIES

被引:3
|
作者
Tsiartas, Andreas [1 ]
Georgiou, Panayiotis [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Speech Anal & Interpretat Lab, Los Angeles, CA 90089 USA
关键词
Adapt language models; utterance queries; WWW corpora; in-domain documents;
D O I
10.1109/ICASSP.2010.5494928
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we consider the estimation of topic specific Language Models (LM) by exploiting documents from the World Wide Web (WWW). We focus on the quality of the generated queries and propose a novel query generation method. In contrast to the n-gram based queries used in past works, our approach relies on utterances as queries candidates. The proposed approach does not rely on any language specific information other than the initial in-domain training text. We have conducted experiments with Web texts of size 0-150 million words, and we have shown that despite not using any language specific information, the proposed approach results in up to 1.1% absolute Word Error Rate (WER) improvement as compared to keyword-based approaches. The proposed approach reduces the WER by 6.3% absolute in our experiments, compared to an in-domain LM without considering any Web data.
引用
收藏
页码:5406 / 5409
页数:4
相关论文
共 50 条
  • [1] Improving Spoken Document Retrieval by. Unsupervised Language Model Adaptation Using Utterance-based Web Search
    Herms, Robert
    Ritter, Marc
    Wilhelm-Stein, Thomas
    Eibl, Maximilian
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1430 - 1433
  • [2] TOWARDS UTTERANCE-BASED NEURAL NETWORK ADAPTATION IN ACOUSTIC MODELING
    Himawan, Ivan
    Motlicek, Petr
    Font, Marc Ferras
    Madikeri, Srikanth
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 289 - 295
  • [3] Utterance-based Speech Dereverberation using Blind Channel Estimation and Multichannel Equalization
    Haque, Mohammad Ariful
    [J]. 2014 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2014, : 274 - 277
  • [4] Unsupervised language model adaptation based on automatic text collection from WWW
    Suzuki, Motoyuki
    Kajiura, Yasutomo
    Ito, Akinori
    Makino, Shozo
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2202 - 2205
  • [5] Enhancing Children's Short Utterance-Based ASV Using Inverse Gamma-tone Filtered Cepstral coefficients
    Aziz, Shahid
    Shahnawazuddin, S.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (05) : 3020 - 3041
  • [6] A framework for retrieving Arabic documents based on queries written in Arabic slang language
    Shatnawi, Mohammed Q.
    Yassein, Muneer Bani
    Mahafza, Reem
    [J]. JOURNAL OF INFORMATION SCIENCE, 2012, 38 (04) : 350 - 365
  • [7] Enhancing Children’s Short Utterance-Based ASV Using Inverse Gamma-tone Filtered Cepstral coefficients
    Shahid Aziz
    S. Shahnawazuddin
    [J]. Circuits, Systems, and Signal Processing, 2024, 43 : 3020 - 3041
  • [8] ANALYZING DEEP CNN-BASED UTTERANCE EMBEDDINGS FOR ACOUSTIC MODEL ADAPTATION
    Rownicka, Joanna
    Bell, Peter
    Renals, Steve
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 235 - 241
  • [9] Language model-based retrieval for Farsi documents
    Taghva, K
    Coombs, J
    Pareda, R
    Nartker, T
    [J]. ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 13 - 17
  • [10] LANGUAGE MODEL ADAPTATION USING RANDOM FORESTS
    Deoras, Anoop
    Jelinek, Frederick
    Su, Yi
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5198 - 5201