Language model expansion using webdata for spoken document retrieval

Authors
Masumura, Ryo [1]
Hahm, Seongjun [1]
Ito, Akinori [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Sendai, Miyagi 980, Japan
Keywords
Spoken document retrieval; statistical language models; World Wide Web
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, there has been increasing demand for ad hoc retrieval of spoken documents. Existing text retrieval methods can be applied by transcribing spoken documents into text with a large vocabulary continuous speech recognizer (LVCSR). However, retrieval performance degrades severely because of recognition errors and out-of-vocabulary (OOV) words. To address these problems, we previously proposed a document expansion method that compensates for deficiencies in the transcription by using text data downloaded from the Web. In this paper, we introduce two improvements to that document expansion framework. First, we use a large-scale sample database of Web data as the source of relevant documents, thus avoiding the bias introduced by keyword selection in the existing methods. Second, we use a document retrieval method based on a statistical language model (SLM), a popular framework in information retrieval, and propose a new smoothing method that accounts for recognition errors and missing keywords. Retrieval experiments show that the proposed methods yield good results.
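The idea of SLM-based retrieval combined with document expansion can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the proposed smoothing method, so standard Jelinek-Mercer (linear interpolation) smoothing is swapped in here, and the function names, the toy transcripts, the toy Web sample, and the weights of 0.5 are all assumptions made for illustration. Each transcript is expanded with the Web document most likely to have generated it, and documents are then ranked by query likelihood.

```python
import math
from collections import Counter


def unigram_lm(tokens):
    """Maximum-likelihood unigram language model of a token list."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def mix(lm_a, lm_b, weight_b):
    """Linear interpolation of two unigram models (result still sums to 1)."""
    vocab = set(lm_a) | set(lm_b)
    return {w: (1 - weight_b) * lm_a.get(w, 0.0) + weight_b * lm_b.get(w, 0.0)
            for w in vocab}


def log_likelihood(words, doc_lm, collection_lm, lam=0.5, floor=1e-12):
    """Log-probability of `words` under a Jelinek-Mercer smoothed document model.

    Assumption: Jelinek-Mercer smoothing stands in for the paper's own
    smoothing method, which the abstract does not describe.
    """
    total = 0.0
    for w in words:
        p = (1 - lam) * doc_lm.get(w, 0.0) + lam * collection_lm.get(w, 0.0)
        total += math.log(max(p, floor))
    return total


# Hypothetical toy data: "whether" imitates a recognition error for "weather".
transcripts = {
    "talk_a": "the whether forecast for sendai".split(),
    "talk_b": "stock market news for today".split(),
}
web_sample = [
    "weather forecast sendai rain tomorrow".split(),
    "stock market prices fell today".split(),
]

# Background model estimated from all available text.
all_tokens = [t for doc in transcripts.values() for t in doc]
all_tokens += [t for doc in web_sample for t in doc]
collection_lm = unigram_lm(all_tokens)

web_lms = [unigram_lm(doc) for doc in web_sample]
plain = {name: unigram_lm(tokens) for name, tokens in transcripts.items()}

# Document expansion: mix each transcript model with the Web document
# model that assigns the transcript the highest likelihood.
expanded = {}
for name, tokens in transcripts.items():
    best_web_lm = max(
        web_lms, key=lambda lm: log_likelihood(tokens, lm, collection_lm)
    )
    expanded[name] = mix(plain[name], best_web_lm, weight_b=0.5)

query = ["weather"]
plain_scores = {n: log_likelihood(query, lm, collection_lm)
                for n, lm in plain.items()}
expanded_scores = {n: log_likelihood(query, lm, collection_lm)
                   for n, lm in expanded.items()}

print("without expansion:", plain_scores)
print("with expansion:   ", expanded_scores)
```

In this toy setting the query word is missing from both transcripts, so the unexpanded models cannot separate the two documents, while the expanded model for `talk_a` picks up the word from the matched Web text and scores higher.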
Pages: 2144-2147
Page count: 4