An LDA-smoothed Relevance Model for Document Expansion: A Case Study for Spoken Document Retrieval

被引:0
|
作者
Ganguly, Debasis [1 ]
Leveling, Johannes [1 ]
Jones, Gareth J. F. [1 ]
机构
[1] Dublin City Univ, Sch Comp, Ctr Next Generat Localisat, Dublin 9, Ireland
基金
爱尔兰科学基金会;
关键词
Document Expansion; Topic Modelling;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document expansion (DE) in information retrieval (IR) involves modifying each document in the collection by introducing additional terms into the document. It is particularly useful to improve retrieval of short and noisy documents where the additional terms can improve the description of the document content. Existing approaches to DE assume that documents to be expanded are from a single topic. In the case of multi-topic documents this can lead to a topic bias in terms selected for DE and hence may result in poor retrieval quality due to the lack of coverage of the original document topics in the expanded document. This paper proposes a new DE technique providing a more uniform selection and weighting of DE terms from all constituent topics. We show that our proposed method significantly outperforms the most recently reported relevance model based DE method on a spoken document retrieval task for both manual and automatic speech recognition transcripts.
引用
收藏
页码:1057 / 1060
页数:4
相关论文
共 50 条
  • [1] Language model expansion using webdata for spoken document retrieval
    Masumura, Ryo
    Hahm, Seongjun
    Ito, Akinori
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2144 - 2147
  • [2] Phonetic Query Expansion for Spoken Document Retrieval
    Mamou, Jonathan
    Ramabhadran, Bhuvana
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2106 - +
  • [3] Phonetic query expansion for spoken document retrieval
    Reyes-Barragan, Alejandro
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 57 - 64
  • [4] Leveraging Relevance Cues for Improved Spoken Document Retrieval
    Chen, Pei-Ning
    Chen, Kuan-Yu
    Chen, Berlin
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 936 - +
  • [5] Effects of Query Expansion for Spoken Document Passage Retrieval
    Akiba, Tomoyosi
    Honda, Koichiro
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2148 - 2151
  • [6] EFFECTIVE PSEUDO-RELEVANCE FEEDBACK FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8535 - 8539
  • [7] An architecture for spoken document retrieval
    Terol, RM
    Martínez-Barco, P
    Palomar, M
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511
  • [8] Experiments in spoken document retrieval
    Jones, K.Sparck
    Jones, G.J.F.
    Foote, J.T.
    Young, S.J.
    [J]. Information Processing and Management, 1996, 32 (04): : 399 - 417
  • [9] Experiments in spoken document retrieval
    Sparck-Jones, K
    Jones, GJF
    Foote, JT
    Young, SJ
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 399 - 417
  • [10] NEURAL RELEVANCE-AWARE QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Lo, Tien-Hong
    Chen, Ying-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 466 - 473