GENERATING PSEUDO-RELEVANT REPRESENTATIONS FOR SPOKEN DOCUMENT RETRIEVAL

被引:0
|
作者
Wu, Zheng-Yu [1 ]
Yen, Li-Phen [1 ]
Chen, Kuan-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Taipei, Taiwan
关键词
Spoken document retrieval; pseudo-relevance feedback; query reformulation; representation;
D O I
10.1109/icassp.2019.8683832
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spoken document retrieval (SDR) has become an important research subject due to the immenseness of multimedia data along with speech have spread around the world in our daily life. One of the fundamental challenge facing SDR is that the input query usually contains only a few words, which is too short to convey the information need of a user. In order to mitigate the problem, a well-practiced strategy is to reformulate the original query by performing a pseudo-relevance feedback process. Although several studies have evidenced its ability and capability for enhancing the retrieval performance, the time-consuming problem makes it hard to be used in reality. Motivated by the observations, in this paper, we concentrate on proposing a novel framework, which targets at generating a set of pseudo-relevant representations for a given query automatically, and eliminating the time-wasting problem. On top of the generated representations, we further investigate a novel query reformulation mechanism so as to improve the retrieval performance. A series of empirical SDR experiments conducted on a benchmark collection demonstrate the good efficacy of the proposed framework, compared to several existing strong baseline systems.
引用
收藏
页码:7370 / 7374
页数:5
相关论文
共 50 条
  • [1] Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
    Tang, Hongyin
    Sun, Xingwu
    Jin, Beihong
    Wang, Jingang
    Zhang, Fuzheng
    Wu, Wei
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5054 - 5064
  • [2] Spoken document representations for probabilistic retrieval
    Jourlin, P
    Johnson, SE
    Sparck-Jones, K
    Woodland, PC
    [J]. SPEECH COMMUNICATION, 2000, 32 (1-2) : 21 - 36
  • [3] Improving query expansion using pseudo-relevant web knowledge for information retrieval
    Azad, Hiteshwar Kumar
    Deepak, Akshay
    Chakraborty, Chinmay
    Abhishek, Kumar
    [J]. PATTERN RECOGNITION LETTERS, 2022, 158 : 148 - 156
  • [4] The effects of pseudo-relevant 100% claims
    Munichor, Nira
    Levontin, Liat
    [J]. PSYCHOLOGY & MARKETING, 2024, 41 (04) : 719 - 733
  • [5] Improved Query-Topic Models Using Pseudo-Relevant Polya Document Models
    Cummins, Ronan
    [J]. ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, : 101 - 108
  • [6] EFFECTIVE PSEUDO-RELEVANCE FEEDBACK FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8535 - 8539
  • [7] Combining Word and Phonetic-Code Representations for Spoken Document Retrieval
    Reyes-Barragan, Alejandro
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 458 - 466
  • [8] Combining multiple subword representations for open-vocabulary spoken document retrieval
    Lee, SW
    Tanaka, K
    Itoh, Y
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 505 - 508
  • [9] Experiments in spoken document retrieval
    Jones, K.Sparck
    Jones, G.J.F.
    Foote, J.T.
    Young, S.J.
    [J]. Information Processing and Management, 1996, 32 (04): : 399 - 417
  • [10] An architecture for spoken document retrieval
    Terol, RM
    Martínez-Barco, P
    Palomar, M
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511