Query-Based Automatic Training Set Selection for Microblog Retrieval

Cited by: 4
Authors
Albishre, Khaled [1 ,2 ]
Li, Yuefeng [1 ]
Xu, Yue [1 ]
Affiliations
[1] Queensland Univ Technol, Sch EECS, Brisbane, Qld, Australia
[2] Umm Al Qura Univ, Mecca, Saudi Arabia
Keywords
Microblog retrieval; Topic model; Query expansion; Pseudo relevance feedback
DOI
10.1007/978-3-319-93037-4_26
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Typical pseudo-relevance feedback models assume that the first-pass documents are the most relevant and use those documents to select feedback terms for query expansion. In real applications, however, short documents such as microblogs may not contain enough information about the searched topic, which increases the chance that irrelevant documents are included in the initial set of retrieved documents. This reduces a feedback model's ability to capture information relevant to users' needs, so determining the best documents for relevance feedback without requiring extra effort from the user becomes a critical challenge. In this paper, we propose an innovative mechanism that automatically selects useful feedback documents using a topic modeling technique to improve the effectiveness of pseudo-relevance feedback models. The main idea behind the proposed model is to discover the latent topics in the top-ranked documents, which allows the correlation between terms in relevant topics to be exploited. To capture discriminative terms for query expansion, we incorporate topical features into a relevance model that focuses on the temporal information in the selected set of documents. Experimental results on the TREC 2011-2013 microblog datasets show that the proposed model significantly outperforms all state-of-the-art baseline models.
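To make the baseline idea in the abstract concrete, the following is a minimal sketch of plain pseudo-relevance feedback, not the paper's method: it omits the topic-model document selection and the temporal component entirely. The function name `expansion_terms`, its parameters, and the whitespace tokenizer are assumptions made for illustration. It treats the top-k first-pass documents as relevant and weights each candidate term by its within-document frequency times the document's normalized retrieval score.

```python
from collections import Counter


def expansion_terms(query, ranked_docs, k=3, n_terms=2):
    """Pick expansion terms from the top-k first-pass documents.

    ranked_docs is a list of (text, retrieval_score) pairs, best first.
    Hypothetical helper for illustration; not taken from the paper.
    """
    query_terms = set(query.lower().split())
    top = ranked_docs[:k]
    total_score = sum(score for _, score in top)
    weights = Counter()
    for text, score in top:
        tokens = text.lower().split()
        if not tokens or total_score == 0:
            continue
        for term, count in Counter(tokens).items():
            # Term frequency in the document, weighted by the
            # document's share of the total first-pass score.
            weights[term] += (count / len(tokens)) * (score / total_score)
    # Terms already in the query are not useful as expansions.
    candidates = [(t, w) for t, w in weights.items() if t not in query_terms]
    # Highest weight first; ties broken alphabetically for determinism.
    candidates.sort(key=lambda tw: (-tw[1], tw[0]))
    return [t for t, _ in candidates[:n_terms]]
```

Calling `expansion_terms("brisbane flood", docs, k=2)` on a small list of scored texts returns the highest-weighted non-query terms from the two best documents. If an irrelevant short document sits in the top k, its terms enter the candidate pool with the same treatment, which is the weakness the abstract attributes to this kind of model.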
Pages: 325-336
Number of pages: 12
Related papers
50 in total
  • [21] Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
    Zhang, Zhu
    Lin, Zhijie
    Zhao, Zhou
    Xiao, Zhenxin
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019: 655 - 664
  • [22] Query-Based Hard-Image Retrieval for Object Detection at Test Time
    Ayers, Edward
    Sadeghi, Jonathan
    Redford, John
    Mueller, Romain
    Dokania, Puneet K.
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023: 14692 - 14700
  • [23] AUTOMATIC QUERY FORMULATION AND TERMINATION FOR DOCUMENT-RETRIEVAL BASED ON THE FUZZY SET-THEORY
    WUWONGSE, V
    LEE, HL
    [J]. COMPUTERS AND ARTIFICIAL INTELLIGENCE, 1990, 9 (04): 345 - 356
  • [24] A query-based quantum eigensolver
    Jin, Shan
    Wu, Shaojun
    Zhou, Guanyu
    Li, Ying
    Li, Lvzhou
    Li, Bo
    Wang, Xiaoting
    [J]. Quantum Engineering, 2020, 2 (03)
  • [25] Snapshot query-based debugging
    Potanin, A
    Noble, J
    Biddle, R
    [J]. 2004 AUSTRALIAN SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2004: 251 - 259
  • [26] Dynamic query-based debugging
    Lencevicius, R
    Hölzle, U
    Singh, AK
    [J]. ECOOP'99 - OBJECT-ORIENTED PROGRAMMING, 1999, 1628 : 135 - 160
  • [27] Query-Based Data Pricing
    Koutris, Paraschos
    Upadhyaya, Prasang
    Balazinska, Magdalena
    Howe, Bill
    Suciu, Dan
    [J]. JOURNAL OF THE ACM, 2015, 62 (05)
  • [28] Video fingerprinting: Features for duplicate and similar video detection and query-based video retrieval
    Sarkar, Anindya
    Ghosh, Pratim
    Moxley, Emily
    Manjunath, B. S.
    [J]. MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS II, 2008, 6820
  • [29] Hybrid query expansion model for text and microblog information retrieval
    Zingla, Meriem Amina
    Latiri, Chiraz
    Mulhem, Philippe
    Berrut, Catherine
    Slimani, Yahya
    [J]. INFORMATION RETRIEVAL JOURNAL, 2018, 21 (04): 337 - 367
  • [30] Query Expansion With Local Conceptual Word Embeddings in Microblog Retrieval
    Wang, Yashen
    Huang, Heyan
    Feng, Chong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1737 - 1749