Query-Based Automatic Training Set Selection for Microblog Retrieval

被引:4
|
作者
Albishre, Khaled [1 ,2 ]
Li, Yuefeng [1 ]
Xu, Yue [1 ]
机构
[1] Queensland Univ Technol, Sch EECS, Brisbane, Qld, Australia
[2] Umm Al Qura Univ, Mecca, Saudi Arabia
关键词
Microblog retrieval; Topic model; Query expansion; Pseudo relevance feedback;
D O I
10.1007/978-3-319-93037-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical pseudo-relevance feedback models assume that the first-pass documents are the most relevant and use those documents to select feedback terms for query expansion. In real applications, however, short documents, such as microblogs, may not have enough information about the searched topic, thus increasing the chance that irrelevant documents will be included in the initial set of retrieved documents. This situation reduces a feedback model's ability to capture information that is relevant to users' needs, which makes determining the best documents for relevant feedback without requiring extra effort from the user a critical challenge. In this paper, we propose an innovative mechanism to automatically select useful feedback documents using a topic modeling technique to improve the effectiveness of pseudo-relevance feedback models. The main idea behind the proposed model is to discover the latent topics in the top-ranked documents that allow for the exploitation of the correlation between terms in relevant topics. To capture discriminative terms for query expansion, we incorporated topical features into a relevance model that focuses on the temporal information in the selected set of documents. Experimental results on TREC 2011-2013 microblog datasets illustrate that the proposed model significantly outperforms all state-of-the-art baseline models.
引用
收藏
页码:325 / 336
页数:12
相关论文
共 50 条
  • [41] Query-based sampling of text databases
    Callan, J
    Connell, M
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (02) : 97 - 130
  • [42] Query-Based Summarization for search lists
    Ye, Xinghuo
    Wei, Hai
    [J]. FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 330 - 333
  • [43] Effective and Robust Query-Based Stemming
    Paik, Jiaul H.
    Parui, Swapan K.
    Pal, Dipasree
    Robertson, Stephen E.
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2013, 31 (04)
  • [44] Query-Based Automatic Multi-document Summarization Extraction Method for Web Pages
    He, Qi
    Hao, Hong-Wei
    Yin, Xu-Cheng
    [J]. PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 107 - 112
  • [45] Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval
    Li, Shenshen
    Xu, Xing
    Jiang, Xun
    Shen, Fumin
    Liu, Xin
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2959 - 2972
  • [46] Automatic query generation for content-based image retrieval
    Breiteneder, C
    Eidenberger, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 705 - 708
  • [47] Query-Based Sensors Selection for Collaborative Wireless Sensor Networks With Stochastic Energy Harvesting
    Chen, Yan-Bin
    Nevat, Ido
    Zhang, Pengfei
    Nagarajan, Sai Ganesh
    Wei, Hung-Yu
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) : 3031 - 3043
  • [48] Incorporating Semantic Word Representations into Query Expansion for Microblog Information Retrieval
    Xu, Bo
    Lin, Hongfei
    Lin, Yuan
    Xu, Kan
    Wang, Lin
    Gao, Jiping
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (04): : 626 - 636
  • [49] Optimal Query-Based Relevance Feedback in Medical Image Retrieval Using Score Fusion-Based Classification
    Behnam, Mohammad
    Pourghassem, Hossein
    [J]. JOURNAL OF DIGITAL IMAGING, 2015, 28 (02) : 160 - 178
  • [50] Microblog Retrieval Based on Concept-Enhanced Pre-Training Model
    Wang, Yashen
    Wang, Zhaoyu
    Zhang, Huanhuan
    Liu, Zhirun
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 17 (03)