Query-Based Automatic Training Set Selection for Microblog Retrieval

被引:4
|
作者
Albishre, Khaled [1 ,2 ]
Li, Yuefeng [1 ]
Xu, Yue [1 ]
机构
[1] Queensland Univ Technol, Sch EECS, Brisbane, Qld, Australia
[2] Umm Al Qura Univ, Mecca, Saudi Arabia
关键词
Microblog retrieval; Topic model; Query expansion; Pseudo relevance feedback;
D O I
10.1007/978-3-319-93037-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical pseudo-relevance feedback models assume that the first-pass documents are the most relevant and use those documents to select feedback terms for query expansion. In real applications, however, short documents, such as microblogs, may not have enough information about the searched topic, thus increasing the chance that irrelevant documents will be included in the initial set of retrieved documents. This situation reduces a feedback model's ability to capture information that is relevant to users' needs, which makes determining the best documents for relevant feedback without requiring extra effort from the user a critical challenge. In this paper, we propose an innovative mechanism to automatically select useful feedback documents using a topic modeling technique to improve the effectiveness of pseudo-relevance feedback models. The main idea behind the proposed model is to discover the latent topics in the top-ranked documents that allow for the exploitation of the correlation between terms in relevant topics. To capture discriminative terms for query expansion, we incorporated topical features into a relevance model that focuses on the temporal information in the selected set of documents. Experimental results on TREC 2011-2013 microblog datasets illustrate that the proposed model significantly outperforms all state-of-the-art baseline models.
引用
收藏
页码:325 / 336
页数:12
相关论文
共 50 条
  • [1] Regularizing query-based retrieval scores
    Diaz, Fernando
    [J]. INFORMATION RETRIEVAL, 2007, 10 (06): : 531 - 562
  • [2] Regularizing query-based retrieval scores
    Fernando Diaz
    [J]. Information Retrieval, 2007, 10 : 531 - 562
  • [3] Research on Query-based Automatic Summarization of Webpage
    Chen, Zhimin
    Shen, Jie
    [J]. 2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL I, 2009, : 173 - 176
  • [4] A query-based approach for test selection in diagnosis
    François Gagnon
    Babak Esfandiari
    [J]. Artificial Intelligence Review, 2008, 29
  • [5] Query-based HMM training method for ASR
    Kyung, Y
    Jung, J
    Moon, S
    [J]. ELECTRONICS LETTERS, 2003, 39 (16) : 1222 - 1223
  • [6] A query-based approach for test selection in diagnosis
    Gagnon, Francois
    Esfandiari, Babak
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2008, 29 (3-4) : 249 - 263
  • [7] Co-Learning Ranking for Query-Based Retrieval
    Peng, Min
    Huang, Jiajia
    Zhu, Jiahui
    Zhou, Li
    Fu, Hui
    He, Yanxiang
    Li, Fei
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT I, 2013, 8180 : 468 - 477
  • [8] Predicting Query Performance In Microblog Retrieval
    Perez, Jesus A. Rodriguez
    Jose, Joemon M.
    [J]. SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1183 - 1186
  • [9] A query-based system for automatic invocation of web services
    Gupta, Chaitali
    Bhowmik, Rajdeep
    Head, Michael R.
    Govindaraju, Madhusudhan
    Meng, Weiyi
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2007, : 759 - +
  • [10] Short Query Expansion for Microblog Retrieval
    Zingla, Meriem Amina
    Chiraz, Latiri
    Slimani, Yahya
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 225 - 234