Iterative query selection for opaque search engines with pseudo relevance feedback

Cited by: 0
Authors
Reuben, Maor [1, 2]
Elyashar, Aviad [1, 3]
Puzis, Rami [1, 2]
Affiliations
[1] Telekom Innovat Labs, Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Software & Informat Syst Engn, Ben Gurion, Israel
[3] Sami Shamoon Coll Engn, Dept Comp Sci, Beer Sheva, Israel
Keywords
Query selection; Opaque search engine; Pseudo relevance feedback; Fake news;
DOI
10.1016/j.eswa.2022.117027
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Retrieving information from an online search engine is the first and most important step in many data mining tasks, such as fake news detection. Most of the search engines currently available on the web, including those of all social media platforms, are black boxes (i.e., opaque) that support only short keyword queries. In these settings, it is challenging to retrieve all posts and comments discussing a particular news item automatically and on a large scale. In this paper, we propose a method for generating short keyword queries given a prototype document. The proposed iterative query selection (IQS) algorithm interacts with the opaque search engine to iteratively improve the query by maximizing the number of relevant results retrieved. We evaluated IQS on the Twitter TREC Microblog 2012 and TREC-COVID 2019 datasets, where it outperformed state-of-the-art methods. In addition, we used the IQS algorithm to automatically collect a large-scale dataset of about 70K true and fake news items for the fake news detection task. The dataset, which we have made publicly available to the research community, includes over 22M accounts and 61M tweets. We demonstrate the usefulness of the dataset for the fake news detection task, on which it achieves state-of-the-art performance.
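The abstract describes the loop only at a high level: start from a prototype document, issue short keyword queries to an opaque search engine, and keep the query that retrieves the most relevant results. The sketch below illustrates that idea under stated assumptions and is not the authors' implementation. The hill-climbing word swap, the Jaccard-overlap relevance estimate, the 0.3 threshold, and the names `toy_search`, `score_query`, and `iterative_query_selection` are all hypothetical stand-ins chosen for illustration.

```python
import random
import re
from typing import Callable, List, Set, Tuple

Query = Tuple[str, ...]


def tokenize(text: str) -> Set[str]:
    """Lowercase a text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def relevance(prototype: str, result: str) -> float:
    """Jaccard word overlap: an assumed stand-in for a relevance estimate."""
    a, b = tokenize(prototype), tokenize(result)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score_query(query: Query, search: Callable[[Query], List[str]],
                prototype: str, threshold: float = 0.3) -> int:
    """Count the returned results judged relevant to the prototype."""
    return sum(relevance(prototype, r) >= threshold for r in search(query))


def iterative_query_selection(prototype: str,
                              search: Callable[[Query], List[str]],
                              max_words: int = 3, iterations: int = 50,
                              seed: int = 0) -> Tuple[Query, int]:
    """Hill-climb over short keyword queries drawn from the prototype."""
    rng = random.Random(seed)
    vocab = sorted(tokenize(prototype))
    if not vocab:
        return (), 0
    best: Query = tuple(rng.sample(vocab, min(max_words, len(vocab))))
    best_score = score_query(best, search, prototype)
    for _ in range(iterations):
        unused = [w for w in vocab if w not in best]
        if not unused:
            break  # every prototype word is already in the query
        candidate = list(best)
        # Swap one query word for a prototype word not yet in the query.
        candidate[rng.randrange(len(candidate))] = rng.choice(unused)
        candidate_score = score_query(tuple(candidate), search, prototype)
        if candidate_score > best_score:  # keep only strict improvements
            best, best_score = tuple(candidate), candidate_score
    return best, best_score


# Toy stand-in for an opaque engine: returns documents containing all terms.
CORPUS = [
    "large study finds vaccine claim debunked",
    "new study debunked the autism vaccine claim",
    "weather forecast for the weekend",
    "vaccine rollout schedule announced",
]


def toy_search(query: Query, limit: int = 10) -> List[str]:
    terms = set(query)
    return [doc for doc in CORPUS if terms <= tokenize(doc)][:limit]


if __name__ == "__main__":
    prototype = "vaccine causes autism claim debunked by large study"
    print(iterative_query_selection(prototype, toy_search))
```

The search engine is passed in as a plain callable so that the loop treats it as a black box: only the returned results are inspected, never the engine's ranking internals.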
Pages: 11