Diversity-Aware Top-k Publish/Subscribe for Text Stream

被引:35
|
作者
Chen, Lisi [1 ]
Cong, Gao [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
text stream; diversification; publish/subscribe; SEARCH;
D O I
10.1145/2723372.2749451
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Massive amount of text data are being generated by a huge number of web users at an unprecedented scale. These data cover a wide range of topics. Users are interested in receiving a few up-to-date representative documents (e.g., tweets) that can provide them with a wide coverage of different aspects of their query topics. To address the problem, we consider the Diversity-Aware Top k Subscription (DAS) query. Given a DAS query, we continuously maintain an up-to-date result set that contains k most recently returned documents over a text stream for the query. The DAS query takes into account text relevance, document recency, and result diversity. We propose a novel solution to efficiently processing a large number of DAS queries over a stream of documents. We demonstrate the efficiency of our approach on real world dataset and the experimental results show that our solution is able to achieve a reduction of the processing time by 60-75% compared with two baselines. We also study the effectiveness of the DAS query.
引用
收藏
页码:347 / 362
页数:16
相关论文
共 50 条
  • [31] Mining Top-K Sequential Patterns in the Data Stream Environment
    Dai, Bi-Ru
    Jiang, Hung-Lin
    Chung, Chih-Heng
    [J]. INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 142 - 149
  • [32] Location-aware online learning for top-k recommendation
    Palovics, Robert
    Szalai, Peter
    Pap, Julia
    Frigo, Erzsebet
    Kocsis, Levente
    Benczur, Andras A.
    [J]. PERVASIVE AND MOBILE COMPUTING, 2017, 38 : 490 - 504
  • [33] Social-Aware Top-k Spatial Keyword Search
    Wu, Dingming
    Li, Yafei
    Choi, Byron
    Xu, Jianliang
    [J]. 2014 IEEE 15TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM), VOL 1, 2014, : 235 - 244
  • [34] Semantic-aware top-k spatial keyword queries
    Qian, Zhihu
    Xu, Jiajie
    Zheng, Kai
    Zhao, Pengpeng
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2018, 21 (03): : 573 - 594
  • [35] Semantic-aware top-k spatial keyword queries
    Zhihu Qian
    Jiajie Xu
    Kai Zheng
    Pengpeng Zhao
    Xiaofang Zhou
    [J]. World Wide Web, 2018, 21 : 573 - 594
  • [36] Top-K structural diversity search in large networks
    Huang, Xin
    Cheng, Hong
    Li, Rong-Hua
    Qin, Lu
    Yu, Jeffrey Xu
    [J]. VLDB JOURNAL, 2015, 24 (03): : 319 - 343
  • [37] Semantic-Aware Top-k Multirequest Optimal Route
    Wang, Shuang
    Xu, Yingchun
    Wang, Yinzhe
    Liu, Hezhi
    Zhang, Qiaoqiao
    Ma, Tiemin
    Liu, Shengnan
    Zhang, Siyuan
    Li, Anliang
    [J]. COMPLEXITY, 2019, 2019
  • [38] Efficient Top-k Edge Structural Diversity Search
    Zhang, Qi
    Li, Rong-Hua
    Yang, Qixuan
    Wang, Guoren
    Qin, Lu
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 205 - 216
  • [39] Social-Aware Spatial Top-k and Skyline Queries
    Sohail, Ammar
    Cheema, Muhammad Aamir
    Taniar, David
    [J]. COMPUTER JOURNAL, 2018, 61 (11): : 1620 - 1638
  • [40] Context-Aware Top-k Processing using Views
    Maniu, Silviu
    Cautis, Bogdan
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1959 - 1968