Hyper-USS: Answering Subset Query Over Multi-Attribute Data Stream

被引:1
|
作者
Miao, Ruijie [1 ,4 ]
Zhang, Yiyao [2 ,6 ]
Qu, Guanyu [3 ,7 ,8 ]
Yang, Kaicheng [1 ,4 ,5 ]
Yang, Tong [1 ,4 ,5 ]
Cui, Bin [1 ,4 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Nanjing Univ, Nanjing, Peoples R China
[3] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[4] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[5] Peng Cheng Lab, Shenzhen, Peoples R China
[6] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[7] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[8] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Sketch; Multi-attribute Data Stream; Subset Query;
D O I
10.1145/3580305.3599383
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketching algorithms are considered as promising solutions for answering approximate query on massive data stream. In real scenarios, a large number of problems can be abstracted as subset query over multiple attributes. Existing sketches are designed for query on single attributes, and therefore are inefficient for query on multiple attributes. In this work, we propose Hyper-USS, an innovative sketching algorithm that supports subset query over multiple attributes accurately and efficiently. To the best of our knowledge, this work is the first sketching algorithm designed to answer approximate query over multi-attribute data stream. We utilize the key technique, Joint Variance Optimization, to guarantee high estimation accuracy on all attributes. Experiment results show that, compared with the state-of-the-art (SOTA) sketches that support subset query on single attributes, Hyper-USS improves the accuracy by 16.67x and the throughput by 8.54x. The code is open-sourced at Github.
引用
收藏
页码:1698 / 1709
页数:12
相关论文
共 50 条
  • [41] Exploiting Correlation for Query Resolution in Multi-attribute Structured Peer-to-Peer Networks
    Chatterjee, Shibayan
    Jayasumana, Anura P.
    [J]. PROCEEDINGS OF THE 2018 IEEE 43RD CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2018, : 461 - 464
  • [42] Adaptive processing for continuous query over data stream
    Bae, Misook
    Hwang, Buhyun
    Nam, Jiseung
    [J]. PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2007, 4742 : 347 - 358
  • [43] NQE: N-ary Query Embedding for Complex Query Answering over Hyper-Relational Knowledge Graphs
    Luo, Haoran
    Haihong, E.
    Yang, Yuhao
    Zhou, Gengxian
    Guo, Yikai
    Yao, Tianyu
    Tang, Zichen
    Lin, Xueyuan
    Wan, Kaiyang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4543 - 4551
  • [44] An adaptively multi-attribute index framework for big IoT data
    Huang, Chih-Yuan
    Chang, Yu-Jui
    [J]. COMPUTERS & GEOSCIENCES, 2021, 155
  • [45] Multi-attribute reduction of GPS data trajectories: a new approach
    Usyukov, Vlad
    [J]. 11TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 3RD INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2020, 170 : 51 - 58
  • [46] Multi-Attribute Generalization Method in Privacy Preserving Data Publishing
    Yu Wen-bing
    Pin, L. V.
    Chen Nian-sheng
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 319 - 322
  • [47] Indexing Techniques for Power Management in Multi-Attribute Data Broadcast
    Qinglong Hu
    Wang-Chien Lee
    Dik Lun Lee
    [J]. Mobile Networks and Applications, 2001, 6 : 185 - 197
  • [48] Indexing techniques for power management in multi-attribute data broadcast
    Hu, QL
    Lee, WC
    Lee, DL
    [J]. MOBILE NETWORKS & APPLICATIONS, 2001, 6 (02): : 185 - 197
  • [49] A Multi-attribute Keyword Retrieval Mechanism for Encrypted Cloud Data
    Li, Yunfa
    Li, Mingyi
    Shen, Yangyang
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (12): : 335 - 346
  • [50] Distributed Query Engine for Multiple-Query Optimization over Data Stream
    Yang, Junye
    Zhang, Yong
    Wang, Jin
    Xing, Chunxiao
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 523 - 527