Hyper-USS: Answering Subset Query Over Multi-Attribute Data Stream

被引:1
|
作者
Miao, Ruijie [1 ,4 ]
Zhang, Yiyao [2 ,6 ]
Qu, Guanyu [3 ,7 ,8 ]
Yang, Kaicheng [1 ,4 ,5 ]
Yang, Tong [1 ,4 ,5 ]
Cui, Bin [1 ,4 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Nanjing Univ, Nanjing, Peoples R China
[3] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[4] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[5] Peng Cheng Lab, Shenzhen, Peoples R China
[6] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[7] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[8] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Sketch; Multi-attribute Data Stream; Subset Query;
D O I
10.1145/3580305.3599383
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketching algorithms are considered as promising solutions for answering approximate query on massive data stream. In real scenarios, a large number of problems can be abstracted as subset query over multiple attributes. Existing sketches are designed for query on single attributes, and therefore are inefficient for query on multiple attributes. In this work, we propose Hyper-USS, an innovative sketching algorithm that supports subset query over multiple attributes accurately and efficiently. To the best of our knowledge, this work is the first sketching algorithm designed to answer approximate query over multi-attribute data stream. We utilize the key technique, Joint Variance Optimization, to guarantee high estimation accuracy on all attributes. Experiment results show that, compared with the state-of-the-art (SOTA) sketches that support subset query on single attributes, Hyper-USS improves the accuracy by 16.67x and the throughput by 8.54x. The code is open-sourced at Github.
引用
收藏
页码:1698 / 1709
页数:12
相关论文
共 50 条
  • [1] Query and Animate Multi-attribute Trajectory Data
    Xu, Jianqiu
    Gueting, Ralf Hartmut
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2551 - 2554
  • [2] Research on Multi-Attribute Sequence Query Processing Techniques over Data Streams
    Lin, Edgar Chia-Han
    [J]. APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 575 - 578
  • [3] A Query Optimizer for Range Queries over Multi-Attribute Trajectories
    Xu, Jianqiu
    Lu, Hua
    Bao, Zhifeng
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (01)
  • [4] Multi-attribute Data Indexing for Query Based Entity Resolution
    Sun, Chen-Chen
    Shen, De-Rong
    Xiao, Ying-Yuan
    Li, Yu-Kun
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (06): : 2331 - 2347
  • [5] Range Search over Encrypted Multi-Attribute Data
    Falzon, Francesca
    Markatou, Evangelia Anna
    Espiritu, Zachary
    Tamassia, Roberto
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 16 (04): : 587 - 600
  • [6] QUERY AND CONSTRUCTION OF MULTI-ATTRIBUTE SOCIAL ACCOUNTS
    SCHINNAR, AP
    [J]. JOURNAL OF MATHEMATICAL SOCIOLOGY, 1977, 5 (02): : 239 - 255
  • [7] Towards an interactive query environment for a multi-attribute decision model with imprecise data and vague query conditions
    Lukacs, G
    [J]. 11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, : 708 - 712
  • [8] Detecting Subset of Classifiers for Multi-attribute Response Prediction
    Conversano, Claudio
    Mola, Francesco
    [J]. DATA ANALYSIS AND CLASSIFICATION, 2010, : 233 - 240
  • [9] Ciphertext Query Method Based on Multi-Attribute Keywords
    Li, Haoyu
    Zhang, Longjun
    Li, Qingpeng
    Gao, Zhiqiang
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 341 - 344
  • [10] Similarity measure for multi-attribute data
    Li, CJ
    Prabhakaran, B
    Zheng, SQ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1149 - 1152