Hyper-USS: Answering Subset Query Over Multi-Attribute Data Stream

被引:1
|
作者
Miao, Ruijie [1 ,4 ]
Zhang, Yiyao [2 ,6 ]
Qu, Guanyu [3 ,7 ,8 ]
Yang, Kaicheng [1 ,4 ,5 ]
Yang, Tong [1 ,4 ,5 ]
Cui, Bin [1 ,4 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Nanjing Univ, Nanjing, Peoples R China
[3] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[4] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[5] Peng Cheng Lab, Shenzhen, Peoples R China
[6] Nanjing Univ, Dept Comp Sci & Technol, Nanjing, Peoples R China
[7] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[8] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Sketch; Multi-attribute Data Stream; Subset Query;
D O I
10.1145/3580305.3599383
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sketching algorithms are considered as promising solutions for answering approximate query on massive data stream. In real scenarios, a large number of problems can be abstracted as subset query over multiple attributes. Existing sketches are designed for query on single attributes, and therefore are inefficient for query on multiple attributes. In this work, we propose Hyper-USS, an innovative sketching algorithm that supports subset query over multiple attributes accurately and efficiently. To the best of our knowledge, this work is the first sketching algorithm designed to answer approximate query over multi-attribute data stream. We utilize the key technique, Joint Variance Optimization, to guarantee high estimation accuracy on all attributes. Experiment results show that, compared with the state-of-the-art (SOTA) sketches that support subset query on single attributes, Hyper-USS improves the accuracy by 16.67x and the throughput by 8.54x. The code is open-sourced at Github.
引用
收藏
页码:1698 / 1709
页数:12
相关论文
共 50 条
  • [21] An introduction to reasoning over qualitative multi-attribute preferences
    Nunes, Ingrid
    Miles, Simon
    Luck, Michael
    Lucena, Carlos J. P.
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2015, 30 (03): : 342 - 372
  • [22] View-based query answering and query containment over semistructured data
    Calvanese, D
    De Giacomo, G
    Lenzerini, M
    Vardi, MY
    [J]. DATABASE PROGRAMMING LANGUAGES, 2002, 2397 : 40 - 61
  • [23] Multi-Attribute Query Processing Through In-Network Aggregation in Edge Computing
    Li, Xiaocui
    Zhou, Zhangbing
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2018, : 144 - 151
  • [24] Query Optimization over Distributed Data Stream
    Wang, Shuang
    Tan, Zhenhua
    Gao, Xiaoxing
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 2, PROCEEDINGS, 2009, : 415 - 418
  • [25] Aggregated multi-attribute query processing in edge computing for industrial IoT applications
    Li, Xiaocui
    Zhou, Zhangbing
    Guo, Junqi
    Wang, Shangguang
    Zhang, Junsheng
    [J]. COMPUTER NETWORKS, 2019, 151 : 114 - 123
  • [26] The Optimization Reachability Query of Large scale Multi-attribute Constraints Directed Graph
    Zhang, Kehong
    Li, Keqiu
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2018, 33 (02): : 71 - 85
  • [27] Multi-attribute data classification using neutrosophic probability
    Bhutani, Kanika
    Kumar, Megha
    Aggarwal, Swati
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [28] A framework for efficient multi-attribute movement data analysis
    Fabio Valdés
    Ralf Hartmut Güting
    [J]. The VLDB Journal, 2019, 28 : 427 - 449
  • [29] MULTI-ATTRIBUTE DATA VISUALIZATION ANALYSIS MODEL FOR MULTIMEDIA
    Wu, Jun
    Chen, YaoXin
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC 2019), 2019,
  • [30] Efficient Similarity Join and Search on Multi-Attribute Data
    Li, Guoliang
    He, Jian
    Deng, Dong
    Li, Jian
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1137 - 1151