Domain-driven data synopses for dynamic quantiles

被引:4
|
作者
Gilbert, AC
Kotidis, Y
Muthukrishnan, S
Strauss, MJ
机构
[1] Univ Michigan, Dept Math, Ann Arbor, MI 48109 USA
[2] AT&T Labs Res, Florham Pk, NJ 07932 USA
[3] Rutgers State Univ, Dept Comp & Informat Sci, Piscataway, NJ 08854 USA
关键词
quantiles; database statistics; data streams;
D O I
10.1109/TKDE.2005.108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present new algorithms for dynamically computing quantiles of a relation subject to insert as well as delete operations. At the core of our algorithms lies a small- space multiresolution representation of the underlying data distribution based on random subset sums or RSSs. These RSSs are updated with every insert and delete operation. When quantiles are demanded, we use these RSSs to estimate quickly, without having to access the data, all the quantiles, each guaranteed to be accurate to within user-specified precision. While quantiles have found many uses in databases, in this paper, our focus is primarily on network management applications that monitor the distribution of active sessions in the network. Our examples are drawn both from the telephony and the IP network, where the goal is to monitor the distribution of the length of active calls and IP flows, respectively, over time. For such applications, we propose a new type of histogram that uses RSSs for summarizing the dynamic parts of the distributions while other parts with small volume of sessions are approximated using simple counters.
引用
收藏
页码:927 / 938
页数:12
相关论文
共 50 条
  • [1] Domain-driven data mining: A framework
    Cao, Longbing
    Zhang, Chengqi
    IEEE INTELLIGENT SYSTEMS, 2007, 22 (04) : 78 - 79
  • [2] Toward Domain-Driven Data Mining
    Zhu, Zhengxiang
    Gu, Jifa
    Yang, Wenxin
    Li, Xingsen
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 44 - +
  • [3] Recent advances in domain-driven data mining
    Chuanren Liu
    Ehsan Fakharizadi
    Tong Xu
    Philip S. Yu
    International Journal of Data Science and Analytics, 2023, 15 : 1 - 7
  • [4] Domain-driven data mining: Methodologies and applications
    Zhang, Chengqi
    Cao, Longbing
    ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 13 - +
  • [5] A Domain-Driven, Generative Data Model for BigPetStore
    Nowling, Ronald J.
    Vyas, Jay
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 49 - 55
  • [6] Recent advances in domain-driven data mining
    Liu, Chuanren
    Fakharizadi, Ehsan
    Xu, Tong
    Yu, Philip S.
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 15 (01) : 1 - 7
  • [7] Domain-Driven Data Mining: Challenges and Prospects
    Cao, Longbing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (06) : 755 - 769
  • [8] Recent advances in domain-driven data mining
    Liu, Chuanren
    Fakharizadi, Ehsan
    Xu, Tong
    Yu, Philip S.
    COMPLEX & INTELLIGENT SYSTEMS, 2022,
  • [9] The evolution of KDD: Towards domain-driven data mining*
    Cao, Longbing
    Zhang, Chengqi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (04) : 677 - 692
  • [10] Introduction to the Domain-Driven Data Mining Special Section
    Zhang, Chengqi
    Yu, Philip S.
    Bell, David
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (06) : 753 - 754