Better Algorithms for Counting Triangles in Data Streams

被引:36
|
作者
McGregor, Andrew [1 ]
Vorotnikova, Sofya [1 ]
Vu, Hoa T. [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
关键词
data streams; triangles; clustering coefficients; GRAPH; SUBGRAPH;
D O I
10.1145/2902251.2902283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present space-efficient data stream algorithms for approximating the number of triangles in a graph up to a factor 1 + epsilon. While it can be shown that determining whether a graph is triangle-free is not possible in sub-linear space, a large body of work has focused on minimizing the space required in terms of the number of triangles T (or a lower bound on this quantity) and other parameters including the number of nodes n and the number of edges m. Two models are important in the literature: the arbitrary order model in which the stream consists of the edges of the graph in arbitrary order and the adjacency list order model in which all edges incident to the same node appear consecutively. We improve over the state of the art results in both models. For the adjacency list order model, we show that (O) over tilde (c(-2)mR/root T) space is sufficient in one pass and (O) over tilde(epsilon(-2)m(3/2)/T) space is sufficient in two passes where the (O) over tilde(.) notation suppresses log factors. For the arbitrary order model, we show that (O) over tilde (epsilon(-2)m/root T) space suffices given two passes and that (O) over tilde(epsilon(-2)m(3/2)/T) space suffices given three passes and oracle access to the degrees. Finally, we show how to efficiently implement the "wedge sampling" approach to triangle estimation in the arbitrary order model. To do this, we develop the first algorithm for fp sampling such that multiple independent samples can be generated with O (polylog n) update time; this primitive is widely applicable and this result may be of independent interest.
引用
下载
收藏
页码:401 / 411
页数:11
相关论文
共 50 条
  • [31] Counting triangles in regular graphs
    He, Jialin
    Hou, Xinmin
    Ma, Jie
    Xie, Tianying
    JOURNAL OF GRAPH THEORY, 2024,
  • [32] Counting triangles in regular graphs
    He, Jialin
    Hou, Xinmin
    Ma, Jie
    Xie, Tianying
    Journal of Graph Theory, 107 (04): : 759 - 777
  • [33] Memory-Efficient and Accurate Sampling for Counting Local Triangles in Graph Streams: From Simple to Multigraphs
    Lim, Yongsub
    Jung, Minsoo
    Kang, U.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (01)
  • [34] Deterministic Sampling and Range Counting in Geometric Data Streams
    Bagchi, Amitabha
    Chaudhary, Amitabh
    Eppstein, David
    Goodrich, Michael T.
    ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (02)
  • [35] Adaptive frequency counting over bursty data streams
    Lin, Bill
    Ho, Wai-Shing
    Kao, Ben
    Chui, Chun-Kit
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 516 - 523
  • [36] Distinct Element Counting in Distributed Dynamic Data Streams
    Chen, Wenji
    Guan, Yong
    2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (INFOCOM), 2015,
  • [37] Communicate Less, Learn More: A Locally Differential Private Approach for Counting Triangles with Better Accuracy
    Saladi, Sasi Bhushan V.
    Sristy, Nagesh Bhattu
    PROCEEDINGS OF THE FOURTEENTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, CODASPY 2024, 2024, : 159 - 161
  • [38] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Ahmed Sharafeldeen
    Mohammed Alrahmawy
    Samir Elmougy
    Scientific Reports, 13
  • [39] Algorithms and estimators for summarization of unaggregated data streams
    Cohen, Edith
    Duffield, Nick
    Kaplan, Haim
    Lund, Carstent
    Thorup, Mikkel
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (07) : 1214 - 1244
  • [40] A Survey of Outlier Detection Algorithms for Data Streams
    Tamboli, Jinita
    Shukla, Madhu
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3535 - 3540