Efficient k-clique Listing with Set Intersection Speedup

Cited by: 4
Authors
Yuan, Zhirong [1]
Peng, You [4]
Cheng, Peng [1]
Han, Li [1]
Lin, Xuemin [3]
Chen, Lei [2]
Zhang, Wenjie [4]
Affiliations
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Antai Coll Econ & Management, Shanghai, Peoples R China
[4] Univ New South Wales, Kensington, NSW, Australia
Funding
National Natural Science Foundation of China; Australian Research Council;
Keywords
ARBORICITY;
DOI
10.1109/ICDE53745.2022.00192
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Listing all k-cliques is a fundamental problem in graph mining, with applications in finance, biology, and social network analysis. However, because the search space grows exponentially as k increases, listing all k-cliques is algorithmically challenging. DDegree and DDegCol are the state-of-the-art algorithms; they exploit ordering heuristics based on degree ordering and color ordering, respectively. Both DDegree and DDegCol incur high time and space overhead for set intersections because they construct and maintain all induced subgraphs. Moreover, it is non-trivial to apply data-level parallelism to further accelerate DDegree and DDegCol. In this paper, we propose two efficient algorithms, SDegree and BitCol, for k-clique listing. We mainly focus on accelerating the set intersections for k-clique listing. Both SDegree and BitCol exploit data-level parallelism for further acceleration with single instruction multiple data (SIMD) or vector instruction sets. Furthermore, we propose two preprocessing techniques, PreCore and Pre-List, which run in linear time. The preprocessing techniques significantly reduce the size of the original graph and prevent exploring a large number of invalid nodes. In the theoretical analysis, our algorithms have a comparable time complexity and a slightly lower space complexity than the state-of-the-art algorithms. Comprehensive experiments reveal that our algorithms outperform the state-of-the-art algorithms by 3.75x for degree ordering and 5.67x for color ordering on average.
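The general idea the abstract refers to, k-clique listing driven by degree ordering and set intersections, can be illustrated with a minimal sketch. This is not the authors' SDegree or BitCol; it is a generic ordering-based listing routine under stated assumptions: vertices are labelled 0..n-1, the graph is simple and undirected, and Python integers used as bitmaps stand in for the SIMD-accelerated intersections the abstract mentions. The names `list_k_cliques` and `extend` are hypothetical.

```python
def list_k_cliques(n, edges, k):
    """List all k-cliques of a simple undirected graph on vertices 0..n-1.

    Illustrative sketch only: neighbour sets are stored as integer bitmaps,
    so each set intersection is a single bitwise AND.
    """
    # Rank vertices by (degree, id) and orient each edge from lower to higher rank.
    degree = [0] * n
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    order = sorted(range(n), key=lambda v: (degree[v], v))
    rank = {v: i for i, v in enumerate(order)}

    # out[r] is the bitmap of higher-ranked neighbours of the vertex with rank r.
    out = [0] * n
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops (assumption: simple graph)
        a, b = sorted((rank[u], rank[v]))
        out[a] |= 1 << b

    cliques = []

    def extend(partial, candidates, remaining):
        # partial: ranks chosen so far; candidates: bitmap of ranks adjacent to all of them.
        if remaining == 0:
            cliques.append(tuple(sorted(order[r] for r in partial)))
            return
        while candidates:
            low = candidates & -candidates   # isolate the lowest set bit
            r = low.bit_length() - 1
            candidates ^= low
            # Set intersection as one AND; out[r] only holds ranks above r.
            extend(partial + [r], candidates & out[r], remaining - 1)

    extend([], (1 << n) - 1, k)
    return cliques


if __name__ == "__main__":
    # A 4-clique on vertices 0..3 with a pendant vertex 4 attached to vertex 3.
    edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4)]
    print(sorted(list_k_cliques(5, edges, 3)))
```

Because every edge is oriented from lower to higher rank, each clique is reported exactly once. In a compiled implementation the bitwise AND would operate on fixed-width words or vector registers rather than arbitrary-precision integers.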
Pages: 1955-1968
Page count: 14
Related papers
50 in total
  • [1] Lightning Fast and Space Efficient k-clique Counting
    Ye, Xiaowei
    Li, Rong-Hua
    Dai, Qiangqiang
    Chen, Hongzhi
    Wang, Guoren
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1191 - 1202
  • [2] The Binomial k-Clique
    Narayana, Nithya Sai
    Sane, Sharad
    JOURNAL OF COMBINATORIAL DESIGNS, 2013, 21 (01) : 36 - 45
  • [3] An Efficient movie recommendation algorithm based on improved k-clique
    Vilakone, Phonexay
    Park, Doo-Soon
    Xinchang, Khamphaphone
    Hao, Fei
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [4] Faster Combinatorial k-Clique Algorithms
    Abboud, Amir
    Fischer, Nick
    Shechter, Yarin
    LATIN 2024: THEORETICAL INFORMATICS, PT I, 2024, 14578 : 193 - 206
  • [5] The K-clique Densest Subgraph Problem
    Tsourakakis, Charalampos E.
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 1122 - 1132
  • [6] Efficient similarity search in protein structure databases by k-clique hashing
    Weskamp, N
    Kuhn, D
    Hüllermeier, E
    Klebe, G
    BIOINFORMATICS, 2004, 20 (10) : 1522 - 1526
  • [7] Parallel K-clique Counting on GPUs
    Almasri, Mohammad
    El Hajj, Izzat
    Nagi, Rakesh
    Xiong, Jinjun
    Hwu, Wen-Mei
    PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
  • [8] The dynamic descriptive complexity of k-clique
    Zeume, Thomas
    INFORMATION AND COMPUTATION, 2017, 256 : 9 - 22
  • [9] Efficient Distributed k-Clique Mining for Large Networks Using MapReduce
    Shahrivari, Saeed
    Jalili, Saeed
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (03) : 964 - 974
  • [10] The Dynamic Descriptive Complexity of k-Clique
    Zeume, Thomas
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2014, PT I, 2014, 8634 : 547 - 558