BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

Cited by: 0
Authors
Liu, Kay [1]
Dou, Yingtong [1, 8]
Zhao, Yue [2]
Ding, Xueying [2]
Hu, Xiyang [2]
Zhang, Ruitong [3]
Ding, Kaize [4]
Chen, Canyu [5]
Peng, Hao
Shu, Kai [5]
Sun, Lichao [6]
Li, Jundong [7]
Chen, George H. [2]
Jia, Zhihao [2]
Yu, Philip S. [1]
Affiliations
[1] Univ Illinois, Chicago, IL 60680 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Beihang Univ, Beijing, Peoples R China
[4] Arizona State Univ, Tempe, AZ USA
[5] IIT, Chicago, IL USA
[6] Lehigh Univ, Bethlehem, PA USA
[7] Univ Virginia, Charlottesville, VA USA
[8] Visa Res, Foster City, CA USA
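Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract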
Detecting which nodes in graphs are outliers is a relatively new machine learning task with numerous applications. Despite the proliferation of algorithms developed in recent years for this task, there has been no standard comprehensive setting for performance evaluation. Consequently, it has been difficult to understand which methods work well and when under a broad range of settings. To bridge this gap, we present, to the best of our knowledge, the first comprehensive benchmark for unsupervised outlier node detection on static attributed graphs, called BOND, with the following highlights. (1) We benchmark the outlier detection performance of 14 methods ranging from classical matrix factorization to the latest graph neural networks. (2) Using nine real datasets, our benchmark assesses how the different detection methods respond to two major types of synthetic outliers and separately to "organic" (real non-synthetic) outliers. (3) Using an existing random graph generation technique, we produce a family of synthetically generated datasets of different graph sizes that enable us to compare the running time and memory usage of the different outlier detection algorithms. Based on our experimental results, we discuss the pros and cons of existing graph outlier detection algorithms, and we highlight opportunities for future research. Importantly, our code is freely available and meant to be easily extendable: https://github.com/pygod-team/pygod/tree/main/benchmark
Pages: 15