BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

被引:0
|
作者
Liu, Kay [1 ]
Dou, Yingtong [1 ,8 ]
Zhao, Yue [2 ]
Ding, Xueying [2 ]
Hu, Xiyang [2 ]
Zhang, Ruitong [3 ]
Ding, Kaize [4 ]
Chen, Canyu [5 ]
Peng, Hao
Shu, Kai [5 ]
Sun, Lichao [6 ]
Li, Jundong [7 ]
Chen, George H. [2 ]
Jia, Zhihao [2 ]
Yu, Philip S. [1 ]
机构
[1] Univ Illinois, Chicago, IL 60680 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Beihang Univ, Beijing, Peoples R China
[4] Arizona State Univ, Tempe, AZ USA
[5] IIT, Chicago, IL USA
[6] Lehigh Univ, Bethlehem, PA USA
[7] Univ Virginia, Charlottesville, VA USA
[8] Visa Res, Foster City, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting which nodes in graphs are outliers is a relatively new machine learning task with numerous applications. Despite the proliferation of algorithms developed in recent years for this task, there has been no standard comprehensive setting for performance evaluation. Consequently, it has been difficult to understand which methods work well and when under a broad range of settings. To bridge this gap, we present-to the best of our knowledge-the first comprehensive benchmark for unsupervised outlier node detection on static attributed graphs called BOND, with the following highlights. (1) We benchmark the outlier detection performance of 14 methods ranging from classical matrix factorization to the latest graph neural networks. (2) Using nine real datasets, our benchmark assesses how the different detection methods respond to two major types of synthetic outliers and separately to "organic" (real non-synthetic) outliers. (3) Using an existing random graph generation technique, we produce a family of synthetically generated datasets of different graph sizes that enable us to compare the running time and memory usage of the different outlier detection algorithms. Based on our experimental results, we discuss the pros and cons of existing graph outlier detection algorithms, and we highlight opportunities for future research. Importantly, our code is freely available and meant to be easily extendable: https://github.com/pygod-team/pygod/tree/main/benchmark
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Unsupervised Outlier Detection Technique for Intrusion Detection in Cloud Computing
    Kumar, Manoj
    Mathur, Robin
    2014 INTERNATIONAL CONFERENCE FOR CONVERGENCE OF TECHNOLOGY (I2CT), 2014,
  • [32] Anomaly Based Network Intrusion Detection with Unsupervised Outlier Detection
    Zhang, Jiong
    Zulkernine, Mohammad
    2006 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-12, 2006, : 2388 - 2393
  • [33] Unsupervised outlier detection in heavy-ion collisions
    Thaprasop, P.
    Zhou, K.
    Steinheimer, J.
    Herold, C.
    PHYSICA SCRIPTA, 2021, 96 (06)
  • [34] Graph autoencoder-based unsupervised outlier detection
    Du, Xusheng
    Yu, Jiong
    Chu, Zheng
    Jin, Lina
    Chen, Jiaying
    INFORMATION SCIENCES, 2022, 608 : 532 - 550
  • [35] Unsupervised Outlier Detection Using Memory and Contrastive Learning
    Huyan, Ning
    Quan, Dou
    Zhang, Xiangrong
    Liang, Xuefeng
    Chanussot, Jocelyn
    Jiao, Licheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6440 - 6454
  • [36] Unsupervised Outlier Detection Mechanism for Tea Traceability Data
    Yang, Honggang
    Li, Shaowen
    Tu, Lijing
    Ma, Rongrong
    Chen, Yin
    IEEE ACCESS, 2022, 10 : 94818 - 94831
  • [37] An outlier ensemble for unsupervised anomaly detection in honeypots data
    Boukela, Lynda
    Zhang, Gongxuan
    Bouzefrane, Samia
    Zhou, Junlong
    INTELLIGENT DATA ANALYSIS, 2020, 24 (04) : 743 - 758
  • [38] Unsupervised Outlier Detection via Transformation Invariant Autoencoder
    Cheng, Zhen
    Zhu, En
    Wang, Siqi
    Zhang, Pei
    Li, Wang
    IEEE ACCESS, 2021, 9 : 43991 - 44002
  • [39] Unsupervised Outlier Detection in IOT Using Deep VAE
    Gouda, Walaa
    Tahir, Sidra
    Alanazi, Saad
    Almufareh, Maram
    Alwakid, Ghadah
    SENSORS, 2022, 22 (17)
  • [40] UNSUPERVISED ANOMALY DETECTION FOR TIME SERIES WITH OUTLIER EXPOSURE
    Feng, Jiaming
    Huang, Zheng
    Guo, Jie
    Qiu, Weidong
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 1 - 12