BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

被引:0
|
作者
Liu, Kay [1 ]
Dou, Yingtong [1 ,8 ]
Zhao, Yue [2 ]
Ding, Xueying [2 ]
Hu, Xiyang [2 ]
Zhang, Ruitong [3 ]
Ding, Kaize [4 ]
Chen, Canyu [5 ]
Peng, Hao
Shu, Kai [5 ]
Sun, Lichao [6 ]
Li, Jundong [7 ]
Chen, George H. [2 ]
Jia, Zhihao [2 ]
Yu, Philip S. [1 ]
机构
[1] Univ Illinois, Chicago, IL 60680 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Beihang Univ, Beijing, Peoples R China
[4] Arizona State Univ, Tempe, AZ USA
[5] IIT, Chicago, IL USA
[6] Lehigh Univ, Bethlehem, PA USA
[7] Univ Virginia, Charlottesville, VA USA
[8] Visa Res, Foster City, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting which nodes in graphs are outliers is a relatively new machine learning task with numerous applications. Despite the proliferation of algorithms developed in recent years for this task, there has been no standard comprehensive setting for performance evaluation. Consequently, it has been difficult to understand which methods work well and when under a broad range of settings. To bridge this gap, we present-to the best of our knowledge-the first comprehensive benchmark for unsupervised outlier node detection on static attributed graphs called BOND, with the following highlights. (1) We benchmark the outlier detection performance of 14 methods ranging from classical matrix factorization to the latest graph neural networks. (2) Using nine real datasets, our benchmark assesses how the different detection methods respond to two major types of synthetic outliers and separately to "organic" (real non-synthetic) outliers. (3) Using an existing random graph generation technique, we produce a family of synthetically generated datasets of different graph sizes that enable us to compare the running time and memory usage of the different outlier detection algorithms. Based on our experimental results, we discuss the pros and cons of existing graph outlier detection algorithms, and we highlight opportunities for future research. Importantly, our code is freely available and meant to be easily extendable: https://github.com/pygod-team/pygod/tree/main/benchmark
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Interaction-Focused Anomaly Detection on Bipartite Node-and-Edge-Attributed Graphs
    Fathony, Rizal
    Ng, Jenn
    Chen, Jia
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [22] Multivariate functional outlier detection using the fast massive unsupervised outlier detection indices
    Ojo, Oluwasegun Taiwo
    Anta, Antonio Fernandez
    Genton, Marc G.
    Lillo, Rosa E.
    STAT, 2023, 12 (01):
  • [23] Generative adversarial nets for unsupervised outlier detection
    Du, Xusheng
    Chen, Jiaying
    Yu, Jiong
    Li, Shu
    Tan, Qiyin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 236
  • [24] On normalization and algorithm selection for unsupervised outlier detection
    Kandanaarachchi, Sevvandi
    Munoz, Mario A.
    Hyndman, Rob J.
    Smith-Miles, Kate
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (02) : 309 - 354
  • [25] An Unsupervised Boosting Strategy for Outlier Detection Ensembles
    Campos, Guilherme O.
    Zimek, Arthur
    Meira, Wagner, Jr.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 564 - 576
  • [26] On normalization and algorithm selection for unsupervised outlier detection
    Sevvandi Kandanaarachchi
    Mario A. Muñoz
    Rob J. Hyndman
    Kate Smith-Miles
    Data Mining and Knowledge Discovery, 2020, 34 : 309 - 354
  • [27] Unsupervised outlier detection in quality control: an overview
    Archimbaud, Aurore
    JOURNAL OF THE SFDS, 2018, 159 (03): : 1 - 39
  • [28] Unsupervised Sequential Outlier Detection With Deep Architectures
    Lu, Weining
    Cheng, Yu
    Xiao, Cao
    Chang, Shiyu
    Huang, Shuai
    Liang, Bin
    Huang, Thomas
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (09) : 4321 - 4330
  • [29] Detection of Contextual Anomalies in Attributed Graphs
    Vaudaine, Remi
    Jeudy, Baptiste
    Largeron, Christine
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 338 - 349
  • [30] Incorporating Network Structure with Node Information for Semi-supervised Anomaly Detection on Attributed Graphs
    Chen, Bofeng
    Li, Jingdong
    Lu, Xingjian
    Sha, Chaofeng
    Wang, Xiaoling
    Zhang, Ji
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2021, PT I, 2021, 13080 : 242 - 257