Anomaly Detection Based on GCNs and DBSCAN in a Large-Scale Graph

被引:0
|
作者
Emane, Christopher Retiti Diop [1 ]
Song, Sangho [1 ]
Lee, Hyeonbyeong [1 ]
Choi, Dojin [2 ]
Lim, Jongtae [1 ]
Bok, Kyoungsoo [3 ]
Yoo, Jaesoo [1 ]
机构
[1] Chungbuk Natl Univ, Dept Informat & Commun Engn, Chungdae ro 1, Cheongju 28644, South Korea
[2] Changwon Natl Univ, Dept Comp Engn, Changwondaehak ro 20, Chang Won 51140, South Korea
[3] Wonkwang Univ, Dept Artificial Intelligence Convergence, Iksandae 460, Iksan 54538, South Korea
基金
新加坡国家研究基金会;
关键词
anomaly detection; GCNs; DBSCAN; deep learning; clustering algorithms; large-scale graph;
D O I
10.3390/electronics13132625
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection is critical across domains, from cybersecurity to fraud prevention. Graphs, adept at modeling intricate relationships, offer a flexible framework for capturing complex data structures. This paper proposes a novel anomaly detection approach, combining Graph Convolutional Networks (GCNs) and Density-Based Spatial Clustering of Applications with Noise (DBSCAN). GCNs, a specialized deep learning model for graph data, extracts meaningful node and edge representations by incorporating graph topology and attribute information. This facilitates learning expressive node embeddings capturing local and global structural patterns. For anomaly detection, DBSCAN, a density-based clustering algorithm effective in identifying clusters of varying densities amidst noise, is employed. By defining a minimum distance threshold and a minimum number of points within that distance, DBSCAN proficiently distinguishes normal graph elements from anomalies. Our approach involves training a GCN model on a labeled graph dataset, generating appropriately labeled node embeddings. These embeddings serve as input to DBSCAN, identifying clusters and isolating anomalies as noise points. The evaluation on benchmark datasets highlights the superior performance of our approach in anomaly detection compared to traditional methods. The fusion of GCNs and DBSCAN demonstrates a significant potential for accurate and efficient anomaly detection in graphs. This research contributes to advancing graph-based anomaly detection, with promising applications in domains where safeguarding data integrity and security is paramount.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Research on a large-scale community detection algorithm based on non-weighted graph
    Ai Fang-ju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S2555 - S2562
  • [32] Scalable and Efficient Flow-Based Community Detection for Large-Scale Graph Analysis
    Bae, Seung-Hee
    Halperin, Daniel
    West, Jevin D.
    Rosvall, Martin
    Howe, Bill
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [33] Graph-Based Wrong IsA Relation Detection in a Large-Scale Lexical Taxonomy
    Liang, Jiaqing
    Xiao, Yanghua
    Zhang, Yi
    Hwang, Seung-won
    Wang, Haixun
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1178 - 1184
  • [34] Research on a large-scale community detection algorithm based on non-weighted graph
    AI Fang-ju
    Cluster Computing, 2019, 22 : 2555 - 2562
  • [35] A large-scale graph search algorithms Based LRU
    Xiao Li
    Xiao Jing-Zhong
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 3, 2011, : 72 - 75
  • [36] Constant Time EXPected Similarity Estimation for Large-Scale Anomaly Detection
    Schneider, Markus
    Ertel, Wolfgang
    Palm, Guenther
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 12 - 20
  • [37] Expected similarity estimation for large-scale batch and streaming anomaly detection
    Schneider, Markus
    Ertel, Wolfgang
    Ramos, Fabio
    MACHINE LEARNING, 2016, 105 (03) : 305 - 333
  • [38] Higher-Order PCA for Anomaly Detection in Large-Scale Networks
    Kim, Hayang
    Lee, Sungeun
    Ma, Xiaoli
    Wang, Chao
    2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2009, : 85 - 88
  • [39] Density-preserving projections for large-scale local anomaly detection
    Timothy de Vries
    Sanjay Chawla
    Michael E. Houle
    Knowledge and Information Systems, 2012, 32 : 25 - 52
  • [40] Density-preserving projections for large-scale local anomaly detection
    de Vries, Timothy
    Chawla, Sanjay
    Houle, Michael E.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 25 - 52