Cost-Sensitive GNN-Based Imbalanced Learning for Mobile Social Network Fraud Detection

被引:2
|
作者
Hu, Xinxin [1 ,2 ]
Chen, Haotian [3 ]
Chen, Hongchang [1 ]
Liu, Shuxin [1 ]
Li, Xing [1 ]
Zhang, Shibo [1 ]
Wang, Yahui [1 ]
Xue, Xiangyang [4 ]
机构
[1] Natl Digital Switching Syst Engn & Technol Res Ctr, Zhengzhou 450002, Peoples R China
[2] Acad Mil Sci, Inst Syst Engn, Beijing 100091, Peoples R China
[3] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada
[4] Fudan Univ, Inst Big Data, Shanghai 200433, Peoples R China
关键词
Fraud; Telecommunications; Social networking (online); Graph neural networks; Costs; Training; Semantics; Cost-sensitive learning; fraud detection; graph imbalance; graph neural network (GNN); mobile social networks; reinforcement learning (RL);
D O I
10.1109/TCSS.2023.3302651
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the increasing prevalence of mobile social network fraud has led to significant distress and depletion of personal and social wealth, resulting in considerable economic harm. Graph neural networks (GNNs) have emerged as a popular approach to tackle this issue. However, the challenge of graph imbalance, which can greatly impede the effectiveness of GNN-based fraud detection methods, has received little attention in prior research. Thus, we are going to present a novel cost-sensitive graph neural network (CSGNN) in this article. Initially, reinforcement learning is utilized to train a suitable sampling threshold, followed by neighbor sampling based on node similarity, which helps to alleviate the graph imbalance issue preliminarily. Subsequently, message aggregation is executed on the sampled graph using GNN to obtain node embeddings. Concurrently, the optimization objective for the cost matrix is formulated using the sample histogram matrix, scatter matrix, and confusion matrix. The cost matrix and GNN are collaboratively optimized through the backpropagation algorithm. Ultimately, the derived cost-sensitive node embedding is employed for fraudulent node detection. Furthermore, this study provides a theoretical demonstration of the effectiveness of adaptive cost-sensitive learning in GNN. Extensive experiments are carried out on two publicly accessible real-world mobile network fraud datasets, revealing that the proposed CSGNN effectively addresses the graph imbalance issue while outperforming state-of-the-art algorithms in detection performance. The CSGNN code and datasets can be accessed at https://github.com/xxhu94/CSGNN.
引用
收藏
页码:2675 / 2690
页数:16
相关论文
共 50 条
  • [1] Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection
    Liu, Yang
    Ao, Xiang
    Qin, Zidi
    Chi, Jianfeng
    Feng, Jinghua
    Yang, Hao
    He, Qing
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3168 - 3177
  • [2] A Novel Cost-sensitive Capsule Network for Audit Fraud Detection
    Zhu, Feng
    Ning, D. J.
    Wang, Yu
    Liu, Shipeng
    [J]. 20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 549 - 556
  • [3] A cost-sensitive ensemble deep forest approach for extremely imbalanced credit fraud detection
    Zhao, Fang
    Li, Gang
    Lyu, Yanxia
    Ma, Hongdong
    Zhu, Xiaoqian
    [J]. QUANTITATIVE FINANCE, 2021,
  • [4] A cost-sensitive ensemble deep forest approach for extremely imbalanced credit fraud detection
    Zhao, Fang
    Li, Gang
    Lyu, Yanxia
    Ma, Hongdong
    Zhu, Xiaoqian
    [J]. QUANTITATIVE FINANCE, 2023, 23 (10) : 1397 - 1409
  • [5] Cost-Sensitive Learning based on Performance Metric for Imbalanced Data
    Aurelio, Yuri Sousa
    de Almeida, Gustavo Matheus
    de Castro, Cristiano Leite
    Braga, Antonio Padua
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (04) : 3097 - 3114
  • [6] Cost-sensitive learning for imbalanced data streams
    Loezer, Lucas
    Enembreck, Fabricio
    Barddal, Jean Paul
    Britto Jr, Alceu de Souza
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 498 - 504
  • [7] Cost-Sensitive Learning Methods for Imbalanced Data
    Nguyen Thai-Nghe
    Gantner, Zeno
    Schmidt-Thieme, Lars
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [8] Cost-Sensitive Learning based on Performance Metric for Imbalanced Data
    Yuri Sousa Aurelio
    Gustavo Matheus de Almeida
    Cristiano Leite de Castro
    Antonio Padua Braga
    [J]. Neural Processing Letters, 2022, 54 : 3097 - 3114
  • [9] Cost-Sensitive Learning for Medical Insurance Fraud Detection With Temporal Information
    Shi, Haolun
    Tayebi, Mohammad A.
    Pei, Jian
    Cao, Jiguo
    [J]. IEEE Transactions on Knowledge and Data Engineering, 2023, 35 (10) : 10451 - 10463
  • [10] Cost-sensitive learning for social network analysis and network measurement
    Zhang Xing
    Wang, Meili
    Zhang Yang
    Ning Jifeng
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 : 67 - 79