Improving fraud detection via imbalanced graph structure learning

被引:5
|
作者
Ren, Lingfei [1 ,2 ]
Hu, Ruimin [1 ,2 ,5 ]
Liu, Yang [3 ]
Li, Dengshi [1 ,4 ]
Wu, Junhang [1 ,2 ]
Zang, Yilong [1 ,2 ]
Hu, Wenyi [1 ,2 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[4] Jianghan Univ, Sch Artificial Intelligence, Wuhan, Peoples R China
[5] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Fraud detection; Graph structure learning; Homophily; Heterophily;
D O I
10.1007/s10994-023-06464-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph-based fraud detection methods have recently attracted much attention due to the rich relational information of graph-structured data, which may facilitate the detection of fraudsters. However, the GNN-based algorithms may exhibit unsatisfactory performance faced with graph heterophily as the fraudsters usually disguise themselves by deliberately making extensive connections to normal users. In addition to this, the class imbalance problem also causes GNNs to overfit normal users and perform poorly for fraudsters. To address these problems, we propose an Imbalanced Graph Structure Learning framework for fraud detection (IGSL for short). Specifically, nodes are picked with a devised multi-relational class-balanced sampler for mini-batch training. Then, an iterative graph structure learning module is proposed to iteratively construct a global homophilic adjacency matrix in the embedding domain. Further, an anchor node message passing mechanism is proposed to reduce the computational complexity of the constructing homophily adjacency matrix. Extensive experiments on benchmark datasets show that IGSL achieves significantly better performance even when the graph is heavily heterophilic and imbalanced.
引用
收藏
页码:1069 / 1090
页数:22
相关论文
共 50 条
  • [21] BTG: A Bridge to Graph machine learning in telecommunications fraud detection
    Hu, Xinxin
    Chen, Hongchang
    Liu, Shuxin
    Jiang, Haocong
    Chu, Guanghan
    Li, Ran
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 137 : 274 - 287
  • [22] Credit card fraud detection based on federated graph learning
    Tang, Yuncan
    Liang, Yongquan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [23] Anomaly Detection with Machine Learning and Graph Databases in Fraud Management
    Magomedov, Shamil
    Pavelyev, Sergei
    Ivanova, Irina
    Dobrotvorsky, Alexey
    Khrestina, Marina
    Yusubaliev, Timur
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 33 - 38
  • [24] Barely Supervised Learning for Graph-Based Fraud Detection
    Yu, Hang
    Liu, Zhengyang
    Luo, Xiangfeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16548 - 16557
  • [25] Polyp Detection via Imbalanced Learning and Discriminative Feature Learning
    Bae, Seung-Hwan
    Yoon, Kuk-Jin
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (11) : 2379 - 2393
  • [26] Graph Classification via Graph Structure Learning
    Tu Huynh
    Tuyen Thanh Thi Ho
    Bac Le
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 269 - 281
  • [27] PU GNN: Chargeback Fraud Detection in P2E MMORPGs via Graph Attention Networks with Imbalanced PU Labels
    Choi, Jiho
    Park, Junghoon
    Kim, Woocheol
    Park, Jin-Hyeok
    Suh, Yumin
    Sung, Minchang
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 243 - 258
  • [28] Detection of E-Commerce Fraud Review via Self-Paced Graph Contrast Learning
    Zhao, Weidong
    Liu, Xiaotong
    COMPUTER JOURNAL, 2023, 67 (06): : 2054 - 2065
  • [29] Fraud calls detection using class-imbalanced learning on graph structuresFraud calls detection using class-imbalanced learning...N. T. Cam, D. H. Hiep
    Nguyen Tan Cam
    Dinh Hoai Hiep
    The Journal of Supercomputing, 81 (7)
  • [30] An Effective Data Sampling Procedure for Imbalanced Data Learning on Health Insurance Fraud Detection
    Kotekani S.S.
    Velchamy I.
    Journal of Computing and Information Technology, 2020, 28 (04) : 269 - 285