Fraud-Agents Detection in Online Microfinance: A Large-Scale Empirical Study

被引:6
|
作者
Wu, Yiming [1 ]
Xie, Zhiyuan [1 ]
Ji, Shouling [1 ]
Liu, Zhenguang [2 ]
Zhang, Xuhong [3 ]
Lin, Changting [4 ]
Deng, Shuiguang [1 ]
Zhou, Jun [5 ]
Wang, Ting [6 ]
Beyah, Raheem [7 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] Zhejiang Gongshang Univ, Dept Comp Sci, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Zhejiang, Peoples R China
[4] Zhejiang Univ, Binjiang Inst, Hangzhou 310027, Zhejiang, Peoples R China
[5] Ant Grp, Hangzhou 310000, Zhejiang, Peoples R China
[6] Penn State Univ, Coll Informat Sci & Technol, State Coll, PA 16801 USA
[7] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
History; Feature extraction; Wireless fidelity; Social networking (online); Systematics; Peer-to-peer computing; Computer science; Fraud-agents detection; online lending; Index Terms; empirical study; CREDIT; RISK;
D O I
10.1109/TDSC.2022.3151132
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online Microlending, a new financial service, focuses on small loans without any sort of collateral. It provides more flexible and quicker funding for borrowers, as well as higher interest rates of return. For platforms that provide such services, an essential task is to adequately evaluate each loan's risk so as to minimize the possible financial loss. However, there exists a special group of borrowers, namely fraud-agents, who gain illegal profits from inciting other borrowers to cheat, i.e., they help the high-risk borrowers evade the risk evaluation by crafting fake personal information. The existence of fraud-agents poses a severe threat to the risk management systems and results in a huge financial loss for lending platforms. In this article, we present the first machine learning-based solution to detect fraud-agents in online microlending. The key challenge of this decade-long problem is that it is unclear how to construct effective features from multiple behavior logs such as phone call history, address book, loan history and activity logs of borrowers. To address this problem, we first conduct an empirical study on over 600K borrowers to gain some insights on the adversarial behaviors of fraud-agents comparing to normal borrowers and benign-agents. Based on the study, we are able to design a total of 26 features, falling into four groups, for fraud agent detection. Then, we propose a two-stage detection model to address the challenge of limited number of labeled fraud agent examples. The evaluation results show that our method can achieve a precision of 94.30%. We deploy our method on a real large online microlending platform with 11,953,273 borrowers, and we identify 29,727 fraud-agents from them. The domain experts from the platform confirm that 95.59% of them are real fraud-agents, and have added them to the platform's internal blacklist. We further conduct a measurement study on those fraud-agents to share deeper insights on their adversarial behaviors.
引用
收藏
页码:1169 / 1185
页数:17
相关论文
共 50 条
  • [1] Incremental collusive fraud detection in large-scale online auction networks
    Dadfarnia, Mahila
    Adibnia, Fazlollah
    Abadi, Mahdi
    Dorri, Ali
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (09): : 7416 - 7437
  • [2] Incremental collusive fraud detection in large-scale online auction networks
    Mahila Dadfarnia
    Fazlollah Adibnia
    Mahdi Abadi
    Ali Dorri
    The Journal of Supercomputing, 2020, 76 : 7416 - 7437
  • [3] Online E-Commerce Fraud: A Large-scale Detection and Analysis
    Weng, Haiqin
    Li, Zhao
    Ji, Shouling
    Chu, Chen
    Lu, Haifeng
    Du, Tianyu
    He, Qinming
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1435 - 1440
  • [4] Fraud Detection Using Large-scale Imbalance Dataset
    Rubaidi, Zainab Saad
    Ben Ammar, Boulbaba
    Ben Aouicha, Mohamed
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (08)
  • [5] A Large-Scale Empirical Study of Conficker
    Shin, Seungwon
    Gu, Guofei
    Reddy, Narasimha
    Lee, Christopher P.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (02) : 676 - 690
  • [6] A graph-powered large-scale fraud detection system
    Zhao Li
    Biao Wang
    Jiaming Huang
    Yilun Jin
    Zenghui Xu
    Ji Zhang
    Jianliang Gao
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 115 - 128
  • [7] A graph-powered large-scale fraud detection system
    Li, Zhao
    Wang, Biao
    Huang, Jiaming
    Jin, Yilun
    Xu, Zenghui
    Zhang, Ji
    Gao, Jianliang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (01) : 115 - 128
  • [8] A Large-Scale Empirical Study of Security Patches
    Li, Frank
    Paxson, Vern
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 2201 - 2215
  • [9] Conficker and Beyond: A Large-Scale Empirical Study
    Shin, Seungwon
    Gu, Guofei
    26TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2010), 2010, : 151 - 160
  • [10] An Empirical Study of a Large Scale Online Recommendation System
    Fu, Huazheng
    Chen, Kang
    Ding, Jianbing
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2015 WORKSHOPS, 2015, 9461 : 15 - 25