Fraud-Agents Detection in Online Microfinance: A Large-Scale Empirical Study

被引：6

作者：

Wu, Yiming ^{[1
]}

Xie, Zhiyuan ^{[1
]}

Ji, Shouling ^{[1
]}

Liu, Zhenguang ^{[2
]}

Zhang, Xuhong ^{[3
]}

Lin, Changting ^{[4
]}

Deng, Shuiguang ^{[1
]}

Zhou, Jun ^{[5
]}

Wang, Ting ^{[6
]}

Beyah, Raheem ^{[7
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China

[2] Zhejiang Gongshang Univ, Dept Comp Sci, Hangzhou 310018, Zhejiang, Peoples R China

[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Zhejiang, Peoples R China

[4] Zhejiang Univ, Binjiang Inst, Hangzhou 310027, Zhejiang, Peoples R China

[5] Ant Grp, Hangzhou 310000, Zhejiang, Peoples R China

[6] Penn State Univ, Coll Informat Sci & Technol, State Coll, PA 16801 USA

[7] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING | 2023年 / 20卷 / 02期

关键词：

History; Feature extraction; Wireless fidelity; Social networking (online); Systematics; Peer-to-peer computing; Computer science; Fraud-agents detection; online lending; Index Terms; empirical study; CREDIT; RISK;

D O I：

10.1109/TDSC.2022.3151132

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Online Microlending, a new financial service, focuses on small loans without any sort of collateral. It provides more flexible and quicker funding for borrowers, as well as higher interest rates of return. For platforms that provide such services, an essential task is to adequately evaluate each loan's risk so as to minimize the possible financial loss. However, there exists a special group of borrowers, namely fraud-agents, who gain illegal profits from inciting other borrowers to cheat, i.e., they help the high-risk borrowers evade the risk evaluation by crafting fake personal information. The existence of fraud-agents poses a severe threat to the risk management systems and results in a huge financial loss for lending platforms. In this article, we present the first machine learning-based solution to detect fraud-agents in online microlending. The key challenge of this decade-long problem is that it is unclear how to construct effective features from multiple behavior logs such as phone call history, address book, loan history and activity logs of borrowers. To address this problem, we first conduct an empirical study on over 600K borrowers to gain some insights on the adversarial behaviors of fraud-agents comparing to normal borrowers and benign-agents. Based on the study, we are able to design a total of 26 features, falling into four groups, for fraud agent detection. Then, we propose a two-stage detection model to address the challenge of limited number of labeled fraud agent examples. The evaluation results show that our method can achieve a precision of 94.30%. We deploy our method on a real large online microlending platform with 11,953,273 borrowers, and we identify 29,727 fraud-agents from them. The domain experts from the platform confirm that 95.59% of them are real fraud-agents, and have added them to the platform's internal blacklist. We further conduct a measurement study on those fraud-agents to share deeper insights on their adversarial behaviors.

引用

页码：1169 / 1185

页数：17

共 50 条

[21] Detection of fickle trolls in large-scale online social networks
Hossein Shafiei
Aresh Dadlani
Journal of Big Data, 9
[22] A Empirical Study of Programming Behaviours on Large Scale Online Learning
Cao, Menghua
Zhang, Qian
14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 889 - 894
[23] FeatNet: Large-scale Fraud Device Detection by Network Representation Learning with Rich Features
Xu, Chao
Feng, Zhentan
Chen, Yizheng
Wang, Minghua
Wei, Tao
AISEC'18: PROCEEDINGS OF THE 11TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2018, : 57 - 63
[24] Fraud detection on large scale social networks
Sylla, Yaya
Morizet-Mahoudeaux, Pierre
Brobst, Stephen
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 413 - +
[25] A Large-Scale Empirical Study on Semantic Versioning in Golang Ecosystem
Li, Wenke
Wu, Feng
Fu, Cai
Zhou, Fan
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1604 - 1614
[26] A Large-Scale Empirical Study of Aligned Time Series Forecasting
Pilyugina, Polina
Medvedeva, Svetlana
Mosievich, Kirill
Trofimov, Ilya
Kostromina, Alina
Simakov, Dmitry
Burnaev, Evgeny
IEEE ACCESS, 2024, 12 : 131100 - 131121
[27] Software testing and Android applications: a large-scale empirical study
Pecorelli, Fabiano
Catolino, Gemma
Ferrucci, Filomena
De Lucia, Andrea
Palomba, Fabio
EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (02)
[28] A Large-Scale Empirical Study of Compiler Errors in Continuous Integration
Zhang, Chen
Chen, Bihuan
Chen, Linlin
Peng, Xin
Zhao, Wenyun
ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 176 - 187
[29] A large-scale empirical study of code smells in JavaScript projects
David Johannes
Foutse Khomh
Giuliano Antoniol
Software Quality Journal, 2019, 27 : 1271 - 1314
[30] Software testing and Android applications: a large-scale empirical study
Fabiano Pecorelli
Gemma Catolino
Filomena Ferrucci
Andrea De Lucia
Fabio Palomba
Empirical Software Engineering, 2022, 27

← 1 2 3 4 5 →