Machine Learning Pipeline for Fraud Detection and Prevention in E-Commerce Transactions

被引:0
|
作者
Jhangiani, Resham [1 ]
Bein, Doina [1 ]
Verma, Abhishek [2 ]
机构
[1] Calif State Univ Fullerton, Dept Comp Sci, Fullerton, CA 92831 USA
[2] New Jersey City Univ, Dept Comp Sci, Jersey City, NJ 07305 USA
关键词
random forest; credit card fraud; support vector machine; gradient boost; logistic regression;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fraud has become a major problem in e-commerce and a lot of resources are being invested to recognize and prevent it. Present fraud detection and prevention systems are designed to prevent only a small fraction of fraudulent transactions processed, which still costs billions of dollars in loss. There is an urgent need for better fraud detection and prevention as the online transactions are estimated to increase substantially in the coming year. We propose a data driven model using machine learning algorithms on big data to predict the probability of a transaction being fraudulent or legitimate. The model was trained on historical e-commerce credit card transaction data to predict the probability of any future transaction by the customer being fraudulent. Supervised machine learning algorithms like Random Forest, Support Vector Machine, Gradient Boost and combinations of these are implemented and their performance are compared. While at the same time the problem of class imbalance is taken into consideration and techniques of oversampling and data pre-processing are performed before the model is trained on a classifier.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
  • [31] Detection of E-Commerce Fraud Review via Self-Paced Graph Contrast Learning
    Zhao, Weidong
    Liu, Xiaotong
    [J]. Computer Journal, 2024, 67 (06): : 2054 - 2065
  • [32] Quantitative Detection of Financial Fraud Based on Deep Learning with Combination of E-Commerce Big Data
    Liu, Jian
    Gu, Xin
    Shang, Chao
    [J]. COMPLEXITY, 2020, 2020
  • [33] Detection of E-Commerce Fraud Review via Self-Paced Graph Contrast Learning
    Zhao, Weidong
    Liu, Xiaotong
    [J]. COMPUTER JOURNAL, 2023, 67 (06): : 2054 - 2065
  • [34] Online E-Commerce Fraud: A Large-scale Detection and Analysis
    Weng, Haiqin
    Li, Zhao
    Ji, Shouling
    Chu, Chen
    Lu, Haifeng
    Du, Tianyu
    He, Qinming
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1435 - 1440
  • [35] Group-based Fraud Detection Network on e-Commerce Platforms
    Yu, Jianke
    Wang, Hanchen
    Wang, Xiaoyang
    Li, Zhao
    Qin, Lu
    Zhang, Wenjie
    Liao, Jian
    Zhang, Ying
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5463 - 5475
  • [36] Multiplex graph fusion network with reinforcement structure learning for fraud detection in online e-commerce platforms
    Zhang, Zheng
    Ao, Xiang
    Tessone, Claudio J.
    Liu, Gang
    Zhou, Mingyang
    Mao, Rui
    Liao, Hao
    [J]. Expert Systems with Applications, 2025, 262
  • [37] NNEnsLeG: A novel approach for e-commerce payment fraud detection using ensemble learning and neural networks
    Zeng, Qingfeng
    Lin, Li
    Jiang, Rui
    Huang, Weiyu
    Lin, Dijia
    [J]. Information Processing and Management, 2025, 62 (01):
  • [38] E-Commerce with Rich Clients and Flexible Transactions
    Clarke, Dylan
    Morgan, Graham
    [J]. FIRST INTERNATIONAL WORKSHOP ON SOFTWARE TECHNOLOGIES FOR FUTURE DEPENDABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2009, : 73 - 77
  • [39] Fraud Prediction in Pakistani E-commerce Market
    Sabih, Muhammad
    Ejaz, Mahnoor
    Quershi, Khurram Karim
    Asad, Muhammad Usman
    Gu, Jason
    Balas, Valentina E.
    Farooq, Umar
    [J]. 2021 4TH INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2021,
  • [40] A method of fraud & intrusion detection for e-payment systems in mobile e-commerce
    Venkataram, Pallapa
    Babu, B. Sathish
    Naveen, M. K.
    Samyama, Gungal G. H.
    [J]. 2007 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE, VOLS 1 AND 2, 2007, : 395 - +