Machine Learning Pipeline for Fraud Detection and Prevention in E-Commerce Transactions

被引:0
|
作者
Jhangiani, Resham [1 ]
Bein, Doina [1 ]
Verma, Abhishek [2 ]
机构
[1] Calif State Univ Fullerton, Dept Comp Sci, Fullerton, CA 92831 USA
[2] New Jersey City Univ, Dept Comp Sci, Jersey City, NJ 07305 USA
关键词
random forest; credit card fraud; support vector machine; gradient boost; logistic regression;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fraud has become a major problem in e-commerce and a lot of resources are being invested to recognize and prevent it. Present fraud detection and prevention systems are designed to prevent only a small fraction of fraudulent transactions processed, which still costs billions of dollars in loss. There is an urgent need for better fraud detection and prevention as the online transactions are estimated to increase substantially in the coming year. We propose a data driven model using machine learning algorithms on big data to predict the probability of a transaction being fraudulent or legitimate. The model was trained on historical e-commerce credit card transaction data to predict the probability of any future transaction by the customer being fraudulent. Supervised machine learning algorithms like Random Forest, Support Vector Machine, Gradient Boost and combinations of these are implemented and their performance are compared. While at the same time the problem of class imbalance is taken into consideration and techniques of oversampling and data pre-processing are performed before the model is trained on a classifier.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
  • [1] Fraud Detection in E-commerce Transactions: A Machine Learning Perspective
    Manoharan, Geetha
    Ali, S. Dada Noor Hayath
    Sathe, Manoj
    Karthik, A.
    Nagpal, Amandeep
    Sidana, Ajay
    [J]. 2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [2] Fraud Analysis and Prevention in e-Commerce Transactions
    Caldeira, Evandro
    Brandao, Gabriel
    Pereira, Adriano C. M.
    [J]. 2014 9TH LATIN AMERICAN WEB CONGRESS (LA-WEB), 2014, : 42 - 49
  • [3] Improved Fraud Detection in e-Commerce Transactions
    Shaji, Jisha
    Panchal, Dakshata
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, COMPUTING AND IT APPLICATIONS (CSCITA), 2017, : 121 - 126
  • [4] Fraud Detection using Machine Learning in e-Commerce
    Saputra, Adi
    Suharjito
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (09) : 332 - 339
  • [5] Machine Learning for Fraud Detection in E-Commerce: A Research Agenda
    Tax, Niek
    de Vries, Kees Jan
    de Jong, Mathijs
    Dosoula, Nikoleta
    van den Akker, Bram
    Smith, Jon
    Thuong, Olivier
    Bernardi, Lucas
    [J]. DEPLOYABLE MACHINE LEARNING FOR SECURITY DEFENSE, MLHAT 2021, 2021, 1482 : 30 - 54
  • [6] Fraud risks in e-commerce transactions
    Wopperer, W
    [J]. GENEVA PAPERS ON RISK AND INSURANCE-ISSUES AND PRACTICE, 2002, 27 (03): : 383 - 394
  • [7] A Multiperspective Fraud Detection Method for Multiparticipant E-Commerce Transactions
    Yu, Wangyang
    Wang, Yadi
    Liu, Lu
    An, Yisheng
    Yuan, Bo
    Panneerselvam, John
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1564 - 1576
  • [8] Fraud Detection in E-Commerce
    Alqethami, Sara
    Almutanni, Badriah
    AlGhamdi, Manal
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (06): : 200 - 206
  • [9] Fraud detection and prevention in e-commerce: A systematic literature review
    Rodrigues, Vinicius Facco
    Policarpo, Lucas Micol
    da Silveira, Diorgenes Eugenio
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    Barbosa, Jorge Luis Victoria
    Antunes, Rodolfo Stoffel
    Scorsatto, Rodrigo
    Arcot, Tanuj
    [J]. ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2022, 56
  • [10] E-Commerce Fraud Detection Based on Machine Learning Techniques: Systematic Literature Review
    Mutemi, Abed
    Bacao, Fernando
    [J]. BIG DATA MINING AND ANALYTICS, 2024, 7 (02): : 419 - 444