Predicting accounting fraud using imbalanced ensemble learning classifiers - evidence from China

被引:8
|
作者
Rahman, Md Jahidur [1 ]
Zhu, Hongtao [2 ]
机构
[1] Wenzhou Kean Univ, Wenzhou, Peoples R China
[2] Univ Edinburgh, Edinburgh, Scotland
来源
ACCOUNTING AND FINANCE | 2023年 / 63卷 / 03期
关键词
Accounting fraud detection; Artificial intelligence; China A-share; CUSBoost; Ensemble learning algorithms; Machine learning; RUSBoost; FINANCIAL STATEMENT FRAUD; BANKRUPTCY PREDICTION; DECISION TREE; MACHINE; CLASSIFICATION; COMPENSATION; GOVERNANCE; REGRESSION; ARTICLE; FUSION;
D O I
10.1111/acfi.13044
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
The current research aims to launch effective accounting fraud detection models using imbalanced ensemble learning algorithms for China A-Share listed firms. Based on a sample of 33,544 Chinese firm-year instances from 1998 to 2017, this research respectively established one logistic regression and four ensemble learning classifiers (AdaBoost, XGBoost, CUSBoost, and RUSBoost) by 12 financial ratios and 28 raw financial data. Additionally, we divided the sample into the train and test observations to evaluate the classifiers' out-of-sample performance. In detail, we applied two metrics, namely, Area under the ROC (receiver operating characteristic) curve (AUC) and Area under the Precision-Recall curve (AUPR), to evaluate classifiers' discriminability. In the supplement test, this study put forward an algebraic fused model on the basis of the four ensemble learning classifiers and introduced the sliding window technique. The empirical results showed that the ensemble learning classifiers can detect accounting fraud for the imbalanced China A-listed firms far more effectively than the logistic regression model. Moreover, imbalanced ensemble learning classifiers (CUSBoost and RUSBoost) effectively performed better than the common ensemble learning models (AdaBoost and XGBoost) in average. The algebraic fused model in the supplement test also obtained the highest average AUC and AUPR among all the employed algorithms. Our results offer firm support for the potential role of Machine Learning (ML)-based Artificial Intelligence (AI) approaches in reliably predicting accounting fraud with high accuracy. Similarly, for the Chinese settings, our ML-based AI offers utmost advantage in forecasting accounting fraud. Finally, this paper fills the research gap on the applications of imbalanced ensemble learning in accounting fraud detection for Chinese listed firms.
引用
收藏
页码:3455 / 3486
页数:32
相关论文
共 50 条
  • [11] Predicting Fraud in Mobile Money Transactions using Machine Learning: The Effects of Sampling Techniques on the Imbalanced Dataset
    Botchey, Francis E.
    Qin, Zhen
    Hughes-Lartey, Kwesi
    Ampomah, Kwame E.
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07): : 45 - 56
  • [12] Imbalanced Ensemble Classifier for Learning from Imbalanced Business School Dataset
    Chakraborty, Tanujit
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2019, 4 (04) : 861 - 869
  • [13] Machine Learning Classifiers for Predicting Transit Fraud Emergent Research Forum (ERF)
    Claiborne, Jay
    Gupta, Ashish
    [J]. AMCIS 2018 PROCEEDINGS, 2018,
  • [14] Accounting fraud detection using contextual language learning
    Bhattacharya, Indranil
    Mickovic, Ana
    [J]. INTERNATIONAL JOURNAL OF ACCOUNTING INFORMATION SYSTEMS, 2024, 53
  • [15] Detection of Wangiri Telecommunication Fraud Using Ensemble Learning
    Arafat, Mais
    Qusef, Abdallah
    Sammour, George
    [J]. 2019 IEEE JORDAN INTERNATIONAL JOINT CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION TECHNOLOGY (JEEIT), 2019, : 330 - 335
  • [16] Comparing performances and effectiveness of machine learning classifiers in detecting financial accounting fraud for Turkish SMEs
    Hamal, Serhan
    Senvar, Ozlem
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 769 - 782
  • [17] Melanoma recognition using deep learning and ensemble of classifiers
    Gil, Fabian
    Osowski, Stanislaw
    Slowinska, Monika
    [J]. 2022 23RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL PROBLEMS OF ELECTRICAL ENGINEERING (CPEE), 2022,
  • [18] A NEW ENSEMBLE LEARNING ALGORITHM USING REGIONAL CLASSIFIERS
    Lee, Byungwoo
    Choi, Sungha
    Oh, Byonghwa
    Yang, Jihoon
    Park, Sungyong
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2013, 22 (04)
  • [19] An Ensemble-Based Machine Learning for Predicting Fraud of Credit Card Transactions
    Baabdullah, Tahani
    Rawat, Danda B.
    Liu, Chunmei
    Alzahrani, Amani
    [J]. INTELLIGENT COMPUTING, VOL 2, 2022, 507 : 214 - 229
  • [20] Real Time Credit Card Fraud Detection on Huge Imbalanced Data using Meta-Classifiers
    Kavitha, M.
    Suriakala, M.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 881 - 887