Predicting accounting fraud using imbalanced ensemble learning classifiers - evidence from China

被引:8
|
作者
Rahman, Md Jahidur [1 ]
Zhu, Hongtao [2 ]
机构
[1] Wenzhou Kean Univ, Wenzhou, Peoples R China
[2] Univ Edinburgh, Edinburgh, Scotland
来源
ACCOUNTING AND FINANCE | 2023年 / 63卷 / 03期
关键词
Accounting fraud detection; Artificial intelligence; China A-share; CUSBoost; Ensemble learning algorithms; Machine learning; RUSBoost; FINANCIAL STATEMENT FRAUD; BANKRUPTCY PREDICTION; DECISION TREE; MACHINE; CLASSIFICATION; COMPENSATION; GOVERNANCE; REGRESSION; ARTICLE; FUSION;
D O I
10.1111/acfi.13044
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
The current research aims to launch effective accounting fraud detection models using imbalanced ensemble learning algorithms for China A-Share listed firms. Based on a sample of 33,544 Chinese firm-year instances from 1998 to 2017, this research respectively established one logistic regression and four ensemble learning classifiers (AdaBoost, XGBoost, CUSBoost, and RUSBoost) by 12 financial ratios and 28 raw financial data. Additionally, we divided the sample into the train and test observations to evaluate the classifiers' out-of-sample performance. In detail, we applied two metrics, namely, Area under the ROC (receiver operating characteristic) curve (AUC) and Area under the Precision-Recall curve (AUPR), to evaluate classifiers' discriminability. In the supplement test, this study put forward an algebraic fused model on the basis of the four ensemble learning classifiers and introduced the sliding window technique. The empirical results showed that the ensemble learning classifiers can detect accounting fraud for the imbalanced China A-listed firms far more effectively than the logistic regression model. Moreover, imbalanced ensemble learning classifiers (CUSBoost and RUSBoost) effectively performed better than the common ensemble learning models (AdaBoost and XGBoost) in average. The algebraic fused model in the supplement test also obtained the highest average AUC and AUPR among all the employed algorithms. Our results offer firm support for the potential role of Machine Learning (ML)-based Artificial Intelligence (AI) approaches in reliably predicting accounting fraud with high accuracy. Similarly, for the Chinese settings, our ML-based AI offers utmost advantage in forecasting accounting fraud. Finally, this paper fills the research gap on the applications of imbalanced ensemble learning in accounting fraud detection for Chinese listed firms.
引用
收藏
页码:3455 / 3486
页数:32
相关论文
共 50 条
  • [41] Detecting financial statement fraud using dynamic ensemble machine learning
    Achakzai, Muhammad Atif Khan
    Peng, Juan
    [J]. INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2023, 89
  • [42] Credit Card Fraud Prediction Using XGBoost: An Ensemble Learning Approach
    Mohbey, Krishna Kumar
    Khan, Mohammad Zubair
    Indian, Ajay
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (02)
  • [43] Test-cost sensitive ensemble of classifiers using reinforcement learning
    Mirhashemi M.H.
    Anvari R.
    Barari M.
    Mozayani N.
    [J]. Revue d'Intelligence Artificielle, 2020, 34 (02) : 143 - 150
  • [44] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [45] Computer Network Intrusion Detection using various Classifiers and Ensemble Learning
    Mirza, Ali H.
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [46] Brake fault diagnosis using a voting ensemble of machine learning classifiers
    Viswanathan, Sivagurunathan
    Sridharan, Naveen Venkatesh
    Rakkiyannan, Jegadeeshwaran
    Vaithiyanathan, Sugumaran
    [J]. RESULTS IN ENGINEERING, 2024, 23
  • [47] Classification of Neurodegenerative Disease Stages using Ensemble Machine Learning Classifiers
    Rohini, M.
    Surendran, D.
    [J]. 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 66 - 73
  • [48] Predicting fraud in MD&A sections using deep learning
    Sivasubramanian, Sachin Velloor
    Skillicorn, David
    [J]. JOURNAL OF BUSINESS ANALYTICS, 2024, 7 (03) : 197 - 206
  • [49] Ensemble Learning from Imbalanced Data Set for Video Event Detection
    Yang, Yimin
    Chen, Shu-Ching
    [J]. 2015 IEEE 16TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2015, : 82 - 89
  • [50] Predicting tax fraud using supervised machine learning approach
    Murorunkwere, Belle Fille
    Haughton, Dominique
    Nzabanita, Joseph
    Kipkogei, Francis
    Kabano, Ignace
    [J]. AFRICAN JOURNAL OF SCIENCE TECHNOLOGY INNOVATION & DEVELOPMENT, 2023, 15 (06): : 731 - 742