Predicting accounting fraud using imbalanced ensemble learning classifiers - evidence from China

被引:8
|
作者
Rahman, Md Jahidur [1 ]
Zhu, Hongtao [2 ]
机构
[1] Wenzhou Kean Univ, Wenzhou, Peoples R China
[2] Univ Edinburgh, Edinburgh, Scotland
来源
ACCOUNTING AND FINANCE | 2023年 / 63卷 / 03期
关键词
Accounting fraud detection; Artificial intelligence; China A-share; CUSBoost; Ensemble learning algorithms; Machine learning; RUSBoost; FINANCIAL STATEMENT FRAUD; BANKRUPTCY PREDICTION; DECISION TREE; MACHINE; CLASSIFICATION; COMPENSATION; GOVERNANCE; REGRESSION; ARTICLE; FUSION;
D O I
10.1111/acfi.13044
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
The current research aims to launch effective accounting fraud detection models using imbalanced ensemble learning algorithms for China A-Share listed firms. Based on a sample of 33,544 Chinese firm-year instances from 1998 to 2017, this research respectively established one logistic regression and four ensemble learning classifiers (AdaBoost, XGBoost, CUSBoost, and RUSBoost) by 12 financial ratios and 28 raw financial data. Additionally, we divided the sample into the train and test observations to evaluate the classifiers' out-of-sample performance. In detail, we applied two metrics, namely, Area under the ROC (receiver operating characteristic) curve (AUC) and Area under the Precision-Recall curve (AUPR), to evaluate classifiers' discriminability. In the supplement test, this study put forward an algebraic fused model on the basis of the four ensemble learning classifiers and introduced the sliding window technique. The empirical results showed that the ensemble learning classifiers can detect accounting fraud for the imbalanced China A-listed firms far more effectively than the logistic regression model. Moreover, imbalanced ensemble learning classifiers (CUSBoost and RUSBoost) effectively performed better than the common ensemble learning models (AdaBoost and XGBoost) in average. The algebraic fused model in the supplement test also obtained the highest average AUC and AUPR among all the employed algorithms. Our results offer firm support for the potential role of Machine Learning (ML)-based Artificial Intelligence (AI) approaches in reliably predicting accounting fraud with high accuracy. Similarly, for the Chinese settings, our ML-based AI offers utmost advantage in forecasting accounting fraud. Finally, this paper fills the research gap on the applications of imbalanced ensemble learning in accounting fraud detection for Chinese listed firms.
引用
收藏
页码:3455 / 3486
页数:32
相关论文
共 50 条
  • [21] On effectively predicting autism spectrum disorder therapy using an ensemble of classifiers
    Bhekisipho Twala
    Eamon Molloy
    [J]. Scientific Reports, 13 (1)
  • [22] On effectively predicting autism spectrum disorder therapy using an ensemble of classifiers
    Twala, Bhekisipho
    Molloy, Eamon
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01):
  • [23] Using ensemble of classifiers for predicting HIV protease cleavage sites in proteins
    Nanni, Loris
    Lumini, Alessandra
    [J]. AMINO ACIDS, 2009, 36 (03) : 409 - 416
  • [24] Using ensemble of classifiers for predicting HIV protease cleavage sites in proteins
    Loris Nanni
    Alessandra Lumini
    [J]. Amino Acids, 2009, 36 : 409 - 416
  • [25] Predicting hospital associated disability from imbalanced data using supervised learning
    Saarela, Mirka
    Ryynanen, Olli-Pekka
    Ayramo, Sami
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 95 : 88 - 95
  • [26] Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers
    Yu, Hualong
    Hong, Shufang
    Yang, Xibei
    Ni, Jun
    Dan, Yuanyuan
    Qin, Bin
    [J]. BIOMED RESEARCH INTERNATIONAL, 2013, 2013
  • [27] Predicting lung adenocarcinoma disease progression using methylation-correlated blocks and ensemble machine learning classifiers
    Yu, Xin
    Yang, Qian
    Wang, Dong
    Li, Zhaoyang
    Chen, Nianhang
    Kong, De-Xin
    [J]. PEERJ, 2021, 9
  • [28] Detection of Image Steganography Using Deep Learning and Ensemble Classifiers
    Plachta, Mikolaj
    Krzemien, Marek
    Szczypiorski, Krzysztof
    Janicki, Artur
    [J]. ELECTRONICS, 2022, 11 (10)
  • [29] Mining Smart Learning Analytics Data Using Ensemble Classifiers
    Kausar, Samina
    Oyelere, Solomon Sunday
    Salal, Yass Khudheir
    Hussain, Sadiq
    Cifci, Mehmet Akif
    Hilcenko, Slavoljub
    Iqbal, Muhammad Shahid
    Zhu Wenhao
    Xu Huahu
    [J]. INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2020, 15 (12) : 81 - 102
  • [30] Predicting Fraud Victimization Using Classical Machine Learning
    Lokanan, Mark
    Liu, Susan
    [J]. ENTROPY, 2021, 23 (03) : 1 - 19