Predicting fraud in MD&A sections using deep learning

被引:0
|
作者
Sivasubramanian, Sachin Velloor [1 ]
Skillicorn, David [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
关键词
Fraud prediction; company filings; SEC; deep learning; machine learning; text analytics; PHONEME CLASSIFICATION; BIDIRECTIONAL LSTM;
D O I
10.1080/2573234X.2024.2342773
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional data analytic techniques have been successfully applied to detecting fraud in the Management's Discussion and Analysis sections of company filings mandated by the SEC. Here, we investigate whether fraud detection can be improved by applying deep learning techniques. We build 18 deep learning models and compare their performance on a set of MD&A documents. The best-performing model achieved an accuracy of 91% and an F1-score of 77%, only slightly better than a conventional XGBoost predictor that achieved an accuracy of 91% and an F1-score of 73%. Of the deep learning models, the transformer, those incorporating attention mechanisms, and convolutional neural networks performed well; somewhat surprisingly, sequential models such as LSTMs did not.
引用
收藏
页码:197 / 206
页数:10
相关论文
共 50 条
  • [1] Financial Fraud Identification Based on Stacking Ensemble Learning Algorithm: Introducing MD&A Text Information
    Zhang, Zhiheng
    Ma, Yong
    Hua, Yongjun
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [2] Using MD&A to Improve Earnings Forecasts
    Bochkay, Khrystyna
    Levine, Carolyn B.
    [J]. JOURNAL OF ACCOUNTING AUDITING AND FINANCE, 2019, 34 (03): : 458 - 482
  • [3] Fraud Detection using Machine Learning and Deep Learning
    Raghavan, Pradheepan
    El Gayar, Neamat
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 335 - 340
  • [4] Fraud Detection Using Machine Learning and Deep Learning
    Gandhar A.
    Gupta K.
    Pandey A.K.
    Raj D.
    [J]. SN Computer Science, 5 (5)
  • [5] Predicting Fraud Victimization Using Classical Machine Learning
    Lokanan, Mark
    Liu, Susan
    [J]. ENTROPY, 2021, 23 (03) : 1 - 19
  • [6] Correction to: Fraud Detection Using Machine Learning and Deep Learning
    Akash Gandhar
    Kapil Gupta
    Aman Kumar Pandey
    Dharm Raj
    [J]. SN Computer Science, 5 (7)
  • [7] Predicting corporate credit rating based on qualitative information of MD&A transformed using document vectorization techniques
    Choi, Jinwook
    Suh, Yongmoo
    Jung, Namchul
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2020, 54 (02) : 151 - 168
  • [8] Fraud Detection in Banking Using Deep Reinforcement Learning
    El Bouchti, Abdelali
    Chakroun, Ahmed
    Abbar, Ilassan
    Okar, Chafik
    [J]. 2017 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2017), 2017, : 58 - 63
  • [9] Credit Card Fraud Detection using Deep Learning
    Shenvi, Pranali
    Samant, Neel
    Kumar, Shubham
    Kulkarni, Vaishali
    [J]. 2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [10] Predicting tax fraud using supervised machine learning approach
    Murorunkwere, Belle Fille
    Haughton, Dominique
    Nzabanita, Joseph
    Kipkogei, Francis
    Kabano, Ignace
    [J]. AFRICAN JOURNAL OF SCIENCE TECHNOLOGY INNOVATION & DEVELOPMENT, 2023, 15 (06): : 731 - 742