Bankruptcy prediction using machine learning models with the text-based communicative value of annual reports

被引:13
|
作者
Chen, Tsung-Kang [1 ,3 ]
Liao, Hsien-Hsing [2 ]
Chen, Geng-Dao [1 ]
Kang, Wei-Han [1 ]
Lin, Yu-Chun [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Management Sci, Hsinchu, Taiwan
[2] Natl Taiwan Univ, Dept Finance, New Taipei, Taiwan
[3] Natl Taiwan Univ, Ctr Res Econometr Theory & Applicat, New Taipei, Taiwan
关键词
Annual report text-based communicative value; Bankruptcy prediction; Machine learning; Credit risk; Incomplete information; ANNUAL-REPORT READABILITY; FINANCIAL RATIOS; COMPLEXITY; DISCLOSURE; EARNINGS; IMPACT; FOG;
D O I
10.1016/j.eswa.2023.120714
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate whether including the text-based communicative value of annual report increases the predictive power of four machine learning models (Logistic regression, Random Forest, XGBoost, and Support Vector Machine) for corporate bankruptcy prediction using U.S. firm observations from 1994 to 2018. We find that the overall prediction effectiveness of these four models (e.g. accuracy, F1-score, AUCs) significantly improves, especially true in the performance of XGBoost and Random Forest models. In addition, we find that annual report text-based communicative value variables significantly reduce models' Type II error and keep the Type I error at a relatively small level, especially for the short-term bankruptcy forecast. The results reveal that annual report text-based communicative value effectively mitigates the model misidentification of a non-bankrupt firm as a bankrupt firm. Our results also suggest that annual report text-based communicative value is helpful for bank's corporate loan underwriting decisions. Finally, our findings still hold when considering different testing periods and random state settings, replacing by another publicly available bankruptcy dataset, and introducing neural network models.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Domain Text Classification Using Machine Learning Models
    Rao, Akula V. S. Siva Rama
    Bhavani, D. Ganga
    Krishna, J. Gopi
    Swapna, B.
    Varma, K. Rama Sai
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 573 - 582
  • [42] Machine learning based approval prediction for enhancement reports
    Nafees, Sadeem Ahmad
    Rehman, Faisal Asad Ur
    [J]. PROCEEDINGS OF 2021 INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGIES (IBCAST), 2021, : 377 - 382
  • [43] Performance Comparison of Machine Learning Models for Annual Precipitation Prediction Using Different Decomposition Methods
    Song, Chao
    Chen, Xiaohong
    [J]. REMOTE SENSING, 2021, 13 (05) : 1 - 27
  • [44] Realization of natural language processing and machine learning approaches for text-based sentiment analysis
    Naithani, Kanchan
    Raiwani, Yadav Prasad
    [J]. EXPERT SYSTEMS, 2023, 40 (05)
  • [45] Exploring Text-based Emotions Recognition Machine Learning Techniques on Social Media Conversation
    Chowanda, Andry
    Sutoyo, Rhio
    Meiliana
    Tanachutiwat, Sansiri
    [J]. 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 821 - 828
  • [46] Analysis of Classification Models Based on Cuisine Prediction Using Machine Learning
    Jayaraman, Shobhna
    Choudhury, Tanupriya
    Kumar, Praveen
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 1485 - 1490
  • [47] Prediction of baking quality using machine learning based intelligent models
    Hilal Isleroglu
    Selami Beyhan
    [J]. Heat and Mass Transfer, 2020, 56 : 2045 - 2055
  • [48] Prediction of baking quality using machine learning based intelligent models
    Isleroglu, Hilal
    Beyhan, Selami
    [J]. HEAT AND MASS TRANSFER, 2020, 56 (07) : 2045 - 2055
  • [49] Exploring the use of machine learning for highly accurate text-based information retrieval system
    Sawarkar, Chandrashekhar Himmatrao
    Mulkalwar, Pramod N.
    [J]. Test Engineering and Management, 2019, 81 (11-12): : 6592 - 6599
  • [50] TbExplain: A Text-Based Explanation Method for Scene Classification Models With the Statistical Prediction Correction
    Aminimehr, Amirhossein
    Khani, Pouya
    Molaei, Amirali
    Kazemeini, Amirmohammad
    Cambria, Eric
    [J]. FIRST WORKSHOP ON GOVERNANCE, UNDERSTANDING, AND INTEGRATION OF DATA FOR EFFECTIVE AND RESPONSIBLE AI, GUIDE-AI 2024, 2024, : 54 - 60