Green AI in the finance industry: Exploring the impact of feature engineering on the accuracy and computational time of Machine Learning models

被引:0
|
作者
Machado, Marcos R. [1 ]
Asadi, Amin [1 ]
de Souza, Renato William R. [2 ]
Ugulino, Wallace C. [3 ]
机构
[1] Univ Twente, Dept Ind Engn & Business Informat Syst IEBIS, NL-7500 AE Enschede, Netherlands
[2] Fed Inst Educ Sci & Technol Ceara, Rod Pres Juscelino Kubitschek, BR-63870000 Boa Viagem, Ceara, Brazil
[3] Univ Twente, Dept Semant Cybersecur & Serv SCS, NL-7500 AE Enschede, Netherlands
关键词
Feature engineering; Green AI; Machine Learning; Hybrid Machine Learning; Customer loyalty; Finance industry; CUSTOMER LOYALTY; CLASSIFICATION; ALGORITHM; PREDICTION; SYSTEMS;
D O I
10.1016/j.asoc.2024.112343
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As research and practice in Artificial Intelligence (AI) applications rapidly expand, the support for AI deployment is also increasing. While the abundance of data allows for sophisticated feature engineering techniques that can enhance accuracy, it is crucial to highlight both the computational costs and the efficiency with which these models operate. This paper compares the processing time and accuracy of individual and hybrid Machine Learning (ML) models in predicting customer loyalty within financial contexts. Frameworks that incorporate feature engineering and green AI principles are used separately in both individual and hybrid approaches. The individual models are the commonly used regressor-based algorithms applied to business problems. The hybrid models first use k-Means to cluster customers, followed by the application of individual regressor-based models (e.g., decision trees, gradient boosting, and LightGBM). The present results show that using fewer features results in only a marginally lower accuracy compared to models with more features (a difference of approximate to 0.01 in MAE when comparing the use of 18 versus 85 features). Additionally, this article clearly demonstrate the trade-off between higher accuracy and longer computational time in hybrid ML models versus lower accuracy and shorter computational time in individual models when predicting customer loyalty. Hybrid models exhibit a lower MSE ( approximate to 0 . 88 ) compared to individual models (approximate to 0.91). These findings provide managers with insights on selecting the most appropriate model based on their organization's specific needs.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Exploring Gender Differences in Computational Thinking Learning in a VR Classroom: Developing Machine Learning Models Using Eye-Tracking Data and Explaining the Models
    Hong Gao
    Lisa Hasenbein
    Efe Bozkir
    Richard Göllner
    Enkelejda Kasneci
    International Journal of Artificial Intelligence in Education, 2023, 33 : 929 - 954
  • [42] Green finance and the socio-politico-economic factors' impact on the future oil prices: Evidence from machine learning
    Mohsin, Muhammad
    Jamaani, Fouad
    RESOURCES POLICY, 2023, 85
  • [43] Exploring structure-composition relationships of cubic perovskite oxides via extreme feature engineering and automated machine learning
    Deng, Qin
    Lin, Bin
    MATERIALS TODAY COMMUNICATIONS, 2021, 28 (28):
  • [44] Feature Engineering and Machine Learning Predictive Quality Models for Friction Stir Welding Defect Prediction in Aerospace Applications
    Camps, Marta
    Etxegarai, Maddi
    Bonada, Francesc
    Lacheny, William
    Pauleau, Sylvain
    Domingo, Xavier
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 151 - 154
  • [45] Combining machine learning and process engineering physics towards enhanced accuracy and explainability of data-driven models
    Bikmukhametov, Timur
    Jaschke, Johannes
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 138
  • [46] Impact of Feature Selection Techniques on the Performance of Machine Learning Models for Depression Detection Using EEG Data
    Hassan, Marwa
    Kaabouch, Naima
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [47] Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning
    Piles, Miriam
    Bergsma, Rob
    Gianola, Daniel
    Gilbert, Helene
    Tusell, Llibertat
    FRONTIERS IN GENETICS, 2021, 12
  • [48] Constructing different machine learning models for identifying pelvic lipomatosis based on AI-assisted CT image feature recognition
    Wang, Maoyu
    Zhang, Zheran
    Xu, Zhikang
    Chen, Haihu
    Hua, Meimian
    Zeng, Shuxiong
    Yue, Xiaodong
    Xu, Chuanliang
    ABDOMINAL RADIOLOGY, 2024, : 1811 - 1821
  • [49] Feature engineering on climate data with machine learning to understand time-lagging effects in pasture yield prediction
    Balasubramaniam, Thirunavukarasu
    Mohotti, Wathsala Anupama
    Sabir, Kenneth
    Nayak, Richi
    ECOLOGICAL INFORMATICS, 2025, 86
  • [50] Improving the Accuracy of Ensemble Machine Learning Classification Models Using a Novel Bit-Fusion Algorithm for Healthcare AI Systems
    Mishra, Sashikala
    Shaw, Kailash
    Mishra, Debahuti
    Patil, Shruti
    Kotecha, Ketan
    Kumar, Satish
    Bajaj, Simi
    FRONTIERS IN PUBLIC HEALTH, 2022, 10