Green AI in the finance industry: Exploring the impact of feature engineering on the accuracy and computational time of Machine Learning models

Cited by: 0
Authors
Machado, Marcos R. [1]
Asadi, Amin [1]
de Souza, Renato William R. [2]
Ugulino, Wallace C. [3]
Affiliations
[1] Univ Twente, Dept Ind Engn & Business Informat Syst IEBIS, NL-7500 AE Enschede, Netherlands
[2] Fed Inst Educ Sci & Technol Ceara, Rod Pres Juscelino Kubitschek, BR-63870000 Boa Viagem, Ceara, Brazil
[3] Univ Twente, Dept Semant Cybersecur & Serv SCS, NL-7500 AE Enschede, Netherlands
Keywords
Feature engineering; Green AI; Machine Learning; Hybrid Machine Learning; Customer loyalty; Finance industry; Classification; Algorithm; Prediction; Systems
DOI
10.1016/j.asoc.2024.112343
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As research and practice in Artificial Intelligence (AI) applications rapidly expand, the support for AI deployment is also increasing. While the abundance of data allows for sophisticated feature engineering techniques that can enhance accuracy, it is crucial to highlight both the computational costs and the efficiency with which these models operate. This paper compares the processing time and accuracy of individual and hybrid Machine Learning (ML) models in predicting customer loyalty within financial contexts. Frameworks that incorporate feature engineering and green AI principles are used separately in both individual and hybrid approaches. The individual models are the regressor-based algorithms commonly applied to business problems. The hybrid models first use k-Means to cluster customers and then apply individual regressor-based models (e.g., decision trees, gradient boosting, and LightGBM). The results show that using fewer features yields only marginally lower accuracy than using more features (a difference of approximately 0.01 in MAE when comparing 18 versus 85 features). Additionally, this article demonstrates the trade-off between higher accuracy and longer computational time in hybrid ML models versus lower accuracy and shorter computational time in individual models when predicting customer loyalty. Hybrid models exhibit a lower MSE (approximately 0.88) than individual models (approximately 0.91). These findings provide managers with insights on selecting the most appropriate model based on their organization's specific needs.
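The individual-versus-hybrid comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic, the regressor is a deliberately trivial mean predictor standing in for decision trees, gradient boosting, or LightGBM, and all function names (`kmeans`, `make_customers`, `run_demo`) are hypothetical. It only shows the shape of the idea: cluster customers first, fit one regressor per cluster, then compare MAE and wall-clock time against a single global regressor.

```python
import random
import time
from statistics import fmean


def mae(y_true, y_pred):
    """Mean absolute error between two equally long sequences."""
    return fmean(abs(t - p) for t, p in zip(y_true, y_pred))


def nearest(point, centroids):
    """Index of the centroid closest to `point` (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(point, centroids[i])),
    )


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on tuples of floats; returns the final centroids."""
    centroids = random.Random(seed).sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        # Keep the previous centroid if a cluster ends up empty.
        centroids = [
            tuple(fmean(col) for col in zip(*g)) if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids


def make_customers(n_per_group=50, seed=42):
    """Synthetic data (assumption): two customer segments, different loyalty levels."""
    rng = random.Random(seed)
    X, y = [], []
    for centre, loyalty in ((0.0, 1.0), (10.0, 5.0)):
        for _ in range(n_per_group):
            X.append((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)))
            y.append(loyalty + rng.gauss(0.0, 0.2))
    return X, y


def run_demo(k=2):
    """Compare one global regressor with a cluster-then-regress hybrid.

    Errors are measured on the training data to keep the sketch short.
    """
    X, y = make_customers()

    # Individual model: a single (mean) regressor for all customers.
    start = time.perf_counter()
    global_mean = fmean(y)
    individual_pred = [global_mean for _ in X]
    individual_time = time.perf_counter() - start

    # Hybrid model: k-means first, then one (mean) regressor per cluster.
    start = time.perf_counter()
    centroids = kmeans(X, k)
    labels = [nearest(p, centroids) for p in X]
    cluster_mean = {
        c: fmean(t for t, lab in zip(y, labels) if lab == c) for c in set(labels)
    }
    hybrid_pred = [cluster_mean[lab] for lab in labels]
    hybrid_time = time.perf_counter() - start

    return {
        "individual_mae": mae(y, individual_pred),
        "hybrid_mae": mae(y, hybrid_pred),
        "individual_time": individual_time,
        "hybrid_time": hybrid_time,
    }


if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name}: {value:.6f}")
```

On data with clear segments like this, the hybrid variant should report a lower MAE while spending extra time on clustering, which mirrors the accuracy-versus-time trade-off the abstract reports. Swapping the mean predictor for a real regressor and adding a held-out test set would be the natural next steps.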
Pages: 20