Predicting and explaining corruption across countries: A machine learning approach

被引:41
|
作者
Lima, Marcio Salles Melo [1 ,2 ]
Delen, Dursun [3 ]
机构
[1] Metalsider, Res & Dev, Betim, MG, Brazil
[2] Oklahoma State Univ, Spears Sch Business, Stillwater, OK 74078 USA
[3] Oklahoma State Univ, Spears Sch Business, Dept Management Sci & Informat Syst, Dept Business Analyt, Stillwater, OK 74078 USA
关键词
Corruption perception; Machine learning; Predictive modeling; Random forest; Society policies and regulations; Government integrity; Social development; MULTIPLE IMPUTATION; PUBLIC CORRUPTION; ECONOMIC-FREEDOM; INJURY SEVERITY; RANDOM FOREST; CLASSIFICATION; DECENTRALIZATION; CHALLENGES; REGRESSION; SCIENCE;
D O I
10.1016/j.giq.2019.101407
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
In the era of Big Data, Analytics, and Data Science, corruption is still ubiquitous and is perceived as one of the major challenges of modem societies. A large body of academic studies has attempted to identify and explain the potential causes and consequences of corruption, at varying levels of granularity, mostly through theoretical lenses by using correlations and regression-based statistical analyses. The present study approaches the phenomenon from the predictive analytics perspective by employing contemporary machine learning techniques to discover the most important corruption perception predictors based on enriched/enhanced nonlinear models with a high level of predictive accuracy. Specifically, within the multiclass classification modeling setting that is employed herein, the Random Forest (an ensemble-type machine learning algorithm) is found to be the most accurate prediction/classification model, followed by Support Vector Machines and Artificial Neural Networks. From the practical standpoint, the enhanced predictive power of machine learning algorithms coupled with a multi-source database revealed the most relevant corruption-related information, contributing to the related body of knowledge, generating actionable insights for administrator, scholars, citizens, and politicians. The variable importance results indicated that government integrity, property rights, judicial effectiveness, and education index are the most influential factors in defining the corruption level of significance.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Explaining and predicting employees’ attrition: a machine learning approach
    Praphula Kumar Jain
    Madhur Jain
    Rajendra Pamula
    [J]. SN Applied Sciences, 2020, 2
  • [2] Explaining and predicting employees' attrition: a machine learning approach
    Jain, Praphula Kumar
    Jain, Madhur
    Pamula, Rajendra
    [J]. SN APPLIED SCIENCES, 2020, 2 (04):
  • [3] Student and school performance across countries: A machine learning approach
    Masci, Chiara
    Johnes, Geraint
    Agasisti, Tommaso
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 269 (03) : 1072 - 1085
  • [4] Predicting and Explaining Pig Iron Production on Charcoal Blast Furnaces: A Machine Learning Approach
    Melo Lima, Marcio Salles
    Eryarsoy, Enes
    Delen, Dursun
    [J]. INFORMS JOURNAL ON APPLIED ANALYTICS, 2021, 51 (03): : 213 - 235
  • [5] Predicting Adolescent Mental Health Outcomes Across Cultures: A Machine Learning Approach
    Rothenberg, W. Andrew
    Bizzego, Andrea
    Esposito, Gianluca
    Lansford, Jennifer E.
    Al-Hassan, Suha M.
    Bacchini, Dario
    Bornstein, Marc H.
    Chang, Lei
    Deater-Deckard, Kirby
    Di Giunta, Laura
    Dodge, Kenneth A.
    Gurdal, Sevtap
    Liu, Qin
    Long, Qian
    Oburu, Paul
    Pastorelli, Concetta
    Skinner, Ann T.
    Sorbring, Emma
    Tapanya, Sombat
    Steinberg, Laurence
    Tirado, Liliana Maria Uribe
    Yotanyamaneewong, Saengduean
    Alampay, Liane Pena
    [J]. JOURNAL OF YOUTH AND ADOLESCENCE, 2023, 52 (08) : 1595 - 1619
  • [6] Predicting Adolescent Mental Health Outcomes Across Cultures: A Machine Learning Approach
    W. Andrew Rothenberg
    Andrea Bizzego
    Gianluca Esposito
    Jennifer E. Lansford
    Suha M. Al-Hassan
    Dario Bacchini
    Marc H. Bornstein
    Lei Chang
    Kirby Deater-Deckard
    Laura Di Giunta
    Kenneth A. Dodge
    Sevtap Gurdal
    Qin Liu
    Qian Long
    Paul Oburu
    Concetta Pastorelli
    Ann T. Skinner
    Emma Sorbring
    Sombat Tapanya
    Laurence Steinberg
    Liliana Maria Uribe Tirado
    Saengduean Yotanyamaneewong
    Liane Peña Alampay
    [J]. Journal of Youth and Adolescence, 2023, 52 : 1595 - 1619
  • [7] Predicting and explaining with machine learning models: Social science as a touchstone
    Buchholz, Oliver
    Grote, Thomas
    [J]. STUDIES IN HISTORY AND PHILOSOPHY OF SCIENCE, 2023, 102 : 60 - 69
  • [8] On the detection of patterns in electricity prices across European countries: An unsupervised machine learning approach
    Saligkaras, Dimitrios
    Papageorgiou, Vasileios E.
    [J]. AIMS ENERGY, 2022, 10 (06) : 1146 - 1164
  • [9] A machine learning approach to rank the determinants of banking crises over time and across countries
    Casabianca, Elizabeth Jane
    Catalano, Michele
    Forni, Lorenzo
    Giarda, Elena
    Passeri, Simone
    [J]. JOURNAL OF INTERNATIONAL MONEY AND FINANCE, 2022, 129
  • [10] Evaluating the reliability of neurocognitive biomarkers of neurodegenerative diseases across countries: A machine learning approach
    Belen Bachli, M.
    Sedeno, Lucas
    Ochab, Jeremi K.
    Piguet, Olivier
    Kumfor, Fiona
    Reyes, Pablo
    Torralva, Teresa
    Roca, Maria
    Felipe Cardona, Juan
    Gonzalez Campo, Cecilia
    Herrera, Eduar
    Slachevsky, Andrea
    Matallana, Diana
    Manes, Facundo
    Garcia, Adolfo M.
    Ibanez, Agustin
    Chialvo, Dante R.
    [J]. NEUROIMAGE, 2020, 208