Using machine learning to select variables in data envelopment analysis: Simulations and application using electricity distribution data

Cited by: 5
Authors
Duras, Toni [1 ]
Javed, Farrukh [2 ]
Mansson, Kristofer [1 ]
Sjolander, Paer [1 ]
Soderberg, Magnus [3 ]
Affiliations
[1] Jonkoping Univ, Jonkoping Int Business Sch, POB 1026, SE-55111 Jonkoping, Sweden
[2] Lund Univ, Lund, Sweden
[3] Griffith Univ, Brisbane, Australia
Keywords
Data envelopment analysis; Curse of dimensionality; Machine learning; Variable selection; Regulation; EFFICIENCY; REGRESSION;
DOI
10.1016/j.eneco.2023.106621
Chinese Library Classification number
F [Economics];
Discipline classification code
02;
Abstract
Agencies that regulate electricity providers often apply nonparametric data envelopment analysis (DEA) to assess the relative efficiency of each firm. The reliability and validity of DEA are contingent upon selecting relevant input variables. In the era of big (wide) data, the assumptions of traditional variable selection techniques are often violated due to challenges related to high-dimensional data and their standard empirical properties. Currently, regulators have access to a large number of potential input variables. Therefore, our aim is to introduce new machine learning methods for regulators of the energy market. We also propose a new two-step analytical approach where, in the first step, the machine learning-based adaptive least absolute shrinkage and selection operator (ALASSO) is used to select variables and, in the second step, selected variables are used in a DEA model. In contrast to previous research, we find, by using a more realistic data-generating process common for production functions (i.e., Cobb-Douglas and Translog), that the performance of different machine learning techniques differs substantially in different empirically relevant situations. Simulations also reveal that the ALASSO is superior to other machine learning and regression-based methods when the collinearity is low or moderate. However, in situations of multicollinearity, the LASSO approach exhibits the best performance. We also use real data from the Swedish electricity distribution market to illustrate the empirical relevance of selecting the most appropriate variable selection method.
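The two-step idea in the abstract (ALASSO for variable selection, then DEA on the selected inputs) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the simulated Cobb-Douglas data, the choice to regress log output on log candidate inputs, the OLS-based adaptive weights, the hand-picked penalty `lam`, and the input-oriented constant-returns DEA model are all assumptions, and the function names `lasso_cd`, `adaptive_lasso`, and `dea_input_crs` are hypothetical.

```python
import numpy as np
from scipy.optimize import linprog


def lasso_cd(X, y, lam, n_iter=500, tol=1e-8):
    """Plain LASSO by coordinate descent: min (1/2n)||y - Xb||^2 + lam*||b||_1."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    r = y.copy()  # current residual y - X b
    for _ in range(n_iter):
        max_step = 0.0
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            rho = X[:, j] @ r / n + col_sq[j] * b[j]
            new = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            if new != b[j]:
                r -= X[:, j] * (new - b[j])
                max_step = max(max_step, abs(new - b[j]))
                b[j] = new
        if max_step < tol:
            break
    return b


def adaptive_lasso(X, y, lam, gamma=1.0):
    """Adaptive LASSO with weights w_j = 1/|b_ols_j|^gamma (assumed initial estimator: OLS)."""
    Xc = X - X.mean(axis=0)  # centering removes the intercept
    yc = y - y.mean()
    b_ols, *_ = np.linalg.lstsq(Xc, yc, rcond=None)
    scale = np.abs(b_ols) ** gamma          # equals 1 / w_j
    b_star = lasso_cd(Xc * scale, yc, lam)  # ordinary LASSO on rescaled columns
    return b_star * scale                   # map back to the original scale


def dea_input_crs(X, Y):
    """Input-oriented DEA efficiency scores under constant returns to scale."""
    n, m = X.shape
    s = Y.shape[1]
    scores = np.empty(n)
    for o in range(n):
        # Decision variables: [theta, lambda_1, ..., lambda_n]; minimise theta.
        c = np.r_[1.0, np.zeros(n)]
        A_in = np.hstack([-X[o][:, None], X.T])      # X' lambda <= theta * x_o
        A_out = np.hstack([np.zeros((s, 1)), -Y.T])  # Y' lambda >= y_o
        res = linprog(c, A_ub=np.vstack([A_in, A_out]),
                      b_ub=np.r_[np.zeros(m), -Y[o]],
                      bounds=[(0, None)] * (n + 1))
        scores[o] = res.fun
    return scores


# Simulated data (assumption): only the first two of six candidate inputs
# enter a Cobb-Douglas production function; u is inefficiency, v is noise.
rng = np.random.default_rng(0)
n, p = 200, 6
X = rng.uniform(1.0, 10.0, size=(n, p))
u = rng.exponential(0.2, size=n)
v = rng.normal(0.0, 0.05, size=n)
y = X[:, 0] ** 0.5 * X[:, 1] ** 0.3 * np.exp(v - u)

# Step 1: select inputs with ALASSO on the log-linearised model.
beta = adaptive_lasso(np.log(X), np.log(y), lam=0.01)
selected = np.flatnonzero(np.abs(beta) > 1e-8)

# Step 2: run DEA using only the selected inputs.
scores = dea_input_crs(X[:, selected], y.reshape(-1, 1))
print("selected input columns:", selected)
print("mean DEA efficiency:", scores.mean())
```

In this sketch the penalty `lam` is fixed by hand for brevity; in practice it would normally be tuned, for example by cross-validation or an information criterion.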
Pages: 10