Combining Data Envelopment Analysis and Machine Learning

被引:12
|
作者
Guerrero, Nadia M. [1 ]
Aparicio, Juan [1 ]
Valero-Carreras, Daniel [1 ]
机构
[1] Miguel Hernandez Univ Elche UMH, Ctr Operat Res CIO, Elche 03202, Spain
关键词
data envelopment analysis; PAC learning; support vector regression; machine learning; structural risk minimization; DISTANCE FUNCTION; EFFICIENCY; HYPERPLANES; MODELS;
D O I
10.3390/math10060909
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Data Envelopment Analysis (DEA) is one of the most used non-parametric techniques for technical efficiency assessment. DEA is exclusively concerned about the minimization of the empirical error, satisfying, at the same time, some shape constraints (convexity and free disposability). Unfortunately, by construction, DEA is a descriptive methodology that is not concerned about preventing overfitting. In this paper, we introduce a new methodology that allows for estimating polyhedral technologies following the Structural Risk Minimization (SRM) principle. This technique is called Data Envelopment Analysis-based Machines (DEAM). Given that the new method controls the generalization error of the model, the corresponding estimate of the technology does not suffer from overfitting. Moreover, the notion of epsilon-insensitivity is also introduced, generating a new and more robust definition of technical efficiency. Additionally, we show that DEAM can be seen as a machine learning-type extension of DEA, satisfying the same microeconomic postulates except for minimal extrapolation. Finally, the performance of DEAM is evaluated through simulations. We conclude that the frontier estimator derived from DEAM is better than that associated with DEA. The bias and mean squared error obtained for DEAM are smaller in all the scenarios analyzed, regardless of the number of variables and DMUs.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Ensemble learning method for classification: Integrating data envelopment analysis with machine learning
    An, Qingxian
    Huang, Siwei
    Han, Yuxuan
    Zhu, You
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2024, 169
  • [2] An Application of Data Envelopment Analysis and Machine Learning Approach to Risk Management
    Jomthanachai, Suriyan
    Wong, Wai-Peng
    Lim, Chee-Peng
    [J]. IEEE ACCESS, 2021, 9 : 85978 - 85994
  • [3] Data envelopment analysis classification machine
    Yan, Hong
    Wei, Quanling
    [J]. INFORMATION SCIENCES, 2011, 181 (22) : 5029 - 5041
  • [4] Data envelopment analysis assessment machine
    Wei, Quanling
    Yan, Hong
    [J]. ICDM 2006: Sixth IEEE International Conference on Data Mining, Workshops, 2006, : 808 - 812
  • [5] Combining the Virtues of Stochastic Frontier and Data Envelopment Analysis
    Parmeter, Christopher F.
    Zelenyuk, Valentin
    [J]. OPERATIONS RESEARCH, 2019, 67 (06) : 1628 - 1658
  • [6] Assessing the impact of hydropower projects in Brazil through data envelopment analysis and machine learning
    Bortoluzzi, Mirian
    Furlan, Marcelo
    Neto, Jose Francisco dos Reis
    [J]. RENEWABLE ENERGY, 2022, 200 : 1316 - 1326
  • [7] A Note on Combining Machine Learning with Statistical Modeling for Financial Data Analysis
    Maria Sarabia, Jose
    Prieto, Faustino
    Jorda, Vanesa
    Sperlich, Stefan
    [J]. RISKS, 2020, 8 (02)
  • [8] A tropical Atlantic dynamics analysis by combining machine learning and satellite data
    Arnault, Sabine
    Thiria, Sylvie
    Crepon, Michel
    Kaly, Francois
    [J]. ADVANCES IN SPACE RESEARCH, 2021, 68 (02) : 467 - 486
  • [9] Machine Learning in Data Lake for Combining Data Silos
    Wibowo, Merlinda
    Sulaiman, Sarina
    Shamsuddin, Siti Mariyam
    [J]. DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 294 - 306
  • [10] Efficiency evaluation of electricity distribution companies: Integrating data envelopment analysis and machine learning for a holistic analysis
    Omrani, Hashem
    Emrouznejad, Ali
    Teplova, Tamara
    Amini, Mohaddeseh
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133