Data envelopment analysis and big data

被引:57
|
作者
Khezrimotlagh, Dariush [1 ]
Zhu, Joe [2 ,3 ]
Cook, Wade D. [4 ]
Toloo, Mehdi [5 ]
机构
[1] Penn State Univ, Dept Math, Harrisburg, PA USA
[2] Nanjing Audit Univ, Coll Auditing & Evaluat, Nanjing 211815, Jiangsu, Peoples R China
[3] Worcester Polytech Inst, Foisie Business Sch, Worcester, MA 01609 USA
[4] York Univ, Schulich Sch Business, N York, ON, Canada
[5] VSB Tech Univ Ostrava, Dept Syst Engn, Ostrava, Czech Republic
关键词
Data envelopment analysis (DEA); Big data; Performance evaluation; Simulation; DEA; EFFICIENCY; ALGORITHM; MODELS;
D O I
10.1016/j.ejor.2018.10.044
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In the traditional data envelopment analysis (DEA) approach for a set of n Decision Making Units (DMUs), a standard DEA model is solved n times, one for each DMU. As the number of DMUs increases, the running-time to solve the standard model sharply rises. In this study, a new framework is proposed to significantly decrease the required DEA calculation time in comparison with the existing methodologies when a large set of DMUs (e.g., 20,000 DMUs or more) is present. The framework includes five steps: (i) selecting a subsample of DMUs using a proposed algorithm, (ii) finding the best-practice DMUs in the selected subsample, (iii) finding the exterior DMUs to the hull of the selected subsample, (iv) identifying the set of all efficient DMUs, and (v) measuring the performance scores of DMUs as those arising from the traditional DEA approach. The variable returns to scale technology is assumed and several simulation experiments are designed to estimate the running-time for applying the proposed method for big data. The obtained results in this study point out that the running-time is decreased up to 99.9% in comparison with the existing techniques. In addition, we illustrate the essential computation time for applying the proposed method as a function of the number of DMUs (cardinality), number of inputs and outputs (dimension), and the proportion of efficient DMUs (density). The methods are also compared on a real data set consisting of 30,099 electric power plants in the United States from 1996 to 2016. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:1047 / 1054
页数:8
相关论文
共 50 条
  • [41] Advances in data envelopment analysis
    Ali Emrouznejad
    [J]. Annals of Operations Research, 2014, 214 : 1 - 4
  • [42] Improving public services' performance measurement systems: applying data envelopment analysis in the big and open data context
    Bartolacci, Francesca
    Del Gobbo, Roberto
    Soverchia, Michela
    [J]. INTERNATIONAL JOURNAL OF PUBLIC SECTOR MANAGEMENT, 2024,
  • [43] Clustering and meta-envelopment in data envelopment analysis
    Tsionas, Mike G.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 304 (02) : 763 - 778
  • [44] Hybrid cluster and data envelopment analysis with interval data
    Kianfar, K.
    Namin, M. Ahadzadeh
    Tabriz, A. Alam
    Najafi, E.
    Lotfi, F. Hosseinzadeh
    [J]. SCIENTIA IRANICA, 2018, 25 (05) : 2904 - 2911
  • [45] Qualitative and Quantitative Data Envelopment Analysis with Interval Data
    Inuiguchi, Masahiro
    Mizoshita, Fumiki
    [J]. INTEGRATED UNCERTAINTY MANAGEMENT AND APPLICATIONS, 2010, 68 : 163 - 174
  • [46] A generalized model for data envelopment analysis with interval data
    Jahanshahloo, G. R.
    Lotfi, F. Hosseinzadeh
    Malkhalifeh, M. Rostamy
    Namin, M. Ahadzadeh
    [J]. APPLIED MATHEMATICAL MODELLING, 2009, 33 (07) : 3237 - 3244
  • [47] Flexible factors in categorized data for data envelopment analysis
    Salehian, Mir-Vahid
    Saati, Saber
    Sohraee, Sevan
    [J]. OPSEARCH, 2024, 61 (01) : 163 - 188
  • [48] Ranking units in Data Envelopment Analysis with fuzzy data
    Valami, Hadi Bagherzadeh
    Raeinojehdehi, Reza
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 30 (05) : 2505 - 2516
  • [49] Qualitative and quantitative data envelopment analysis with interval data
    Masahiro Inuiguchi
    Fumiki Mizoshita
    [J]. Annals of Operations Research, 2012, 195 : 189 - 220
  • [50] Flexible factors in categorized data for data envelopment analysis
    Mir-Vahid Salehian
    Saber Saati
    Sevan Sohraee
    [J]. OPSEARCH, 2024, 61 : 163 - 188