Measuring data-centre workflows complexity through process mining: the Google cluster case

被引:16
|
作者
Fernandez-Cerero, Damian [1 ]
Jesus Varela-Vaca, Angel [1 ]
Fernandez-Montes, Alejandro [1 ]
Teresa Gomez-Lopez, Maria [1 ]
Antonio Alvarez-Bermejo, Jose [2 ]
机构
[1] Univ Seville, Dept Comp Languages & Syst, Seville 41012, Spain
[2] Univ Almeria, Dept Comp Sci, Almeria 04120, Spain
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 04期
关键词
Cloud computing; Business process management; Scheduling; Process mining; Process discovery; High performance computing; ENERGY POLICIES; CLOUD; MACHINES;
D O I
10.1007/s11227-019-02996-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data centres have become the backbone of large Cloud services and applications, providing virtually unlimited elastic and scalable computational and storage resources. The search for the efficiency and optimisation of resources is one of the current key aspects for large Cloud Service Providers and is becoming more and more challenging, since new computing paradigms such as Internet of Things, Cyber-Physical Systems and Edge Computing are spreading. One of the key aspects to achieve efficiency in data centres consists of the discovery and proper analysis of the data-centre behaviour. In this paper, we present a model to automatically retrieve execution workflows of existing data-centre logs by employing process mining techniques. The discovered processes are characterised and analysed according to the understandability and complexity in terms of execution efficiency of data-centre jobs. We finally validate and demonstrate the usability of the proposal by applying the model in a real scenario, that is, the Google Cluster traces.
引用
收藏
页码:2449 / 2478
页数:30
相关论文
共 50 条
  • [31] Optimizing a batch manufacturing process through interpretable data mining models
    Mark Last
    Guy Danon
    Sholomo Biderman
    Eli Miron
    Journal of Intelligent Manufacturing, 2009, 20 : 523 - 534
  • [32] The Case Ordering Problem in Surgical Procedural Training through Process Mining
    Cornejo, Felipe
    Fazio, Cristian
    Munoz-Gama, Jorge
    Sepulveda, Marcos
    Fuentes, Ricardo
    de la Fuente, Rene
    2019 38TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2019,
  • [33] A Case Investigation of Product Structure Complexity in Mass Customization Using a Data Mining Approach
    Nielsen, Peter
    Brunoe, Thomas D.
    Nielsen, Kjeld
    PROCEEDINGS OF THE 7TH WORLD CONFERENCE ON MASS CUSTOMIZATION, PERSONALIZATION, AND CO-CREATION (MCPC 2014) - TWENTY YEARS OF MASS CUSTOMIZATION - TOWARDS NEW FRONTIERS, 2014, : 17 - 25
  • [34] Exploring Robot Personality through Big Data Mining: A Century-Long Analysis from Google Books
    Xu, Liang
    Chao, Chiju
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024, 40 (22) : 7642 - 7654
  • [35] Towards Process Mining of EMR Data Case Study for Sepsis Management
    de Vries, Gert-Jan
    Quintano Neira, Ricardo Alfredo
    Geleijnse, Gijs
    Dixit, Prabhakar
    Mazza, Bruno Franco
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2017, : 585 - 593
  • [36] Process-Driven Data Quality Management Through Integration of Data Quality into Existing Process ModelsApplication of Complexity-Reducing Patterns and the Impact on Complexity Metrics
    Paul Glowalla
    Ali Sunyaev
    Business & Information Systems Engineering, 2013, 5 : 433 - 448
  • [37] Educational data mining using cluster analysis and decision tree technique: A case study
    Krizanic, Snjezana
    INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2020, 12
  • [38] A DECISION MAKING PROCESS APPLICATION FOR THE SLURRY PRODUCTION IN CERAMICS VIA FUZZY CLUSTER AND DATA MINING
    Gurbuz, Feyza
    Pardalos, Panos M.
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2012, 8 (02) : 285 - 297
  • [39] Identifying the Context of Data Usage to Diagnose Privacy Issues through Process Mining
    Mehr, Azadeh Sadat Mozafari
    de Carvalho, Renata M.
    van Dongen, Boudewijn
    TRANSACTIONS ON DATA PRIVACY, 2023, 16 (02) : 123 - 151
  • [40] Understanding the indoor environment through mining sensory data - A case study
    Wu, Shaomin
    Clements-Croome, Derek
    ENERGY AND BUILDINGS, 2007, 39 (11) : 1183 - 1191