Measuring data-centre workflows complexity through process mining: the Google cluster case

被引:16
|
作者
Fernandez-Cerero, Damian [1 ]
Jesus Varela-Vaca, Angel [1 ]
Fernandez-Montes, Alejandro [1 ]
Teresa Gomez-Lopez, Maria [1 ]
Antonio Alvarez-Bermejo, Jose [2 ]
机构
[1] Univ Seville, Dept Comp Languages & Syst, Seville 41012, Spain
[2] Univ Almeria, Dept Comp Sci, Almeria 04120, Spain
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 04期
关键词
Cloud computing; Business process management; Scheduling; Process mining; Process discovery; High performance computing; ENERGY POLICIES; CLOUD; MACHINES;
D O I
10.1007/s11227-019-02996-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data centres have become the backbone of large Cloud services and applications, providing virtually unlimited elastic and scalable computational and storage resources. The search for the efficiency and optimisation of resources is one of the current key aspects for large Cloud Service Providers and is becoming more and more challenging, since new computing paradigms such as Internet of Things, Cyber-Physical Systems and Edge Computing are spreading. One of the key aspects to achieve efficiency in data centres consists of the discovery and proper analysis of the data-centre behaviour. In this paper, we present a model to automatically retrieve execution workflows of existing data-centre logs by employing process mining techniques. The discovered processes are characterised and analysed according to the understandability and complexity in terms of execution efficiency of data-centre jobs. We finally validate and demonstrate the usability of the proposal by applying the model in a real scenario, that is, the Google Cluster traces.
引用
收藏
页码:2449 / 2478
页数:30
相关论文
共 50 条
  • [41] Prediction of Composites Behavior Undergoing an ATP Process through Data-Mining
    Martin, Clara Argerich
    Collado, Angel Leon
    Pinillo, Ruben Ibanez
    Barasinski, Anais
    Abisset-Chavanne, Emmanuelle
    Chinesta, Francisco
    PROCEEDINGS OF 21ST INTERNATIONAL ESAFORM CONFERENCE ON MATERIAL FORMING (ESAFORM 2018), 2018, 1960
  • [42] PMDG: Privacy for Multi-perspective Process Mining Through Data Generalization
    Hildebrant, Ryan
    Fahrenkrog-Petersen, Stephan A.
    Weidlich, Matthias
    Ren, Shangping
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2023, 2023, 13901 : 506 - 521
  • [43] Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
    Ayyoubzadeh, Seyed Mohammad
    Ayyoubzadeh, Seyed Mehdi
    Zahedi, Hoda
    Ahmadi, Mahnaz
    Kalhori, Sharareh R. Niakan
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2020, 6 (02): : 192 - 198
  • [44] Measuring the process of quality of care for ST-segment elevation acute myocardial infarction through data-mining of the electronic discharge notes
    Chang, Sheng-Nan
    Lin, Jou-Wei
    Liu, Shi-Chi
    Hwang, Juey-Jen
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2008, 14 (01) : 116 - 120
  • [45] Measuring regional competitiveness through Data Envelopment Analysis: A Peruvian case
    Charles, Vincent
    Felipe Zegarra, Luis
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (11) : 5371 - 5381
  • [46] Process-Driven Data Quality Management Through Integration of Data Quality into Existing Process Models Application of Complexity-Reducing Patterns and the Impact on Complexity Metrics
    Glowalla, Paul
    Sunyaev, Ali
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2013, 5 (06) : 433 - 448
  • [47] Identifying Variation in Personal Daily Routine Through Process Mining: A Case Study
    Di Federico, Gemma
    Fernandez-Llatas, Carlos
    Ahmadi, Zahra
    Shirali, Mohsen
    Burattin, Andrea
    PROCESS MINING WORKSHOPS, ICPM 2023, 2024, 503 : 223 - 234
  • [48] A case study on integrating data analysis and process mining in conventional tunnel construction
    Melnyk, Oleksandr
    Huymajer, Marco
    Huemer, Christian
    Rosenberger, Lucas
    Mazak-Huemer, Alexandra
    DEVELOPMENTS IN THE BUILT ENVIRONMENT, 2025, 22
  • [49] Quality-Informed Process Mining: A Case for Standardised Data Quality Annotations
    Goel, Kanika
    Leemans, Sander J. J.
    Martin, Niels
    Wynn, Moe T.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)
  • [50] Research on Data Mining Service and Its Application Case in Complex Industrial Process
    Lu, Qi
    Lyu, Zhi-Jun
    Xiang, Qian
    Zhou, Yaqin
    Bao, Jinsong
    2017 13TH IEEE CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2017, : 1124 - 1129