Cloud Workload Categorization Using Various Data Preprocessing and Clustering Techniques

被引:0
|
作者
Daraghmeh, Mustafa [1 ]
Agarwal, Anjali [1 ]
Jararweh, Yaser [2 ]
机构
[1] Concordia Univ, Montreal, PQ, Canada
[2] Jordan Univ Sci & Technol, Irbid, Jordan
关键词
Cloud Workload Segmentation; Feature Engineering; Clustering Analysis; Cloud Computing;
D O I
10.1145/3603166.3632131
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Effectively managing cloud resources can be challenging due to the inter-dependencies of various cloud-hosted services. Workload categorization identifies and groups workloads with similar characteristics. Data center managers can make informed decisions on resource allocation, workload scheduling, and infrastructure maintenance, leading to better performance and reduced costs. However, since cloud workloads can be interpreted differently due to their characteristics, several well-founded categories can be concealed within the various data perspectives. This paper proposes a workload categorization approach to automate the categorization process of the scheduling workloads, utilizing different clustering and data preprocessing methods, evaluated using a cloud workload trace derived from Microsoft Azure. Our research highlights the importance of using advanced data preprocessing techniques and integrating them seamlessly into clustering methods to ensure precise workload segmentation.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Categorization of Cloud Workload Types with Clustering
    Orzechowski, Piotr
    Proficz, Jerzy
    Krawczyk, Henryk
    Szymanski, Julian
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 1, 2017, 395 : 303 - 313
  • [2] Advanced data preprocessing using fuzzy clustering techniques
    Genther, H
    Glesner, M
    [J]. FUZZY SETS AND SYSTEMS, 1997, 85 (02) : 155 - 164
  • [3] Effects of Various Preprocessing Techniques to Turkish Text Categorization Using N-Gram Features
    Deniz, Ayca
    Kiziloz, Llakan Ezgi
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 655 - 660
  • [4] A Systematic Cloud Workload Clustering Technique in Large Scale Data Centers
    Ismaeel, Salam
    Miri, Ali
    [J]. 2019 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2019), 2019, : 362 - 363
  • [5] Garment Categorization Using Data Mining Techniques
    Jain, Sheenam
    Kumar, Vijay
    [J]. SYMMETRY-BASEL, 2020, 12 (06):
  • [6] Preprocessing of Terrain Data in the Cloud using a Workflow Management System
    Kaster, Marvin
    Wuerz, Hendrik M.
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 40 - 49
  • [7] Integrating clustering and regression for workload estimation in the cloud
    Yu, Yongjia
    Jindal, Vasu
    Yen, I-Ling
    Bastani, Farokh
    Xu, Jie
    Garraghan, Peter
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (23):
  • [8] On the Analysis of Work Accidents Data by Using Data Preprocessing and Statistical Techniques
    Aksehir, Zinnet Duygu
    Oruc, Yalcin
    Elibol, Ahmet
    Akleylek, Sedat
    Kilic, Erdal
    [J]. 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 649 - +
  • [9] Classification of Short Text Using Various Preprocessing Techniques: An Empirical Evaluation
    Kumar, H. M. Keerthi
    Harish, B. S.
    [J]. RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 3, 2018, 709 : 19 - 30
  • [10] A Comparative Study of Various Clustering Techniques on Big Data Sets using Apache Mahout
    Eluri, Venkateswara Reddy
    AL-Jabri, Amina Salim Mohd
    Ramesh, M.
    Jane, Mare
    [J]. 2016 3RD MEC INTERNATIONAL CONFERENCE ON BIG DATA AND SMART CITY (ICBDSC), 2016, : 374 - 377