An Infrastructure for Automating Large-scale Performance Studies and Data Processing

被引:0
|
作者
Jayasinghe, Deepal [1 ]
Kimball, Josh [1 ]
Zhu, Tao [1 ]
Choudhary, Siddharth [1 ]
Pu, Calton [1 ]
机构
[1] Georgia Inst Technol, Ctr Expt Res Comp Syst, Atlanta, GA 30332 USA
关键词
Automation; Benchmarking; Cloud; Code Generation; Data Warehouse; ETL; Performance; Visualization;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Cloud has enabled the computing model to shift from traditional data centers to publicly shared computing infrastructure; yet, applications leveraging this new computing model can experience performance and scalability issues, which arise from the hidden complexities of the cloud. The most reliable path for better understanding these complexities is an empirically based approach that relies on collecting data from a large number of performance studies. Armed with this performance data, we can understand what has happened, why it happened, and more importantly, predict what will happen in the future. However, this approach presents challenges itself, namely in the form of data management. We attempt to mitigate these data challenges by fully automating the performance measurement process. Concretely, we have developed an automated infrastructure, which reduces the complexity of the large-scale performance measurement process by generating all the necessary resources to conduct experiments, to collect and process data and to store and analyze data. In this paper, we focus on the performance data management aspect of our infrastructure.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Optimisation and Application of a Data (Pre)Processing Strategy for Large-Scale DNA Adductomics in Exposomics Studies
    Vangeenderhuysen, Pablo
    De Graeve, Marilyn
    Lipenga, Trancizeo
    McCormack, Valerie
    Goessens, Tess
    Van Hecke, Thomas
    Engelen, Liesa
    Poma, Giulia
    De Smet, Stefaan
    De Saeger, Sarah
    Covaci, Adrian
    Nawrot, Tim
    Vanhaecke, Lynn
    De Boevre, Marthe
    Hemeryck, Lieselot Y.
    ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2024, 65 : 101 - 101
  • [42] Valid data from large-scale proteomics studies
    Daniel Chamrad
    Helmut E Meyer
    Nature Methods, 2005, 2 : 647 - 648
  • [43] Mining large-scale smartphone data for personality studies
    Chittaranjan, Gokul
    Blom, Jan
    Gatica-Perez, Daniel
    PERSONAL AND UBIQUITOUS COMPUTING, 2013, 17 (03) : 433 - 450
  • [44] Mining large-scale smartphone data for personality studies
    Gokul Chittaranjan
    Jan Blom
    Daniel Gatica-Perez
    Personal and Ubiquitous Computing, 2013, 17 : 433 - 450
  • [45] Exploratory data analysis in large-scale genetic studies
    Teo, Yik Y.
    BIOSTATISTICS, 2010, 11 (01) : 70 - 81
  • [46] Large-Scale Data Harmonization Across Prospective Studies
    Pan, Ke
    Bazzano, Lydia A.
    Betha, Kalpana
    Charlton, Brittany M.
    Chavarro, Jorge E.
    Cordero, Christina
    Gunderson, Erica P.
    Haggerty, Catherine L.
    Hart, Jaime E.
    Jukic, Anne Marie
    Ley, Sylvia H.
    Mishra, Gita D.
    Mumford, Sunni L.
    Schisterman, Enrique F.
    Schliep, Karen
    Shaffer, Jeffrey G.
    Sotres-Alvarez, Daniela
    Stanford, Joseph B.
    Wilcox, Allen J.
    Wise, Lauren A.
    Yeung, Edwina
    Harville, Emily W.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (12) : 2033 - 2049
  • [47] Valid data from large-scale proteomics studies
    Chamrad, D
    Meyer, HE
    NATURE METHODS, 2005, 2 (09) : 647 - 648
  • [48] Dynamic and fast processing of queries on large-scale RDF data
    Yuan, Pingpeng
    Xie, Changfeng
    Jin, Hai
    Liu, Ling
    Yang, Guang
    Shi, Xuanhua
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 311 - 334
  • [49] Applying Combinatorial Testing to Large-scale Data Processing at Adobe
    Smith, Riley
    Jarman, Darryl
    Kuhn, Richard
    Kacker, Raghu
    Simos, Dimitris
    Kampel, Ludwig
    Leithner, Manuel
    Gosney, Gabe
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2019), 2019, : 190 - 193
  • [50] Determinants of Large-Scale Spatial Data Processing in Polish Mining
    Kosydor, Pawel
    Warchala, Ewa
    Krawczyk, Artur
    Piorkowski, Adam
    XIXTH CONFERENCE OF PHD STUDENTS AND YOUNG SCIENTISTS: INTERDISCIPLINARY TOPICS IN MINING AND GEOLOGY, 2020, 2209