An Infrastructure for Automating Large-scale Performance Studies and Data Processing

Authors
Jayasinghe, Deepal [1]
Kimball, Josh [1]
Zhu, Tao [1]
Choudhary, Siddharth [1]
Pu, Calton [1]
Affiliations
[1] Georgia Inst Technol, Ctr Expt Res Comp Syst, Atlanta, GA 30332 USA
Keywords
Automation; Benchmarking; Cloud; Code Generation; Data Warehouse; ETL; Performance; Visualization
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The Cloud has enabled the computing model to shift from traditional data centers to publicly shared computing infrastructure; yet, applications leveraging this new computing model can experience performance and scalability issues, which arise from the hidden complexities of the cloud. The most reliable path for better understanding these complexities is an empirically based approach that relies on collecting data from a large number of performance studies. Armed with this performance data, we can understand what has happened, why it happened, and more importantly, predict what will happen in the future. However, this approach presents challenges itself, namely in the form of data management. We attempt to mitigate these data challenges by fully automating the performance measurement process. Concretely, we have developed an automated infrastructure, which reduces the complexity of the large-scale performance measurement process by generating all the necessary resources to conduct experiments, to collect and process data and to store and analyze data. In this paper, we focus on the performance data management aspect of our infrastructure.
Pages: 6