ASKALON: a tool set for cluster and Grid computing

被引:100
|
作者
Fahringer, T
Jugravu, A
Pllana, S
Prodan, R
Seragiotto, CJ
Truong, HL
机构
[1] Univ Innsbruck, Inst Comp Sci, A-6020 Innsbruck, Austria
[2] Univ Vienna, Inst Software Sci, A-1090 Vienna, Austria
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2005年 / 17卷 / 2-4期
关键词
cluster computing; Grid computing; parallel and distributed applications; performance prediction; measurement and analysis; bottleneck detection; experiment management;
D O I
10.1002/cpe.929
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance engineering of parallel and distributed applications is a complex task that iterates through various phases, ranging from modeling and prediction, to performance measurement, experiment management, data collection, and bottleneck analysis. There is no evidence so far that all of these phases should/can be integrated into a single monolithic tool. Moreover, the emergence of computational Grids as a common single wide-area platform for high-performance computing raises the idea to provide tools as interacting Grid services that share resources, support interoperability among different users and tools, and, most importantly, provide omnipresent services over the Grid. We have developed the ASKALON tool set to support performance-oriented development of parallel and distributed (Grid) applications. ASKALON comprises four tools, coherently integrated into a service-oriented architecture. SCALEA is a performance instrumentation, measurement, and analysis tool of parallel and distributed applications. ZENTURIO is a general purpose experiment management tool with advanced support for multi-experiment performance analysis and parameter studies. AKSUM provides semi-automatic highlevel performance bottleneck detection through a special-purpose performance property specification language. The PerformanceProphet enables the user to model and predict the performance of parallel applications at the early stages of development. In this paper we describe the overall architecture of the ASKALON tool set and outline the basic functionality of the four constituent tools. The structure of each tool is based on the composition and sharing of remote Grid services, thus enabling tool interoperability. In addition, a data repository allows the tools to share the common application performance and output data that have been derived by the individual tools. A service repository is used to store common portable Grid service implementations. A general-purpose Factory service is employed to create service instances on arbitrary remote Grid sites. Discovering and dynamically binding to existing remote services is achieved through registry services. The ASKALON visualization diagrams support both online and postmortem visualization of performance and output data. We demonstrate the usefulness and effectiveness of ASKALON by applying the tools to real-world applications. Copyright (C) 2005 John Wiley Sons, Ltd.
引用
收藏
页码:143 / 169
页数:27
相关论文
共 50 条
  • [31] Research on performance tool in HPC and grid computing
    Shi, PZ
    Li, SL
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1143 - 1146
  • [32] Semi-automatic Composition of Ontologies for ASKALON Grid Workflows
    Malik, Muhammad Junaid
    Fahringer, Thomas
    Prodan, Radu
    EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT I, 2012, 7155 : 169 - 180
  • [33] Towards a light-weight workflow engine in the ASKALON Grid environment
    Qin, Jun
    Wieczorek, Marek
    Plankensteiner, Kassian
    Fahringer, Thomas
    TOWARDS NEXT GENERATION GRIDS, 2007, : 239 - +
  • [34] Online analysis and runtime steering of dynamic workflows in the ASKALON grid enviromnent
    Prodan, Radu
    CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, 2007, : 389 - +
  • [35] A logical cluster-based Grid computing platform
    Koh, KW
    Park, KL
    Lee, HJ
    Kim, SD
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 64 - 70
  • [36] Extending an SSI cluster for resource discovery in grid computing
    Echaiz, Javier
    Ardenghi, Jorge
    GCC 2005: FIFTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2006, : 287 - +
  • [37] Chirp: a practical global filesystem for cluster and Grid computing
    Thain, Douglas
    Moretti, Christopher
    Hemmes, Jeffrey
    JOURNAL OF GRID COMPUTING, 2009, 7 (01) : 51 - 72
  • [38] Topic 6: Grid, Cluster and Cloud Computing (Introduction)
    Laure, Erwin
    Kao, Odej
    Badia, Rosa M.
    Lefevre, Laurent
    Di Martino, Beniamino
    Prodan, Radu
    Turilli, Matteo
    Warneke, Daniel
    EURO-PAR 2013 PARALLEL PROCESSING, 2013, 8097 : 241 - 241
  • [39] A user-centric cluster and grid computing portal
    Elmroth, E
    Nylén, M
    Oscarsson, R
    2005 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2005, : 103 - 110
  • [40] A user-centric cluster and grid computing portal
    Elmroth, Erik
    Nylen, Mats
    Oscarsson, Roger
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2009, 4 (02) : 127 - 134