An autonomic operating environment for large-scale distributed applications

Cited by: 0
Authors
Lehman, TJ [1]
Deen, RG [1]
Kaufman, JH [1]
Affiliations
[1] IBM Corp, Almaden Res Ctr, Div Res, San Jose, CA 95120 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents the OptimalGrid Operating Environment and shows how it tackles the difficulty of automatically building, distributing and running connected distributed parallel programs. One of the main goals of OptimalGrid is to hide the complexity of working with grid applications, both the creation of the grid application itself and its preparation on the grid infrastructure. The OptimalGrid Operating Environment hides much of the complexity of building, deploying and running applications by using autonomic techniques such as goal-oriented operations, alternative workflow schedules and agent-peer communication. Unlike conventional script-based systems, OptimalGrid uses the abstraction of target goals, which gives it more flexibility in handling errors and unexpected system events. The target goals can be achieved in multiple ways, and the selection of a goal solution can be based on several factors, including previous success, peer information and user assistance. The main environment components, the Grid Director and Grid Manager, interoperate in a high-level workflow environment, so target goals can be retried with alternative solutions or achieved with human interaction. The use of high-level goals, with multiple solutions that are determined by past success, peer cooperation or human interaction, coupled with a flexible retry mechanism, results in a novel approach for distributed operating environments.
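The goal-oriented retry idea in the abstract can be illustrated with a minimal Python sketch. It is not OptimalGrid's actual implementation or API: the names `Solution`, `Goal`, `achieve`, `ask_user`, and the two copy functions are hypothetical, the neutral score for untried solutions is an assumption, and peer information is left out for brevity. The sketch only shows a goal with several alternative solutions, ranked by past success, retried in turn, with an optional human fallback.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Solution:
    """One hypothetical way of achieving a goal."""
    name: str
    action: Callable[[], bool]  # returns True on success
    successes: int = 0
    attempts: int = 0

    def success_rate(self) -> float:
        # Assumption: untried solutions get a neutral score of 0.5.
        return self.successes / self.attempts if self.attempts else 0.5


@dataclass
class Goal:
    """A target goal that can be reached by any of several solutions."""
    name: str
    solutions: List[Solution]

    def achieve(self, ask_user: Optional[Callable[[str], bool]] = None) -> str:
        # Rank alternatives by past success; ties keep their listed order.
        ranked = sorted(self.solutions, key=lambda s: s.success_rate(), reverse=True)
        for solution in ranked:
            solution.attempts += 1
            try:
                ok = solution.action()
            except Exception:
                ok = False  # a crashing solution counts as a failed attempt
            if ok:
                solution.successes += 1
                return solution.name
        # Every automatic solution failed: fall back to human interaction.
        if ask_user is not None and ask_user(self.name):
            return "user"
        raise RuntimeError(f"goal {self.name!r} could not be achieved")


# Hypothetical solutions for a hypothetical "deploy code" goal.
def copy_via_scp() -> bool:
    return False  # simulated failure


def copy_via_http() -> bool:
    return True  # simulated success


scp = Solution("scp", copy_via_scp)
http = Solution("http", copy_via_http)
goal = Goal("deploy code", [scp, http])

first = goal.achieve()   # tries scp, which fails, then http
second = goal.achieve()  # http now ranks first, so scp is not retried
```

In this sketch the second call skips the failing solution because its recorded success rate has dropped below that of the alternative, which is one simple way to read "selection based on previous success".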
Pages: 81-99
Number of pages: 19