The hector distributed run-time environment

被引:12
|
作者
Russ, SH
Robinson, J
Flachs, BK
Heckel, B
机构
[1] Mississippi State Univ, Engn Res Ctr, Mississippi State, MS 39762 USA
[2] Adv Microelect, Ridgeland, MS 39157 USA
[3] IBM Corp, Austin Res Lab, Austin, TX 78758 USA
[4] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
parallel computing; load balancing; fault tolerance; resource allocation; task migration;
D O I
10.1109/71.735957
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Harnessing the computational capabilities of a network of workstations promises to off-load work from overloaded supercomputers onto largely idle resources overnight. Several capabilities are needed to do this, including support for an architecture-independent parallel programming environment, task migration, automatic resource allocation, and fault tolerance. the Hector distributed run-time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment on homogeneous clusters with no modifications to their source code needed. The design of Hector, its internal structure, and several benchmarks and tests are presented.
引用
收藏
页码:1102 / 1114
页数:13
相关论文
共 50 条
  • [41] A unified codesign run-time environment for the UltraSONIC reconfigurable computer
    Wiangtong, T
    Cheung, PYK
    Luk, W
    [J]. FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2003, 2778 : 396 - 405
  • [42] Design of run-time configuration manager for a multimedia network environment
    Park, TK
    Kim, C
    Hong, JW
    [J]. PROCEEDINGS OF THE IEEE SECOND INTERNATIONAL WORKSHOP ON SYSTEMS MANAGEMENT, 1996, : 9 - 14
  • [43] Enhance Run-time Performance with a Collaborative Distributed Speech Recognition Framework
    Kurpukdee, Nattapong
    Sertsi, Phuttapong
    Chunwijitra, Sila
    Chunwijitra, Vataya
    Chotimongkol, Ananlada
    Wutiwiwatchai, Chai
    [J]. 2015 INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC), 2015, : 204 - 209
  • [44] Run-time technique for parallel loop identification based on distributed system
    Yang, Xue-Lin
    Yu, Meng
    Chen, Dao-Xu
    Xie, Li
    [J]. Ruan Jian Xue Bao/Journal of Software, 2002, 13 (08): : 1718 - 1722
  • [45] A Run-time System for Efficient Execution of Scientific Workflows on Distributed Environments
    George Teodoro
    Tulio Tavares
    Renato Ferreira
    Tahsin Kurc
    Wagner Meira
    Dorgival Guedes
    Tony Pan
    Joel Saltz
    [J]. International Journal of Parallel Programming, 2008, 36 : 250 - 266
  • [46] JADE: a versatile run-time for distributed applications on mobile terminals and networks
    Caire, G
    Rimassa, G
    Bellifernine, F
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 1882 - 1888
  • [47] Automated Code Synthesis for Run-Time Verification of Distributed Embedded Systems
    Majzik, Istvan
    Horanyi, Gergo
    [J]. 12TH SYMPOSIUM ON PROGRAMMING LANGUAGES AND SOFTWARE TOOLS, SPLST' 11, 2011, : 161 - 172
  • [48] Schooner: An object-oriented run-time support for distributed applications
    Furmento, N
    Baude, F
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS - PROCEEDINGS OF THE ISCA 9TH INTERNATIONAL CONFERENCE, VOLS I AND II, 1996, : 31 - 36
  • [49] DVE-RTI: Distributed interactive simulation run-time infrastructure
    Lu, Liang-Quan
    Zhou, Zhong
    Wu, Wei
    Zhao, Qin-Ping
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2004, 41 (05): : 828 - 834
  • [50] A run-time system for efficient execution of scientific workflows on distributed environments
    Teodoro, George
    Tavares, Tulio
    Ferreira, Renato
    Kurc, Tahsin
    Meira, Wagner, Jr.
    Guedes, Dorgival
    Pan, Tony
    Saltz, Joel
    [J]. SBAC-OAD 2006: 18TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, 2006, : 81 - +