RAMSYS: Resource-Aware Asynchronous Data Transfer with Multicore SYStems

被引:8
|
作者
Li, Tan [1 ]
Ren, Yufei [2 ]
Yu, Dantong [3 ,4 ]
Jin, Shudong [2 ]
机构
[1] VMware Inc, Palo Alto, CA 94304 USA
[2] SUNY Stony Brook, Stony Brook, NY 11794 USA
[3] New Jersey Inst Technol, Newark, NJ 07102 USA
[4] Brookhaven Natl Lab, Upton, NY 11973 USA
基金
美国能源部;
关键词
Multicore systems; input/output; high-speed data transfer; parallelism; asynchronous processing; pipelining; DESIGN;
D O I
10.1109/TPDS.2016.2619344
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
High-speed data transfer is vital to data-intensive computing that often requires moving large data volumes efficiently within a local data center and among geographically dispersed facilities. Effective utilization of the abundant resources in modern multicore environments for data transfer remains a persistent challenge, particularly, for Non-Uniform Memory Access (NUMA) systems wherein the locality of data accessing is an important factor. This requires rethinking how to exploit parallel access to data and to optimize the storage and network I/Os. We address this challenge and present a novel design of asynchronous processing and resource-aware task scheduling in the context of high-throughput data replication. Our software allocates multiple sets of threads to different stages of the processing pipeline, including storage I/O and network communication, based on their capacities. Threads belonging to each stage follow an asynchronous model, and attain high performance via multiple locality-aware and peer-aware mechanisms, such as task grouping, buffer sharing, affinity control and communication protocols. Our design also integrates high performance features to enhance the scalability of data transfer in several scenarios, e.g., file-level sorting, block-level asynchrony, and thread-level pipelining. Our experiments confirm the advantages of our software under different types of workloads and dynamic environments with contention for shared resources, including a 28-160 percent increase in bandwidth for transferring large files, 1.7-66 times speed-up for small files, and up to 108 percent larger throughput for mixed workloads compared with three state of the art alternatives, GridFTP, BBCP and Aspera.
引用
收藏
页码:1430 / 1444
页数:15
相关论文
共 50 条
  • [1] Resource-Aware Partitioned Scheduling for Heterogeneous Multicore Real-Time Systems
    Han, Jian-Jun
    Cai, Wen
    Zhu, Dakai
    [J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [2] MCEA: A Resource-Aware Multicore CGRA Architecture for the Edge
    Korol, Guilherme
    Jordan, Michael Guilherme
    Brandalero, Marcelo
    Huebner, Michael
    Rutzig, Mateus Beck
    Schneider Beck, Antonio Carlos
    [J]. 2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 33 - 39
  • [3] PARTIAL EXPANSION OF DATAFLOW GRAPHS FOR RESOURCE-AWARE SCHEDULING OF MULTICORE SIGNAL PROCESSING SYSTEMS
    Zaki, George
    Plishker, William
    Bhattacharyya, Shuvra S.
    Fruth, Frank
    [J]. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 385 - 392
  • [4] Resource-aware mining of data streams
    Gaber, MM
    Krishnaswamy, S
    Zaslavsky, A
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2005, 11 (08) : 1440 - 1453
  • [5] Resource-Aware Asynchronous Online Federated Learning for Nonlinear Regression
    Gauthier, Francois
    Gogineni, Vinay Chakravarthi
    Werner, Stefan
    Huang, Yih-Fang
    Kuh, Anthony
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 2828 - 2833
  • [6] Resource-Aware Scheduling for Dependable Multicore Real-Time Systems: Utilization Bound and Partitioning Algorithm
    Han, Jian-Jun
    Wang, Zhenjiang
    Gong, Sunlu
    Miao, Tianpeng
    Yang, Laurence T.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (12) : 2806 - 2819
  • [7] Resource-Aware Data Parallel Array Processing
    Clemens Grelck
    Cédric Blom
    [J]. International Journal of Parallel Programming, 2020, 48 : 652 - 674
  • [8] Ubiquitous Resource-Aware Clustering of Data Streams
    Chao, Ching-Ming
    Chao, Guan-Lin
    [J]. ENTERPRISE INFORMATION SYSTEMS, ICEIS 2011, 2012, 102 : 81 - 97
  • [9] Resource-Aware Data Parallel Array Processing
    Grelck, Clemens
    Blom, Cedric
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2020, 48 (04) : 652 - 674
  • [10] Mobile object systems: Resource-aware computation
    Bryce, C
    Czajkowski, C
    [J]. OBJECT-ORIENTED TECHNOLOGY, 2003, 3013 : 86 - 91