MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
下载
收藏
页码:27 / 32
页数:6
相关论文
共 50 条
  • [41] HPDL: Towards a General Framework for High-performance Distributed Deep Learning
    Li, Dongsheng
    Lai, Zhiquan
    Ge, Keshi
    Zhang, Yiming
    Zhang, Zhaoning
    Sun, Tao
    Wang, Qinglin
    Wang, Huaimin
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1742 - 1753
  • [42] HDSVM: A High Efficiency Distributed SVM Framework over Data Stream
    Hou, Yan
    Wang, Yijie
    Ma, Xingkong
    Cheng, Li
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 352 - 359
  • [43] Software-Defined Networking for Scalable Cloud-based Services to Improve System Performance of Hadoop-based Big Data Applications
    Hagos, Desta Haileselassie
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (02) : 1 - 22
  • [44] A High-Performance Heterogeneous Critical Path Analysis Framework
    Zamani, Yasin
    Huang, Tsung-Wei
    2021 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2021,
  • [45] MaReIA: a cloud MapReduce based high performance whole slide image analysis framework
    Vo, Hoang
    Kong, Jun
    Teng, Dejun
    Liang, Yanhui
    Aji, Ablimit
    Teodoro, George
    Wang, Fusheng
    DISTRIBUTED AND PARALLEL DATABASES, 2019, 37 (02) : 251 - 272
  • [46] MaReIA: a cloud MapReduce based high performance whole slide image analysis framework
    Hoang Vo
    Jun Kong
    Dejun Teng
    Yanhui Liang
    Ablimit Aji
    George Teodoro
    Fusheng Wang
    Distributed and Parallel Databases, 2019, 37 : 251 - 272
  • [47] High-performance meteorological data processing framework for real-time analysis and visualization
    Mbogo, Gali-Ketema
    Rakitin, Stepan, V
    Visheratin, Alexander
    6TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, YSC 2017, 2017, 119 : 334 - 340
  • [48] A Coflow-based Co-optimization Framework for High-performance Data Analytics
    Cheng, Long
    Wang, Ying
    Pei, Yulong
    Epema, Dick
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 392 - 401
  • [49] Real-time digital forensic triaging for cloud data analysis using MapReduce on Hadoop framework
    Povar, Digambar
    Saibharath
    Geethakumari, G.
    INTERNATIONAL JOURNAL OF ELECTRONIC SECURITY AND DIGITAL FORENSICS, 2015, 7 (02) : 119 - 133
  • [50] A High-Performance Learning-Based Framework for Monocular 3-D Point Cloud Reconstruction
    Zamani, AmirHossein
    Ghaffari, Kamran
    Aghdam, Amir G.
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2024, 8 : 695 - 712