MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
下载
收藏
页码:27 / 32
页数:6
相关论文
共 50 条
  • [21] A security framework in G-Hadoop for big data computing across distributed Cloud data centres
    Zhao, Jiaqi
    Wang, Lizhe
    Tao, Jie
    Chen, Jinjun
    Sun, Weiye
    Ranjan, Rajiv
    Kolodziej, Joanna
    Streit, Achim
    Georgakopoulos, Dimitrios
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (05) : 994 - 1007
  • [22] SSFile: A novel column-store for efficient data analysis in Hadoop-based distributed systems
    Son, Jihoon
    Ryu, Hyoseok
    Yi, Sungmin
    Chung, Yon Dohn
    INFORMATION SCIENCES, 2015, 316 : 68 - 86
  • [23] An Efficient Hadoop-Based Framework for Data Storage and Fault Recovering in Large-Scale Multimedia Sensor Networks
    Saad, Ghina
    Harb, Hassan
    Abouaissa, Abdelhafid
    Idoumghar, Lhassane
    Charara, Nour
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 316 - 321
  • [24] A COMPARATIVE ANALYSIS OF CONVENTIONAL HADOOP WITH PROPOSED CLOUD ENABLED HADOOP FRAMEWORK FOR SPATIAL BIG DATA PROCESSING
    Tripathi, A. K.
    Agrawal, S.
    Gupta, R. D.
    ISPRS TC V MID-TERM SYMPOSIUM GEOSPATIAL TECHNOLOGY - PIXEL TO PEOPLE, 2018, 4-5 : 425 - 430
  • [25] A Hadoop-Based Framework for Large-Scale Landmine Detection Using Ubiquitous Big Satellite Imaging Data
    El-Kazzaz, Sahar
    El-Mahdy, Ahmed
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 274 - 278
  • [26] Robustness Analysis Framework for High-Performance Data Centers based on Immune Genetic Algorithm
    Mai, Xiaodong
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 459 - 462
  • [27] NetANNS: A High-Performance Distributed Search Framework Based On In-Network Computing
    Zhang, Penghao
    Pan, Heng
    Li, Zhenyu
    Xie, Gaogang
    Cui, Penglai
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 271 - 278
  • [28] Performance Evaluation of Unstructured NoSQL data over distributed framework
    Nyati, Suyog S.
    Pawar, Shivanand
    Ingle, Rajesh
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1623 - 1627
  • [29] Payload fragmentation framework for high-performance computing in cloud environment
    Vivek, V.
    Srinivasan, R.
    Blessing, R. Elijah
    Dhanasekaran, R.
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (05): : 2789 - 2804
  • [30] Payload fragmentation framework for high-performance computing in cloud environment
    V. Vivek
    R. Srinivasan
    R. Elijah Blessing
    R. Dhanasekaran
    The Journal of Supercomputing, 2019, 75 : 2789 - 2804