SPBD:Streamlining Big-Data Processing in Cloud Environments

被引:1
|
作者
Tung Nguyen [1 ]
Jingwen Zhang [1 ]
Weisong Shi [1 ]
机构
[1] Department of Computer Science,Wayne State University
关键词
bigdata; genomics; NGS; MapReduce; cloud;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Many applications,such as those in genomics,are designed for one machine.This is not problematic if the input data set is small and can fit into the memory of a single powerful machine.However,the application and its algorithms are limited by the capacity and performance of the machine(the application cannot run in parallel).A single machine cannot handle very large data sets.In recent research,cloud computing and MapReduce have been used together to store and process big data.There are three main steps in handling data in the cloud:1) the user uploads the data,2) the data is processed,and 3) results are returned.When the size of the data reaches a certain scale,transmission time becomes the dominant factor;however,most research to date has only been focused on reducing the processing time.Also,it is generally assumed that the data is already stored in the cloud.This assumption does not hold because many organizations now store their data locally.In this paper,we propose SPBD(pronounced"speed") to minimize overall user wait time.We abstract overall processing time as an optimization problem and derive the optimal solution.When evaluated on our private cloud platform,SPBD is shown to reduce user wait time by up to 34% for a traditional WordCount application and up to 31% for a metagenomic application.
引用
收藏
页码:30 / 37
页数:8
相关论文
共 50 条
  • [1] Big Data Processing in Cloud Environments
    Tsuchiya, Satoshi
    Sakamoto, Yoshinori
    Tsuchimoto, Yuichi
    Lee, Vivian
    [J]. FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2012, 48 (02): : 159 - 168
  • [2] Reliable and Scalable Big-Data Applications in Edge Cloud Environments
    Ko, In -Young
    Srivastava, Abhishek
    Mrissa, Michael
    [J]. JOURNAL OF WEB ENGINEERING, 2023, 22 (02): : V - VII
  • [3] Big Data Processing in Cloud Computing Environments
    Ji, Changqing
    Li, Yu
    Qiu, Wenming
    Awada, Uchechukwu
    Li, Keqiu
    [J]. PROCEEDINGS OF THE 2012 12TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (I-SPAN 2012), 2012, : 17 - 23
  • [4] Big Data Processing in Cloud Computing Environments
    Noraziah, A.
    Fakherldin, Mohammed Adam Ibrahim
    Adam, Khalid
    Majid, Mazlina Abdul
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (11) : 11092 - 11095
  • [5] GEOSPATIAL BIG DATA PROCESSING IN HYBRID CLOUD ENVIRONMENTS
    Simonis, Ingo
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 419 - 421
  • [6] Big-data in cloud computing: a taxonomy of risks
    Miller, Holmes E.
    [J]. INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2013, 18 (01):
  • [8] A big-data processing framework for uncertainties in transportation data
    Yang, Jie
    Ma, Jun
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [9] Data Modifications in Blockchain Architecture for Big-Data Processing
    Tulkinbekov, Khikmatullo
    Kim, Deok-Hwan
    [J]. SENSORS, 2023, 23 (21)
  • [10] Analysis and Optimization of Big-Data Stream Processing
    Vakilinia, Shahin
    Zhang, Xinyao
    Qiu, Dongyu
    [J]. 2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,