A data placement strategy for big data based on DCC in cloud computing systems

被引:2
|
作者
Wang, Tao [1 ,2 ]
Yao, Shihong [2 ]
Xu, Zhengquan [1 ,2 ]
Jia, Shan [2 ]
Xu, Qiang [2 ]
机构
[1] Collaborat Innovat Ctr Geospatial Technol, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
来源
2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY) | 2015年
基金
美国国家科学基金会;
关键词
Big Data; data placement; data scheduling; dynamic computation correlation;
D O I
10.1109/SmartCity.2015.139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In complex and data-intensive applications, data scheduling between data centers must occur when multiple datasets stored in distributed data centers are processed by one computation. To store massive datasets effectively and reduce data scheduling between data centers during the execution of computations, a mathematical model of data scheduling between data centers in cloud computing is built and dynamic computation correlation (DCC) between datasets is defined. Then a data placement strategy for big data based on DCC is proposed. Datasets with high DCC are placed into the same data center, and new datasets are dynamically distributed into the most appropriate data center. Comprehensive experiments show that the proposed strategy can effectively reduce the number of data scheduling between data centers and has a considerably low and almost constant computational complexity when the number of data centers increases and the datasets are massive. It can be expected that the proposed strategy will be applicable to the practical large-scale distributed storage systems for big data management.
引用
收藏
页码:623 / 630
页数:8
相关论文
共 50 条
  • [41] Cloud Computing Oriented Retrieval Technology based on Big Data
    Wang Xiao-shu
    Xie Yao
    Luo Huan
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 275 - 278
  • [42] AN ALGORITHM OF APRIORI BASED ON MEDICAL BIG DATA AND CLOUD COMPUTING
    Cui, Xiaoyan
    Yang, Shimeng
    Wang, Daming
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 361 - 365
  • [43] Study and Application of Big Data Mining Based on Cloud Computing
    Luo, Jinwei
    Li, Chunfei
    Huang, Fuping
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS, ENVIRONMENT, BIOTECHNOLOGY AND COMPUTER (MMEBC), 2016, 88 : 221 - 224
  • [44] Soft computing techniques for big data and cloud computing
    Gupta, B. B.
    Agrawal, Dharma P.
    Yamaguchi, Shingo
    Sheng, Michael
    SOFT COMPUTING, 2020, 24 (08) : 5483 - 5484
  • [45] Soft computing techniques for big data and cloud computing
    B. B. Gupta
    Dharma P. Agrawal
    Shingo Yamaguchi
    Michael Sheng
    Soft Computing, 2020, 24 : 5483 - 5484
  • [46] An Adaptive Data Placement Strategy in scientific workflows over Cloud Computing Environments
    Kim, Heewon
    Kim, Yoonhee
    NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,
  • [47] Cognitive systems and operations research in big data and cloud computing PREFACE
    Ogiela, Marek R.
    Ko, Hoon
    ANNALS OF OPERATIONS RESEARCH, 2018, 265 (02) : 183 - 186
  • [48] Research on the change strategy of hospital informatization in the age of the cloud computing and the big data
    Shi, Sen
    Fan, Jie
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 171 - 171
  • [49] Big Data and Cloud Computing: Pitfalls and Advantages in Data Management
    Shah, Neepa K.
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 643 - 648
  • [50] Research on Ship Data Big Data Parallel Scheduling Algorithm Based on Cloud Computing
    Li, Xin
    Guo, Jingjing
    JOURNAL OF COASTAL RESEARCH, 2019, : 535 - 539