Storage Solution: A Virtual Distributed Storage And Migration Architecture For Big Data

被引:0
|
作者
Oluwarotimi, Randle [1 ]
Fezile, Matsebula [1 ]
Tranos, Zuva [2 ]
机构
[1] Sol Plaatje Univ, Kimberley, South Africa
[2] Vaal Univ Technol, Vanderbijlpark, South Africa
关键词
Virtualization; data Migration; big data; distributed computing; cloud computing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The big data revolution has provided organizations with a large variety of data which assist in making decisions as well as providing data analysts with high volumes of data for prediction and pattern recognition. This data is stored in the cloud as it has proven to be a good storage environment due to its accessibility and security benefits.The cloud provides data for various applications such as gaming activities and data for prediction analysis in various sectors of the economy.The provision of these data to end-users such as data analysts is best provided through the use virtual desktops which require data regularly (real-time) and at a high speed, efficiency and performance levels. To achieve this users expect services to be hosted on virtual machines in interrelated data centres and that these virtual machines will migrate dynamically to locations best suited for the user as well as connect the new users.This leads to our current problem which is how can we provide high performance to manage large volumes of data from the cloud as well as how can data can be stored in such a manner that they can be easily retrieved and migrated between servers. We propose an Architecture in this paper by using Alluxio and our novel Dynamic Virtual Machine Server (DVMS) to speed up the process as well as ensure there is no delay. We further apply two plugins to Hadoop which are Sqoop and Network levitated Merge (NLM) which will assist to improve the transfer speed of data from the cloud to Hadoop to increase efficiency. The dynamic virtual machine manages the large and growing data load by categorising the data into 3 categories of pools called (1) Raw aggregated data pool, (2) Aggregated data to send and (3) Processed aggregated data pool which works in a loop to increase data migration speed as well as provide a medium to store data in preparation for new users.
引用
收藏
页码:260 / 264
页数:5
相关论文
共 50 条
  • [1] Big Data Storage Solution: Collinear Holographic Data Storage System
    Tan, Xiaodi
    Horimai, Hideyoshi
    Arai, Ryo
    Ikeda, Junichi
    Inoue, Mitsuteru
    Lin, Xiao
    Xu, Ke
    Liu, Jinpeng
    Huang, Yong
    [J]. 2016 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2016,
  • [2] Scalable Distributed Storage for Big Scientific Data
    Kokoulin, Andrey N.
    Yuzhakov, Aleksandr A.
    Kiryanov, Dmitriy A.
    [J]. PROCEEDINGS OF THE 2018 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2018, : 1099 - 1103
  • [3] Architecture of Distributed Data Storage for Astroparticle Physics
    Kryukov A.P.
    Demichev A.P.
    [J]. Lobachevskii Journal of Mathematics, 2018, 39 (9) : 1199 - 1206
  • [4] A Big Data Storage Scheme Based on Distributed Storage Locations and Multiple Authorizations
    Al-Odat, Zeyad A.
    Al-Qtiemat, Eman M.
    Khan, Samee U.
    [J]. 2019 IEEE 5TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY) / IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC) / IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2019, : 13 - 18
  • [5] BlueDBM: Distributed Flash Storage for Big Data Analytics
    Jun, Sang-Woo
    Liu, Ming
    Lee, Sungjin
    Hicks, Jamey
    Ankcorn, John
    King, Myron
    Xu, Shuotao
    Arvind
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2016, 34 (03):
  • [6] Big Data Storage Architecture Design in Cloud Computing
    Chen, Xuebin
    Wang, Shi
    Dong, Yanyan
    Wang, Xu
    [J]. BIG DATA TECHNOLOGY AND APPLICATIONS, 2016, 590 : 7 - 14
  • [7] Boafft: Distributed Deduplication for Big Data Storage in the Cloud
    Luo, Shengmei
    Zhang, Guangyan
    Wu, Chengwen
    Khan, Samee U.
    Li, Keqin
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (04) : 1199 - 1211
  • [8] Big Data Distributed Storage and Processing Case Studies
    Islam, Tariqul
    Abid, Mehedi Hasan
    [J]. THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND CAPSULE NETWORKS (ICIPCN 2022), 2022, 514 : 826 - 837
  • [9] A Virtual Cloud Storage Architecture for Enhanced Data Security
    Kumar, M. Antony Joans
    Columbus, C. Christopher
    Ben George, E.
    Raj, T. Ajith Bosco
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1735 - 1747
  • [10] Challenges and Benefits of Deploying Big Data Storage Solution
    Kachaoui, Jabrane
    Belangour, Abdessamad
    [J]. PROCEEDINGS OF THE SECOND CONFERENCE OF THE MOROCCAN CLASSIFICATION SOCIETY: NEW CHALLENGES IN DATA SCIENCES (SMC '2019), 2019, : 150 - 154