Storage Solution: A Virtual Distributed Storage And Migration Architecture For Big Data

被引:0
|
作者
Oluwarotimi, Randle [1 ]
Fezile, Matsebula [1 ]
Tranos, Zuva [2 ]
机构
[1] Sol Plaatje Univ, Kimberley, South Africa
[2] Vaal Univ Technol, Vanderbijlpark, South Africa
关键词
Virtualization; data Migration; big data; distributed computing; cloud computing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The big data revolution has provided organizations with a large variety of data which assist in making decisions as well as providing data analysts with high volumes of data for prediction and pattern recognition. This data is stored in the cloud as it has proven to be a good storage environment due to its accessibility and security benefits.The cloud provides data for various applications such as gaming activities and data for prediction analysis in various sectors of the economy.The provision of these data to end-users such as data analysts is best provided through the use virtual desktops which require data regularly (real-time) and at a high speed, efficiency and performance levels. To achieve this users expect services to be hosted on virtual machines in interrelated data centres and that these virtual machines will migrate dynamically to locations best suited for the user as well as connect the new users.This leads to our current problem which is how can we provide high performance to manage large volumes of data from the cloud as well as how can data can be stored in such a manner that they can be easily retrieved and migrated between servers. We propose an Architecture in this paper by using Alluxio and our novel Dynamic Virtual Machine Server (DVMS) to speed up the process as well as ensure there is no delay. We further apply two plugins to Hadoop which are Sqoop and Network levitated Merge (NLM) which will assist to improve the transfer speed of data from the cloud to Hadoop to increase efficiency. The dynamic virtual machine manages the large and growing data load by categorising the data into 3 categories of pools called (1) Raw aggregated data pool, (2) Aggregated data to send and (3) Processed aggregated data pool which works in a loop to increase data migration speed as well as provide a medium to store data in preparation for new users.
引用
收藏
页码:260 / 264
页数:5
相关论文
共 50 条
  • [41] IoT Meets Blockchain: Parallel Distributed Architecture for Data Storage and Sharing
    Liu, Shaowei
    Wu, Jing
    Long, Chengnian
    [J]. IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 1355 - 1360
  • [42] A Flexible Data Migration Strategy for Power Savings in Distributed Storage Systems
    Hasebe, Koji
    Takai, Sho
    Kato, Kazuhiko
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SMART CITIES AND GREEN ICT SYSTEMS (SMARTGREENS), 2018, : 352 - 357
  • [43] Distributed Data Storage with Data Versioning
    Hejtmanek, Lukas
    [J]. CESNET CONFERENCE 2006: FIRST CESNET CONFERENCE ON ADVANCED COMMUNICATIONS AND GRIDS, 2006, : 93 - 104
  • [44] Optimizing Virtual Machine Live Storage Migration in Heterogeneous Storage Environment
    Zhou, Ruijin
    Liu, Fang
    Li, Chao
    Li, Tao
    [J]. ACM SIGPLAN NOTICES, 2013, 48 (07) : 73 - 84
  • [45] An Efficient Approach for Storage of Big Data Streams in Distributed Stream Processing Systems
    Alshamrani, Sultan
    Waseem, Quadri
    Alharbi, Abdullah
    Alosaimi, Wael
    Turabieh, Hamza
    Alyami, Hashem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 91 - 98
  • [46] Modeling of distributed file System in big data storage by event-B
    Ali, Ammar Alhaj
    Varacha, Pavel
    Krayem, Said
    Jasek, Roman
    Zacek, Petr
    Chramcov, Bronislav
    [J]. 22ND INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATIONS AND COMPUTERS (CSCC 2018), 2018, 210
  • [47] DLSM: Decoupled Live Storage Migration with Distributed Device Mapper Storage
    Zhang, Zhaoning
    Li, Ziyang
    Wu, Kui
    Li, Huiba
    Peng, Yuxing
    Lu, Xicheng
    [J]. 2014 IEEE 8TH INTERNATIONAL SYMPOSIUM ON SERVICE ORIENTED SYSTEM ENGINEERING (SOSE), 2014, : 82 - 89
  • [48] Distributed Storage for Data Security
    Bracher, Annina
    Hof, Eran
    Lapidoth, Amos
    [J]. 2014 IEEE INFORMATION THEORY WORKSHOP (ITW), 2014, : 506 - 510
  • [49] CBase: Fast Virtual Machine storage data migration with a new data center structure
    Zhang, Fei
    Liu, Guangming
    Zhao, Bo
    Kasprzak, Piotr
    Fu, Xiaoming
    Yahyapour, Ramin
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 124 : 14 - 26
  • [50] Distributed deduplication with fingerprint index management model for big data storage in the cloud
    S. Sabeetha Saraswathi
    N. Malarvizhi
    [J]. Evolutionary Intelligence, 2021, 14 : 683 - 690