Data De duplication Using N0SQL Databases in Cloud

被引:0
|
作者
Backialakshmi, N. [1 ]
Manikandan, M. [2 ]
机构
[1] Adhiyamaan Coll Engn, Comp Sci & Engn, Hosur, Tamil Nadu, India
[2] Adhiyamaan Coll Engn, Dept CSE, Hosur, Tamil Nadu, India
关键词
Data De duplication; NoSQL Databases; Map reduce;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data de-duplication has an important role in reducing storage consumption to make it affordable to manage in today's explosive data growth. Numerous DD methodologies like chunking and, delta encoding are available to optimize the use of storage. These technologies approach DD at file and/or sub-file level but this approach has never been optimal for NoSQL DBs. This method proposes data De-Duplication in NoSQL Databases (DDNSDB) which makes use of a DD approach at a higher level of abstraction. The main goals of this research is, to maximally reduce the amount of duplicates in one type of NoSQL DBs, namely the key-value store, to maximally increase the process performance such that the backup window is marginally affected, and to design with horizontal scaling in mind such that it would run on a Cloud Platform competitively. Also, by following an optimized adapted Map Reduce architecture, DDNSDB proves to have competitive performance advantage in a horizontal scaling cloud environment compared with a vertical scaling environment.
引用
收藏
页数:4
相关论文
共 50 条
  • [11] Internet of Things Data Storage Infrastructure in the Cloud Using NoSQL Databases
    Vanelli, B.
    da Silva, M. P.
    Manerichi, G.
    Pinto, A. R.
    Dantas, M. A. R.
    Ferrandin, M.
    Boava, A.
    IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (04) : 737 - 743
  • [12] QUERYING UNCERTAIN DATA IN GEOSPATIAL OBJECT-RELATIONAL DATABASES USING SQL AND FUZZY SETS
    Duraciova, R.
    SLOVAK JOURNAL OF CIVIL ENGINEERING, 2013, 21 (04) : 1 - 12
  • [13] Multi-bit data de-duplication-based cloud storage channel covert
    Han, Li Chu
    MEASUREMENT, 2019, 144 : 52 - 57
  • [14] Improving Accessing Efficiency of Cloud Storage Using De-Duplication and Feedback Schemes
    Wu, Tin-Yu
    Pan, Jeng-Shyang
    Lin, Chia-Fan
    IEEE SYSTEMS JOURNAL, 2014, 8 (01): : 208 - 218
  • [15] Efficient Retrieval of Data from Cloud Databases using Hash Partitioned Buckets
    Nair, Abhishek M.
    Dewangan, Aman
    Mary, Geetha A.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [16] Decentralized and Privacy Sensitive Data De-Duplication Framework for Convenient Big Data Management in Cloud Backup Systems
    Jeslin, J. Gnana
    Kumar, P. Mohan
    SYMMETRY-BASEL, 2022, 14 (07):
  • [17] Finite State Automata Based Cryptosystem for Secure Data Sharing and De-duplication in Cloud Computing
    Basappa B. Kodada
    Demian Antony D’Mello
    D. K. Santhosh Kumar
    SN Computer Science, 5 (6)
  • [18] Large-Scale Data Management System Using Data De-duplication System
    Abirami, S.
    Vikraman, Rashmi
    Murugappan, S.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 225 - 234
  • [19] Albatross: Lightweight Elasticity in Shared Storage Databases for the Cloud using Live Data Migration
    Das, Sudipto
    Nishimura, Shoji
    Agrawal, Divyakant
    El Abbadi, Amr
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (08): : 494 - 505
  • [20] Parity Data De-Duplication in All Flash Array-Based OpenStack Cloud Block Storage
    Heo, Huiseong
    Ahn, Cheongjin
    Kim, Deok-Hwan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (05): : 1384 - 1387