Data De-duplication Using NoSQL Databases in Cloud

Authors
Backialakshmi, N. [1]
Manikandan, M. [2]
Affiliations
[1] Adhiyamaan Coll Engn, Comp Sci & Engn, Hosur, Tamil Nadu, India
[2] Adhiyamaan Coll Engn, Dept CSE, Hosur, Tamil Nadu, India
Keywords
Data De-duplication; NoSQL Databases; MapReduce
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Data de-duplication (DD) plays an important role in reducing storage consumption, keeping storage affordable to manage amid today's explosive data growth. Numerous DD methodologies, such as chunking and delta encoding, are available to optimize the use of storage. These technologies approach DD at the file and/or sub-file level, but this approach has never been optimal for NoSQL databases. This paper proposes Data De-Duplication in NoSQL Databases (DDNSDB), which applies DD at a higher level of abstraction. The main goals of this research are to maximally reduce the number of duplicates in one type of NoSQL database, namely the key-value store; to maximize processing performance so that the backup window is only marginally affected; and to design for horizontal scaling so that the method runs competitively on a cloud platform. By following an optimized, adapted MapReduce architecture, DDNSDB is shown to have a competitive performance advantage in a horizontally scaled cloud environment compared with a vertically scaled one.
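The abstract does not spell out the DDNSDB algorithm, so the following is only a minimal sketch of the general idea it names: record-level de-duplication in a key-value store, organized as a map step and a reduce step. Every name here (`map_phase`, `reduce_phase`, `deduplicate`, the sample records) is hypothetical and not taken from the paper, and SHA-256 content hashing is an assumed way of detecting duplicates.

```python
import hashlib
from collections import defaultdict


def map_phase(records):
    """Map step: emit a (content_hash, key) pair for every record."""
    for key, value in records.items():
        digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
        yield digest, key


def reduce_phase(pairs):
    """Reduce step: group keys that share the same content hash."""
    groups = defaultdict(list)
    for digest, key in pairs:
        groups[digest].append(key)
    return dict(groups)


def deduplicate(records):
    """Return (index, blobs): each key points to one shared copy of its value.

    Assumption: identical values are exact byte-for-byte duplicates.
    """
    groups = reduce_phase(map_phase(records))
    index, blobs = {}, {}
    for digest, keys in groups.items():
        blobs[digest] = records[keys[0]]  # store the value only once
        for key in keys:
            index[key] = digest           # every key references that copy
    return index, blobs


# Hypothetical sample data: two of the three records hold the same value.
records = {
    "user:1": "alice@example.com",
    "user:2": "bob@example.com",
    "user:3": "alice@example.com",
}
index, blobs = deduplicate(records)
```

In this sketch the map step works on each record independently, so it could in principle be spread across many nodes, which is the property that horizontal scaling relies on. How DDNSDB actually partitions and merges the work is not stated in the abstract.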
Pages: 4