The Dawn of Big Data - Hbase

被引:0
|
作者
Bhupathiraju, Vijayalakshmi [1 ]
Ravuri, Ravi Prasad [1 ]
机构
[1] Padmasri Dr BV Raju Inst Tehnol, Dept MCA, Hyderabad, Andhra Pradesh, India
关键词
HBase; Hadoop Distributed File System (HDFS); HBase column oriented table;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
HBase is a distributed column-oriented database built on top of HDFS. HBase is the Hadoop application to use when you require real-time read/write random access to very large datasets. HBase is a scalable data store targeted at random read and write access of (fairly-) structured data. It's modeled after Google's Big table and targeted to support large tables, on the order of billions of rows and millions of columns. It uses HDFS as the underlying file system and is designed to be fully distributed and highly available. Version 0.20 introduces significant performance improvement. Base's Table Input Format is designed to allow a Map Reduce program to operate on data stored in an HBase table. Table Output Format is for writing Map Reduce outputs into an HBase table. HBase has different storage characteristics than HDFS, such as the ability to do row updates and column indexing, so we can expect to see these features used by Hive in future releases. It is already possible to access HBase tables from Hive. This paper includes the step by step introduction to the HBase, Identify differences between apache HBase and a traditional RDBMS, The Problem with Relational Database Systems, Relation between the Hadoop and HBase, How an Apache HBase table is physically stored on disk. Later part of this paper introduces Map Reduce, HBase table and how Apache HBase Cells stores data, what happens to data when it is deleted. Last part explains difference between Big Data and HBase, Conclusion followed with the References.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] HBase storage schemas for massive spatial vector data
    Yong Wang
    Chengjun Li
    Meng Li
    Zhenling Liu
    Cluster Computing, 2017, 20 : 3657 - 3666
  • [32] Big data (Big data)
    Miguel Castagnino, Juan
    ACTA BIOQUIMICA CLINICA LATINOAMERICANA, 2018, 52 (03): : 279 - 280
  • [33] Big data is or big data are
    Samaranayake, L.
    BRITISH DENTAL JOURNAL, 2018, 224 (12) : 916 - 916
  • [34] Big data is or big data are
    L. Samaranayake
    British Dental Journal, 2018, 224 : 916 - 916
  • [35] Massive AIS Data Management Based on HBase and Spark
    Qin, Jiwei
    Ma, Liangli
    Niu, Jinghua
    2018 3RD ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS 2018), 2018, : 112 - 117
  • [36] Benchmarking Encrypted Data Storage in HBase and Cassandra with YCSB
    Waage, Tim
    Wiese, Lena
    FOUNDATIONS AND PRACTICE OF SECURITY (FPS 2014), 2015, 8930 : 311 - 325
  • [37] HBase storage schemas for massive spatial vector data
    Wang, Yong
    Li, Chengjun
    Li, Meng
    Liu, Zhenling
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3657 - 3666
  • [38] ATLAS Data Management Accounting with Hadoop Pig and HBase
    Lassnig, Mario
    Garonne, Vincent
    Dimitrov, Gancho
    Canali, Luca
    INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS 2012 (CHEP2012), PTS 1-6, 2012, 396
  • [39] BESIII Physics Data Storing and Processing on HBase and MapReduce
    Lei, Xiaofeng
    Li, Qiang
    Kan, Bowen
    Sun, Gongxing
    Sun, Zhenyu
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [40] Massive Image Data Management using HBase and MapReduce
    Liu, Yuehu
    Chen, Bin
    He, Wenxi
    Fang, Yu
    2013 21ST INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS), 2013,