The Dawn of Big Data - Hbase

被引:0
|
作者
Bhupathiraju, Vijayalakshmi [1 ]
Ravuri, Ravi Prasad [1 ]
机构
[1] Padmasri Dr BV Raju Inst Tehnol, Dept MCA, Hyderabad, Andhra Pradesh, India
关键词
HBase; Hadoop Distributed File System (HDFS); HBase column oriented table;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
HBase is a distributed column-oriented database built on top of HDFS. HBase is the Hadoop application to use when you require real-time read/write random access to very large datasets. HBase is a scalable data store targeted at random read and write access of (fairly-) structured data. It's modeled after Google's Big table and targeted to support large tables, on the order of billions of rows and millions of columns. It uses HDFS as the underlying file system and is designed to be fully distributed and highly available. Version 0.20 introduces significant performance improvement. Base's Table Input Format is designed to allow a Map Reduce program to operate on data stored in an HBase table. Table Output Format is for writing Map Reduce outputs into an HBase table. HBase has different storage characteristics than HDFS, such as the ability to do row updates and column indexing, so we can expect to see these features used by Hive in future releases. It is already possible to access HBase tables from Hive. This paper includes the step by step introduction to the HBase, Identify differences between apache HBase and a traditional RDBMS, The Problem with Relational Database Systems, Relation between the Hadoop and HBase, How an Apache HBase table is physically stored on disk. Later part of this paper introduces Map Reduce, HBase table and how Apache HBase Cells stores data, what happens to data when it is deleted. Last part explains difference between Big Data and HBase, Conclusion followed with the References.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Dawn of big data
    Champagne, Dave
    CHEMISTRY & INDUSTRY, 2013, 77 (05) : 4 - 5
  • [2] YEfficient Spatial Big Data Storage and Query in HBase
    Wang, Ping
    Xu, Fanhua
    Ma, Meng
    Duan, Lihua
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 149 - 155
  • [3] A Distributed Storage Model for Healthcare Big Data Designed on HBase
    Zhang, Lu
    Li, Qi
    Li, Ye
    Cai, Yunpeng
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4101 - 4105
  • [4] Application and Research of Massive Big Data Storage System Based on HBase
    Pan Zhengjun
    Zhao Lianfen
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 219 - 223
  • [5] Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services
    Chrimes, Dillon
    Zamani, Hamid
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2017, 2017
  • [7] On Performance Modeling and Prediction for Spark-HBase Applications in Big Data Systems
    AlQuwaiee, Haifa
    Wu, Chase
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3685 - 3690
  • [8] Financial Big Data Hot and Cold Separation Scheme Based on HBase and Redis
    Li, Kunhui
    Guo, Kun
    Guo, Hong
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1612 - 1617
  • [9] Feasibility Analysis of Big Log Data Real Time Search Based on Hbase and ElasticSearch
    Bai, Jun
    2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 1166 - 1170
  • [10] HBASE Performance Analysis in Big Datasets Processing
    Mladenova, Tsvetelina
    Kalmkov, Yordan
    Marinov, Milko
    Valova, Irena
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2021, 10 (03): : 1051 - 1057