The Dawn of Big Data - Hbase

被引:0
|
作者
Bhupathiraju, Vijayalakshmi [1 ]
Ravuri, Ravi Prasad [1 ]
机构
[1] Padmasri Dr BV Raju Inst Tehnol, Dept MCA, Hyderabad, Andhra Pradesh, India
关键词
HBase; Hadoop Distributed File System (HDFS); HBase column oriented table;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
HBase is a distributed column-oriented database built on top of HDFS. HBase is the Hadoop application to use when you require real-time read/write random access to very large datasets. HBase is a scalable data store targeted at random read and write access of (fairly-) structured data. It's modeled after Google's Big table and targeted to support large tables, on the order of billions of rows and millions of columns. It uses HDFS as the underlying file system and is designed to be fully distributed and highly available. Version 0.20 introduces significant performance improvement. Base's Table Input Format is designed to allow a Map Reduce program to operate on data stored in an HBase table. Table Output Format is for writing Map Reduce outputs into an HBase table. HBase has different storage characteristics than HDFS, such as the ability to do row updates and column indexing, so we can expect to see these features used by Hive in future releases. It is already possible to access HBase tables from Hive. This paper includes the step by step introduction to the HBase, Identify differences between apache HBase and a traditional RDBMS, The Problem with Relational Database Systems, Relation between the Hadoop and HBase, How an Apache HBase table is physically stored on disk. Later part of this paper introduces Map Reduce, HBase table and how Apache HBase Cells stores data, what happens to data when it is deleted. Last part explains difference between Big Data and HBase, Conclusion followed with the References.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] HGrid: A Data Model for Large Geospatial Data Sets in HBase
    Han, Dan
    Stroulia, Eleni
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 910 - 917
  • [22] Geospatial data storage based on HBase and MapReduce
    Gao, Fan
    Yue, Peng
    Wu, Zhaoyan
    Zhang, Mingda
    2017 6TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, 2017, : 55 - 58
  • [23] DISTRIBUTED STORAGE OF NETWORK MEASUREMENT DATA ON HBASE
    Ding, Haijie
    Jin, Yuehui
    Cui, Yidong
    Yang, Tan
    2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 716 - 720
  • [24] An agricultural data storage mechanism based on HBase
    Li C.
    Zhang Q.
    He P.
    Wang Z.
    Chen L.
    International Journal of Information and Communication Technology, 2019, 14 (04) : 456 - 469
  • [25] Temporal RDF(S) data storage and query with HBase
    Yan L.
    Zhang Z.
    Yang D.
    Journal of Computing and Information Technology, 2019, 27 (04): : 17 - 30
  • [26] The Dynamic Index Design of Network Management Data on HBase
    Yang, J. D.
    Ren, C. S.
    Mao, W.
    Zhang, D. Y.
    INTERNATIONAL CONFERENCE ON ADVANCED MANAGEMENT SCIENCE AND INFORMATION ENGINEERING (AMSIE 2015), 2015, : 169 - 175
  • [27] MSDB: A Massive Sensor Data Processing Middleware for HBase
    Liu, Bowei
    Huang, Ruizhang
    Huang, Ting
    Yan, Yingying
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 450 - 456
  • [28] Hadoop-HBase for Large-Scale Data
    Vora, Mehul Nalin
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 601 - 605
  • [29] HBaseSpatial: a Scalable Spatial Data Storage Based on HBase
    Zhang, Ningyu
    Zheng, Guozhou
    Chen, Huajun
    Chen, Jiaoyan
    Chen, Xi
    2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 644 - 651
  • [30] A novel HBase data storage in wireless sensor networks
    Xiang Li
    Zhuo Li
    Xirong Ma
    Can Liu
    EURASIP Journal on Wireless Communications and Networking, 2017