Efficient Random Data Accessing in MapReduce

被引:0
|
作者
Mittal, Mamta [1 ]
Singh, Hari [2 ]
Paliwal, K. K. [2 ]
Goyal, Lalit Mohan [3 ]
机构
[1] GB Pant Govt Engn Coll, New Delhi, India
[2] Panipat Inst Engn & Technol, Panipat, Haryana, India
[3] Bharati Vidyapeeths Coll Engn, New Delhi, India
关键词
Hadoop; MapReduce; HDFS; B-Tree; Index; FRAMEWORK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The voluminous data can not be handled using traditional serial programming methods. It needs to be dealt effectively using parallel programming methods in a distributed environment. Emerging technologies for parallel processing has been changing the concept of programming, storage and operating system in distributed environment. Grid Computing and MapReduce technologies have been proven very handy in processing huge volume of simple and multi-dimensional data. The MapReduce in the Hadoop provides an abstract environment for parallel processing of jobs. The framework is well-known for its data analysis capability. However, it is efficient for sequential read and writes. It does not show good performance for random read and writes. It is so because the Hadoop is based on the key-value storage concept and does not work on any indexed dataset. A lot of work has been done to improve the performance of the Hadoop. Indexing input dataset in the Hadoop is one such area. In this paper, a B-Tree index construction process in traditional programming environment is described and a conceptual idea of realizing it in the MapReduce-Hadoop is presented.
引用
收藏
页码:552 / 556
页数:5
相关论文
共 50 条
  • [31] Efran (Ω): "Efficient Scalar Homomorphic Scheme on MapReduce for Data Privacy Preserving"
    Martin, Konan
    Wang, Wenyong
    Agyemang, Brighter
    2016 IEEE 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (CSCLOUD), 2016, : 66 - 74
  • [32] MapReduce++ - Efficient processing of MapReduce jobs in the cloud
    Zhang, Guigang
    Li, Chao
    Zhang, Yong
    Xing, Chunxiao
    Yang, Jijiang
    Journal of Computational Information Systems, 2012, 8 (14): : 5757 - 5764
  • [33] An efficient mapping schema for storing and accessing XML data in relational databases
    Wu, Jun
    Huang, Shang-Yi
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2009, 5 (03) : 327 - +
  • [34] Spreading Fuzzy Random Forests with MapReduce
    Bechini, Alessio
    De Matteis, Adriano Donato
    Marcelloni, Francesco
    Segatori, Armando
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 2641 - 2646
  • [35] Dynamic energy efficient data placement and cluster reconfiguration algorithm for MapReduce framework
    Maheshwari, Nitesh
    Nanduri, Radheshyam
    Varma, Vasudeva
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (01): : 119 - 127
  • [36] A New Data Layout Scheme for Energy-Efficient MapReduce Processing Tasks
    Tran, Xuan T.
    Tien Van Do
    Rotter, Csaba
    Hwang, Dosam
    JOURNAL OF GRID COMPUTING, 2018, 16 (02) : 285 - 298
  • [37] LiPS: A Cost-Efficient Data and Task Co-Scheduler for MapReduce
    Ehsan, Moussa
    Chen, Yao
    Kang, Hui
    Sion, Radu
    Wong, Jennifer
    2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 49 - 58
  • [38] Efficient Mining of Frequent itemsets in Social Network Data based on MapReduce Framework
    Farzanyar, Zahra
    Cercone, Nick
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 1183 - 1188
  • [39] Architecture of Efficient Word Processing using Hadoop MapReduce for Big Data Applications
    Mandal, Bichitra
    Sahoo, Ramesh Kumar
    Sethi, Srinivas
    PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON MAN AND MACHINE INTERFACING (MAMI), 2015,
  • [40] A New Data Layout Scheme for Energy-Efficient MapReduce Processing Tasks
    Xuan T. Tran
    Tien Van Do
    Csaba Rotter
    Dosam Hwang
    Journal of Grid Computing, 2018, 16 : 285 - 298