Efficient Way of Searching Data in MapReduce Paradigm

Cited by: 0
Authors
Shah, Gita
Annappa
Shet, K. C.
Keywords
Hadoop; indexing; Jetty server; load balancing; MapReduce
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Cloud computing has emerged as an effective solution in the computing world. When the cloud is used to store large amounts of data, searching for any required data takes a long time. A framework is required to distribute the work of searching and fetching across thousands of computers. The data in the Hadoop Distributed File System (HDFS) is scattered and takes a long time to retrieve. MapReduce, which operates on data sets of key-value pairs, is the programming paradigm for large distributed operations. The proposed work aims to minimize the data retrieval time taken by a MapReduce program in the cloud. The main idea is to run a web server in the map phase, built on the Jetty web server, to provide a fast and efficient way of searching data in the MapReduce paradigm. For real-time processing on Hadoop, a search mechanism is implemented in HDFS. A load balancer distributes the workload across servers to improve their availability, performance, and scalability.
Pages: 305-310
Number of pages: 6