Efficient Way of Searching Data in MapReduce Paradigm

Cited by: 0
Authors
Shah, Gita
Annappa
Shet, K. C.
Affiliations
Keywords
Hadoop; indexing; Jetty server; load balancing; MapReduce
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Cloud computing has emerged as an effective solution in the computing world. When the cloud is used to store large amounts of data, searching for any required data takes a long time. A framework is required to distribute the work of searching and fetching across thousands of computers. Data in the Hadoop Distributed File System (HDFS) is scattered and takes a long time to retrieve. MapReduce, which operates on data sets of key-value pairs, is the programming paradigm for large distributed operations. The proposed work aims to minimize the data retrieval time taken by a MapReduce program in the cloud. The main idea is to build a web server into the map phase using the Jetty web server, which gives a fast and efficient way of searching data in the MapReduce paradigm. For real-time processing on Hadoop, a search mechanism is implemented in HDFS. A load balancer distributes the workload across servers to improve availability, performance, and scalability.
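Illustrative sketch (not from the paper): the abstract describes searching key-value data in the map phase and spreading requests across servers with a load balancer. The Python below is a minimal in-memory illustration of those two ideas only. It does not use Hadoop or Jetty, and every name in it (`map_search`, `reduce_matches`, `search`, `RoundRobinBalancer`) is hypothetical. The round-robin policy is an assumption, since the abstract does not say which balancing strategy the authors use.

```python
from collections import defaultdict
from itertools import cycle


def map_search(block_id, text, query):
    """Map step: emit (query, (block_id, line_no)) for every matching line."""
    for line_no, line in enumerate(text.splitlines(), start=1):
        if query in line:
            yield query, (block_id, line_no)


def reduce_matches(pairs):
    """Reduce step: group the emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def search(blocks, query):
    """Run the map step over every block, then reduce the results.

    `blocks` stands in for data scattered across storage nodes
    (a hypothetical dict of block_id -> text, not an HDFS API).
    """
    pairs = []
    for block_id, text in blocks.items():
        pairs.extend(map_search(block_id, text, query))
    return reduce_matches(pairs)


class RoundRobinBalancer:
    """Assumed policy: hand out servers in a fixed rotating order."""

    def __init__(self, servers):
        self._servers = cycle(servers)

    def pick(self):
        return next(self._servers)
```

In this sketch each block is scanned independently, which is what lets the map step run in parallel on separate machines in a real cluster; the balancer only decides which server receives the next search request.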
Pages: 305-310
Page count: 6