Efficient Way of Searching Data in MapReduce Paradigm

Authors
Shah, Gita
Annappa
Shet, K. C.
Affiliations
Keywords
Hadoop; indexing; jetty server; load balancing; MapReduce
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Cloud computing has emerged as an effective solution in the computing world. When the cloud is used to store large amounts of data, searching for any required data takes a long time. A framework is required to distribute the work of searching and fetching across thousands of computers. The data in the Hadoop Distributed File System (HDFS) is scattered and takes a long time to retrieve. MapReduce, which operates on data sets of key-value pairs, is the programming paradigm for large distributed operations. The proposed work aims to minimize the data retrieval time taken by a MapReduce program in the cloud. The main idea is to design a web server in the map phase, using the Jetty web server, to provide a fast and efficient way of searching data in the MapReduce paradigm. For real-time processing on Hadoop, a search mechanism is implemented in HDFS. A load balancer is used to balance the workload across servers to improve their availability, performance, and scalability.
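The abstract gives no implementation details, so the following is only a minimal in-memory sketch of the general ideas it names: a map phase that emits key-value pairs, a reduce phase that groups them into a searchable index, and a load balancer that spreads search requests across servers. It is not the authors' method and uses neither Hadoop nor Jetty. All names here (`map_phase`, `reduce_phase`, `RoundRobinBalancer`, `node-a`, `node-b`, the sample documents) are hypothetical, and round-robin is an assumed balancing policy chosen for simplicity.

```python
from collections import defaultdict
from itertools import cycle


def map_phase(doc_id, text):
    """Emit one (term, doc_id) pair per distinct word in a record."""
    return [(term, doc_id) for term in sorted(set(text.lower().split()))]


def reduce_phase(pairs):
    """Group document ids by term, producing an inverted index."""
    index = defaultdict(set)
    for term, doc_id in pairs:
        index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def build_index(documents):
    """Run the map phase over every document, then reduce the output."""
    pairs = []
    for doc_id, text in documents.items():
        pairs.extend(map_phase(doc_id, text))
    return reduce_phase(pairs)


class RoundRobinBalancer:
    """Hypothetical balancer: hands out servers in a fixed rotation."""

    def __init__(self, servers):
        self._servers = cycle(servers)

    def next_server(self):
        return next(self._servers)


# Hypothetical sample data and server names, for illustration only.
documents = {
    "d1": "hadoop stores data in hdfs",
    "d2": "mapreduce processes data in parallel",
}
index = build_index(documents)
balancer = RoundRobinBalancer(["node-a", "node-b"])


def search(term):
    """Pick a server for the request and look the term up in the index."""
    server = balancer.next_server()
    return server, index.get(term.lower(), [])
```

Calling `search("data")` returns the chosen server name together with the ids of the documents containing that term. Building the index once and answering lookups from it avoids rescanning every record for each query, which is the kind of retrieval-time saving the abstract aims for.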
Pages: 305-310
Page count: 6