Efficient Way of Searching Data in MapReduce Paradigm

被引：0

作者：

Shah, Gita

Annappa

Shet, K. C.

机构：

来源：

2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM) | 2014年

关键词：

Hadoop; indexing; jetty server; load balancing; MapReduce;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Cloud computing has emerged as an effective solution in the computing world. When the cloud is used for large amounts of data storage, searching for any required data takes lots of time. A,framework is required to distribute the work of searching and fetching from thousands of computers. The data in Hadoop Distributed File System is scattered and needs lots of time to retrieve. MapReduce function on data sets of key & value pair is the programming paradigm of large distributed operation. The proposed work aims to minimize the data retrieval time taken by the MapReduce program in the cloud. The major idea is to design a web server in the map phase using the jetty web server which shall give a fast and efficient way of searching data in MapReduce paradigm. For real time processing on Hadoop, a search mechanism is implemented in HDFS. The load balancer is used to balance the workload across servers to improve its availability,,performance and scalability.

引用

页码：305 / 310

页数：6

共 50 条

[21] Efficient data distribution and results merging for parallel data clustering in mapreduce environment
Bousbaci, Abdelhak
Kamel, Nadjet
APPLIED INTELLIGENCE, 2018, 48 (08) : 2408 - 2428
[22] Efficient Data Preprocessing for Genetic-Fuzzy Mining with MapReduce
Hong, Tzung-Pei
Liu, Yu-Yang
Wu, Min-Thai
Tsai, Chun-Wei
2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 88 - 89
[23] Towards Efficient Big Data Storage With MapReduce Deduplication System
Joe, Vijesh
Raj, Jennifer S.
Smys, S.
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2021, 16 (02) : 45 - 57
[24] Efficient Results Merging for Parallel Data Clustering Using MapReduce
Bousbaci, Abdelhak
Kamel, Nadjet
DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, (DCAI 2016), 2016, 474 : 349 - 357
[25] An efficient MapReduce scheduling scheme for processing large multimedia data
Kyoungsoo Bok
Jaemin Hwang
Jongtae Lim
Yeonwoo Kim
Jaesoo Yoo
Multimedia Tools and Applications, 2017, 76 : 17273 - 17296
[26] Scalable Query Optimization for Efficient Data Processing using MapReduce
Shan, Yi
Chen, Yi
2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 649 - 652
[27] An enhanced and efficient clustering algorithm for large data using MapReduce
Li, Hongbiao
Liu, Ruiying
Wang, Jingdong
Wu, Qilong
IAENG International Journal of Computer Science, 2019, 46 (01)
[28] Efficient data distribution and results merging for parallel data clustering in mapreduce environment
Abdelhak Bousbaci
Nadjet Kamel
Applied Intelligence, 2018, 48 : 2408 - 2428
[29] Efficient Range Searching for Categorical and Plain Data
Nekrich, Yakov
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2014, 39 (01):
[30] Efficient Indexing and Searching Framework for Unstructured Data
Aye, Kyar Nyo
Thein, Ni Lar
FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349

← 1 2 3 4 5 →