Efficient Way of Searching Data in MapReduce Paradigm

被引：0

作者：

Shah, Gita

Annappa

Shet, K. C.

机构：

来源：

2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM) | 2014年

关键词：

Hadoop; indexing; jetty server; load balancing; MapReduce;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Cloud computing has emerged as an effective solution in the computing world. When the cloud is used for large amounts of data storage, searching for any required data takes lots of time. A,framework is required to distribute the work of searching and fetching from thousands of computers. The data in Hadoop Distributed File System is scattered and needs lots of time to retrieve. MapReduce function on data sets of key & value pair is the programming paradigm of large distributed operation. The proposed work aims to minimize the data retrieval time taken by the MapReduce program in the cloud. The major idea is to design a web server in the map phase using the jetty web server which shall give a fast and efficient way of searching data in MapReduce paradigm. For real time processing on Hadoop, a search mechanism is implemented in HDFS. The load balancer is used to balance the workload across servers to improve its availability,,performance and scalability.

引用

页码：305 / 310

页数：6

共 50 条

[31] Data declustering for efficient range and similarity searching
Prabhakar, S
Agrawal, D
El Abbadi, A
MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS III, 1998, 3527 : 419 - 430
[32] Efficient Snapshot KNN Join Processing for Large Data Using MapReduce
Hu, Yupeng
Yang, Chong
Ji, Cun
Xu, Yang
Li, Xueqing
2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 713 - 720
[33] A Generalized MapReduce Approach for Efficient mining of Large data Sets in the GRID
Roehm, Matthias
Grabert, Matthias
Schweiggert, Franz
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, GRIDS, AND VIRTUALIZATION (CLOUD COMPUTING 2010), 2010, : 14 - 19
[34] Efficient Alignment of Next Generation Sequencing Data Using MapReduce on the Cloud
AlSaad, Rawan
Malluhi, Qutaibah
Abouelhoda, Mohamed
2012 CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE (CIBEC), 2012, : 18 - 22
[35] CSRA: An Efficient Resource Allocation Algorithm in MapReduce Considering Data Skewness
Qi, Ling
Tang, Zhuo
Qin, Yunchuan
Ye, Yu
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 651 - 662
[36] Efficient Querying Distributed Big-XML Data using MapReduce
Song Kunfang
Hongwei Lu
INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (03) : 70 - 79
[37] Efficient MapReduce Kernel k-Means for Big Data Clustering
Tsapanos, Nikolaos
Tefas, Anastasios
Nikolaidis, Nikolaos
Pitas, Ioannis
9TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2016), 2016,
[38] Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce
Zhang, Yanfeng
Chen, Shimin
Yu, Ge
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3218 - 3230
[39] Utilizing the Buckshot Algorithm for Efficient Big Data Clustering in the MapReduce Model
Gerakidis, Sergios
Mamalis, Basilis
PROCEEDINGS OF THE 23RD PAN-HELLENIC CONFERENCE OF INFORMATICS (PCI 2019), 2019, : 112 - 117
[40] Efficient finer-grained incremental processing with MapReduce for big data
Zhang, Liang
Feng, Yuanyuan
Shen, Peiyi
Zhu, Guangming
Wei, Wei
Song, Juan
Shah, Syed Afaq Ali
Bennamoun, Mohammed
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 80 : 102 - 111

← 1 2 3 4 5 →