Parallel Processing of Big Data using Power Iteration Clustering over MapReduce

Cited by: 2

Authors:

Jayalatchumy, D. [1]

Thambidurai, P. [1]

Alamelu, A. Vasumathi [1]

Affiliations:

[1] PKIET, CSE, Karaikal, India

Source:

2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014) | 2014

Keywords:

p-PIC; Hadoop; Fault tolerance; GBC

DOI:

10.1109/WCCCT.2014.16

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Subject classification code:

081202

Abstract:

Extracting useful information from datasets measured in gigabytes and terabytes is a real challenge for data miners. Clustering algorithms scale poorly when dealing with big data. The problem can be addressed with parallel algorithms that are executed, together with their input data, on high-performance computers. Graph-based applications in particular require considerable computation time. Power iteration clustering (PIC) is a simple, fast, and relatively scalable algorithm, but it requires the data and its associated matrix to fit in memory, which becomes infeasible for big data applications. Parallel PIC (p-PIC) improves scalability, and this paper focuses on exploring different parallelization strategies for minimizing communication cost. The algorithm runs on the MapReduce parallel framework. The paper implements p-PIC on Hadoop, a cloud platform for parallel storage and computation.
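The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a Gaussian similarity function, simulates the map and reduce steps sequentially in one process, and uses a simple largest-gap split as a stand-in for the k-means step normally applied to the PIC embedding. All function names and parameters (`gaussian_affinity`, `map_rows`, `reduce_vector`, `num_workers`, `sigma`, `tol`) are hypothetical.

```python
import math


def gaussian_affinity(points, sigma=1.0):
    """Dense affinity matrix with zero diagonal (assumed similarity function)."""
    n = len(points)
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                A[i][j] = math.exp(-d2 / (2 * sigma ** 2))
    return A


def map_rows(row_block, v):
    """Map step: one worker multiplies its block of normalized rows by v."""
    return [(i, sum(w * x for w, x in zip(row, v))) for i, row in row_block]


def reduce_vector(partials, n):
    """Reduce step: gather the partial products and L1-normalize the vector."""
    u = [0.0] * n
    for part in partials:
        for i, value in part:
            u[i] = value
    norm = sum(abs(x) for x in u)
    return [x / norm for x in u]


def power_iteration_embedding(A, num_workers=2, max_iter=50, tol=1e-5):
    """Truncated power iteration on the row-normalized affinity matrix."""
    n = len(A)
    degrees = [sum(row) for row in A]
    W = [[a / degrees[i] for a in A[i]] for i in range(n)]
    total = sum(degrees)
    v = [d / total for d in degrees]  # degree-based starting vector
    # Each hypothetical worker holds only its own rows of W.
    blocks = [[(i, W[i]) for i in range(k, n, num_workers)]
              for k in range(num_workers)]
    prev_delta = float("inf")
    for _ in range(max_iter):
        # In a real MapReduce job these map calls would run in parallel.
        partials = [map_rows(block, v) for block in blocks]
        new_v = reduce_vector(partials, n)
        delta = max(abs(a - b) for a, b in zip(new_v, v))
        v = new_v
        if abs(prev_delta - delta) < tol:  # stop when the change levels off
            break
        prev_delta = delta
    return v


def split_by_largest_gap(v):
    """Two-way split at the largest gap (stand-in for k-means with k = 2)."""
    order = sorted(range(len(v)), key=lambda i: v[i])
    gaps = [v[order[k + 1]] - v[order[k]] for k in range(len(order) - 1)]
    cut = gaps.index(max(gaps))
    labels = [0] * len(v)
    for i in order[cut + 1:]:
        labels[i] = 1
    return labels


# Toy data: two well-separated groups on a line.
points = [(0.0,), (1.0,), (2.0,), (10.0,), (10.5,), (11.0,), (11.5,)]
embedding = power_iteration_embedding(gaussian_affinity(points))
labels = split_by_largest_gap(embedding)
print(labels)
```

In this sketch only the current vector `v` has to be shared with every worker in each iteration, while the matrix rows stay partitioned. This is one way to picture why communication cost, rather than memory on a single machine, becomes the quantity to minimize.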

Pages: 176-178

Page count: 3

