Analyzing Patterns in Large-Scale Graphs Using MapReduce in Hadoop

被引:0
|
作者
Schultz, Joshua [1 ]
Vierya, Jonathan
Lu, Enyue [1 ]
机构
[1] Salisbury Univ, Dept Math & Comp Sci, Salisbury, CT USA
关键词
MapReduce; Cloud Computing; Graph Algorithms; Pattern Detection; Cohesive Components;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Analyzing patterns in large-scale graphs, such as social networks (e.g. Facebook, Linkedin, Twitter) has many applications including community identification, blog analysis, intrusion and spamming detections. Currently, it is impossible to process information in large-scale graphs with millions even billions of edges with a single computer. In this paper, we take advantage of MapReduce, a programming model for processing large datasets, to detect important graph patterns using open source Hadoop on Amazon EC2. The aim of this paper is to show how MapReduce cloud computing with the application of graph pattern detection scales on real world data. We implement Cohen's MapReduce graph algorithms to enumerate patterns including triangles, rectangles, trusses and barycentric clusters using real world data taken from Snap Stanford. In addition, we create a visualization algorithm to visualize the detected graph patterns. The performance of MapReduce graph algorithms has been discussed too.
引用
收藏
页码:1457 / +
页数:2
相关论文
共 50 条
  • [1] Analyzing Patterns in Large-Scale Graphs Using MapReduce in Hadoop
    Schultz, Joshua
    Vieyra, Jonathan
    Lu, Enyue
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1459 - 1459
  • [2] Hadoop-EDF: Large-scale Distributed Processing of Electrophysiological Signal Data in Hadoop MapReduce
    Wu, Yuanyuan
    Li, Xiaojin
    Liu, Jinze
    Cui, Licong
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2265 - 2271
  • [3] Design of Large-scale Content-based Recommender System using Hadoop MapReduce Framework
    Saravanan, S.
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 302 - 307
  • [4] Hadoop MapReduce Framework to Implement Molecular Docking of Large-Scale Virtual Screening
    Zhao, Jing
    Zhang, Ruisheng
    Zhao, Zhili
    Chen, Dianwei
    Hou, Lujie
    [J]. 2012 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC), 2012, : 350 - 353
  • [5] Parallel clustering over large-scale data stream based on grid density using Hadoop MapReduce
    Cai, Binlei
    Zhu, Shiwei
    Guo, Qin
    Yu, Junfeng
    [J]. ICIC Express Letters, 2013, 7 (11): : 3075 - 3081
  • [6] Optimization of hadoop cluster for analyzing large-scale sequence data in bioinformatics
    Toth, Adam
    Karimi, Ramin
    [J]. ANNALES MATHEMATICAE ET INFORMATICAE, 2019, 50 : 187 - 202
  • [7] Efficient large-scale data analysis using mapreduce
    [J]. Kubo, R, 1600, Nippon Telegraph and Telephone Corp. (10):
  • [8] Efficient Large-scale Trace Checking Using MapReduce
    Bersani, Marcello M.
    Bianculli, Domenico
    Ghezzi, Carlo
    Krstic, Srdan
    San Pietro, Pierluigi
    [J]. 2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 888 - 898
  • [9] Large-scale L-BFGS using MapReduce
    Chen, Weizhu
    Wang, Zhenghao
    Zhou, Jingren
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [10] Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
    Liu, Jun
    Liu, Feng
    Ansari, Nirwan
    [J]. IEEE NETWORK, 2014, 28 (04): : 32 - 39