A MapReduce-Based Algorithm for Parallelizing Collusion Detection in Hadoop

Authors
Mortazavi, Mahmood [1 ]
Ladani, Behrouz Tork [2 ]
Affiliations
[1] Sheihkbahaee Univ, Dept Software Engn & Informat Technol, Esfahan, Iran
[2] Isfahan Univ, Fac Software Engn, Esfahan, Iran
Keywords
Collusion detection; MapReduce; Hadoop
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
MapReduce, a programming model for parallel data processing, is used in many open systems, such as cloud computing and service-oriented computing. Collusive behavior by worker entities in the MapReduce model can compromise the integrity of open systems. This paper proposes a MapReduce-based algorithm for detecting collusion among malicious workers in parallel. The algorithm uses a voting matrix represented as a list of the voting values of different workers. It is designed and implemented in three phases: majority selection, correlation counting, and correlation computing. Preliminary results show a speedup of 1.8 and an efficiency of about 70% on a data set containing 2000 workers' votes.
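The abstract names the three phases but does not define them, so the following is only an illustrative single-process sketch, not the authors' algorithm. It assumes the voting matrix is stored as a list of hypothetical `(task, worker, value)` records, that "correlation counting" tallies how often two workers jointly deviate from the majority with the same value, and that "correlation computing" divides that tally by the number of tasks the pair shares. All function names and the scoring rule are assumptions made for illustration.

```python
from collections import Counter, defaultdict
from itertools import combinations


def majority_selection(votes):
    """Phase 1 (assumed): group votes by task, then pick the most common value."""
    by_task = defaultdict(list)
    for task, worker, value in votes:          # map: key each vote by its task
        by_task[task].append((worker, value))
    majority = {                               # reduce: most common value per task
        task: Counter(v for _, v in ws).most_common(1)[0][0]
        for task, ws in by_task.items()
    }
    return by_task, majority


def correlation_counting(by_task, majority):
    """Phase 2 (assumed): per worker pair, count shared tasks and joint deviations."""
    shared, joint_dev = Counter(), Counter()
    for task, ws in by_task.items():
        for (w1, v1), (w2, v2) in combinations(sorted(ws), 2):
            shared[(w1, w2)] += 1
            # Both workers gave the same answer and it differs from the majority.
            if v1 == v2 and v1 != majority[task]:
                joint_dev[(w1, w2)] += 1
    return shared, joint_dev


def correlation_computing(shared, joint_dev):
    """Phase 3 (assumed): fraction of shared tasks on which a pair jointly deviated."""
    return {pair: joint_dev[pair] / n for pair, n in shared.items()}


def detect(votes):
    by_task, majority = majority_selection(votes)
    shared, joint_dev = correlation_counting(by_task, majority)
    return correlation_computing(shared, joint_dev)


# Toy data: w4 and w5 vote "B" together on t1 and t2; everyone else votes "A".
votes = [(t, w, "A") for t in ("t1", "t2", "t3") for w in ("w1", "w2", "w3")]
votes += [(t, w, "B") for t in ("t1", "t2") for w in ("w4", "w5")]
votes += [("t3", "w4", "A"), ("t3", "w5", "A")]

scores = detect(votes)
suspects = [pair for pair, s in scores.items() if s > 0.5]  # 0.5 is an arbitrary cut-off
print(suspects)
```

In a Hadoop setting each phase would presumably run as its own map and reduce job keyed by task or by worker pair; the in-memory dictionaries here only stand in for that shuffle step.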
Pages: 5