MapReduce based Method for Big Data Semantic Clustering

Cited by: 10
Authors
Yang, Jie [1]
Li, Xiaoping [2]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, Publ Secur Bur Jiangsu Prov, Nanjing, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
Keywords
cloud computing; big data; MapReduce; ant colony; k-means
DOI
10.1109/SMC.2013.480
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Big data analysis is an active research topic in cloud computing environments. Automatically mapping heterogeneous data that share the same semantics is one of its key problems. This paper proposes a big data clustering method based on the MapReduce framework. The data are decomposed into many chunks that are clustered in parallel using an ant colony algorithm: ants move and cluster data elements according to the presented criterion. The proposed method is compared with a MapReduce-based k-means clustering algorithm on a large amount of practical data. Experimental results show that the proposed method is highly effective for big data clustering.
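
To make the comparison baseline concrete, the following is a minimal single-process sketch of one MapReduce-style k-means iteration over data chunks. It is not the authors' ant colony method, whose movement criterion is not given in the abstract. All function names (split_into_chunks, map_chunk, reduce_partials, kmeans_iteration) and the chunk_size parameter are hypothetical, and the map calls that a real cluster would run in parallel are executed sequentially here.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, ...]
Partial = Dict[int, Tuple[List[float], int]]  # centroid index -> (coordinate sums, count)


def split_into_chunks(data: Sequence[Point], chunk_size: int) -> List[Sequence[Point]]:
    """Decompose the data set into fixed-size chunks (hypothetical helper)."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]


def nearest(point: Point, centroids: Sequence[Point]) -> int:
    """Index of the centroid with the smallest squared Euclidean distance."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centroids[j])),
    )


def map_chunk(chunk: Sequence[Point], centroids: Sequence[Point]) -> Partial:
    """Map step: per-chunk coordinate sums and counts, keyed by nearest centroid."""
    partial: Partial = {}
    for point in chunk:
        j = nearest(point, centroids)
        sums, count = partial.get(j, ([0.0] * len(point), 0))
        partial[j] = ([a + b for a, b in zip(sums, point)], count + 1)
    return partial


def reduce_partials(partials: Sequence[Partial], centroids: Sequence[Point]) -> List[Point]:
    """Reduce step: merge the partial sums and recompute each centroid."""
    totals: Partial = {}
    for partial in partials:
        for j, (sums, count) in partial.items():
            t_sums, t_count = totals.get(j, ([0.0] * len(sums), 0))
            totals[j] = ([a + b for a, b in zip(t_sums, sums)], t_count + count)
    # A centroid that received no points keeps its previous position.
    return [
        tuple(x / totals[j][1] for x in totals[j][0]) if j in totals else centroids[j]
        for j in range(len(centroids))
    ]


def kmeans_iteration(data: Sequence[Point], centroids: Sequence[Point],
                     chunk_size: int) -> List[Point]:
    """One k-means iteration; the map calls are independent and could run in parallel."""
    partials = [map_chunk(chunk, centroids) for chunk in split_into_chunks(data, chunk_size)]
    return reduce_partials(partials, centroids)
```

Because each chunk emits only per-centroid sums and counts, the reduce step receives an amount of data proportional to the number of chunks and centroids rather than to the number of points, which is the usual motivation for this decomposition.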
Pages: 2814-2819
Page count: 6