A MapReduce-based approach to social network big data mining

被引:1
|
作者
Qi, Fuli [1 ]
机构
[1] Shanghai Zhongqiao Vocat & Tech Univ, Sch Informat Engn, Shanghai 201514, Peoples R China
关键词
Social network; big data; MapReduce; parallel K-means clustering algorithm; Weibo topic; ALGORITHM;
D O I
10.3233/JCM-226903
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The rapid development of social networks has facilitated the convenience of users to receive information. As a network communication platform for people's daily use, microblog has countless information data. In view of the low efficiency and poor clustering effect of K-means algorithm, a parallel K-means clustering algorithm based on MapReduce model is studied; In order to alleviate the difficulty in calculating the similarity of microblog topic text, the space vector model and semantic similarity are used to calculate the similarity between texts to improve the quality of microblog text classification. The data expansion rate of corresponding nodes under different data sets shows that the average expansion rate of the parallel K-means algorithm reaches 0.89, and the running rate is the highest. The results show that the parallel K-means algorithm has good clustering stability and the highest clustering quality, reaching 1.24; The clustering time of the algorithm is the shortest, the average clustering time is 1.27 minutes, and the clustering effect and efficiency of the algorithm are the best. In the quality analysis of Weibo topic recommendation, the accuracy of P-K-means recommendation is 95.64%, user satisfaction is 98.64%, and the recommendation effect is also the best. It shows that the research on the parallel K-means clustering algorithm based on MapReduce has the best performance in microblogging topic mining and recommendation, which can efficiently recommend topics of interest to users and enhance users' microblogging experience.
引用
收藏
页码:2535 / 2547
页数:13
相关论文
共 50 条
  • [21] Technological Surveillance in Big Data Environments by using a MapReduce-based Method
    Pascal Filho, Daniel San Martin
    Jeronimo de Macedo, Douglas Dyllon
    Dutra, Moises Lima
    [J]. MOBILE NETWORKS & APPLICATIONS, 2022, 27 (05): : 1931 - 1940
  • [22] MapReduce-based Parallelized Approximation of Frequent Itemsets Mining in Uncertain Data
    Xu, Jing
    Mao, Xiao-Jiao
    Lu, Wen-Yang
    Zhu, Qi-Hai
    Li, Ning
    Yang, Yu-Bin
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 136 - 144
  • [23] Research on MapReduce-based fuzzy associative classifier for big probabilistic numerical data
    Pei, Bin
    Wang, Fenmei
    Wang, Xiuzhen
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2016, : 903 - 906
  • [24] MapReduce-based Data Processing on IoT
    Satoh, Ichiro
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE (ITHINGS) - 2014 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) - 2014 IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL-SOCIAL COMPUTING (CPS), 2014, : 161 - 168
  • [25] MrFIM: A MapReduce Approach for Frequent Itemset Mining in Big Data
    Rahman, Abdul
    Manjaramkar, Arati
    [J]. 2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [26] MapReduce-based big data classification model using feature subset selection and hyperparameter tuned deep belief network
    Rajendran, Surendran
    Khalaf, Osamah Ibrahim
    Alotaibi, Youseef
    Alghamdi, Saleh
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [27] Efficient Mining of Frequent itemsets in Social Network Data based on MapReduce Framework
    Farzanyar, Zahra
    Cercone, Nick
    [J]. 2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 1183 - 1188
  • [28] Scaling up MapReduce-based Big Data Processing on Multi-GPU systems
    Jiang, Hai
    Chen, Yi
    Qiao, Zhi
    Weng, Tien-Hsiung
    Li, Kuan-Ching
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (01): : 369 - 383
  • [29] Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
    Sardar T.H.
    Ansari Z.
    [J]. Journal of The Institution of Engineers (India): Series B, 2022, 103 (01) : 73 - 82
  • [30] Community structure mining in big data social media networks with MapReduce
    Jin, Songchang
    Lin, Wangqun
    Yin, Hong
    Yang, Shuqiang
    Li, Aiping
    Deng, Bo
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (03): : 999 - 1010